Quality-aware loss-robust scalable speech streaming based on speech quality estimation

Jin Ah Kang, Seung Ho Choi, Hong Kook Kim

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-review

Abstract

This paper proposes a quality-aware loss-robust scalable speech streaming (QLSSS) method to improve the perceived speech quality (PSQ) of a scalable wideband speech streaming (SWSS) system over IP networks. To this end, the proposed method estimates the PSQ and the packet loss rate (PLR) from the received speech data. It then decides the amount of redundant speech data (RSD) that a speech decoder can use to reconstruct lost speech signals at high PLRs. Based on this decision, the proposed method optimizes the scalable speech coding mode for the current speech data (CSD) and RSD bitstreams, so that speech quality is not degraded under the estimated packet loss condition while the transmission bandwidth is maintained. The effectiveness of the proposed method is demonstrated using ITU-T Recommendations G.729.1 and P.563 as the scalable wideband speech codec and the PSQ estimator, respectively. The experiments show that an SWSS system employing the proposed QLSSS method significantly improves speech quality under packet loss conditions.
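
The decision step described in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: the function names, the 32 kbit/s budget, the PLR and PSQ thresholds, and the half-and-half split rule are all assumptions made for illustration. Only the list of G.729.1 bit rates (8 to 32 kbit/s) comes from the codec itself. The PSQ value is assumed to be a MOS-like score such as one produced by a P.563 estimator, which is not implemented here.

```python
# G.729.1 bit rates in kbit/s: 8 (core), 12, then 2 kbit/s steps up to 32.
G7291_RATES_KBPS = [8, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32]


def estimate_plr(received_flags):
    """Return the fraction of lost packets in a window.

    received_flags holds True for every packet that arrived and
    False for every packet that was lost.
    """
    if not received_flags:
        return 0.0
    lost = sum(1 for ok in received_flags if not ok)
    return lost / len(received_flags)


def choose_modes(plr, psq, budget_kbps=32, plr_threshold=0.05, psq_threshold=3.5):
    """Return (csd_kbps, rsd_kbps) such that csd + rsd <= budget_kbps.

    The thresholds and the split rule are hypothetical placeholders,
    not values taken from the paper.
    """
    if budget_kbps < 16:
        raise ValueError("budget must fit two 8 kbit/s core streams")

    if plr < plr_threshold and psq >= psq_threshold:
        # Network looks healthy: spend the whole budget on current speech data.
        csd = max(r for r in G7291_RATES_KBPS if r <= budget_kbps)
        return csd, 0

    # Loss or low quality detected: give up to half the budget to redundancy,
    # then fit the current speech data into whatever remains.
    rsd = max(r for r in G7291_RATES_KBPS if r <= budget_kbps // 2)
    csd = max(r for r in G7291_RATES_KBPS if r <= budget_kbps - rsd)
    return csd, rsd
```

For example, under these assumed thresholds, `choose_modes(estimate_plr([True, True, False, True]), 3.0)` splits a 32 kbit/s budget into two 16 kbit/s streams, so the total transmission bandwidth stays the same as in the loss-free case.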

Original language: English
Title of host publication: Communication and Networking - International Conference, FGCN 2011, Held as Part of the Future Generation Information Technology Conference, FGIT 2011, in Conjunction with GDC 2011, Proceedings
Pages: 132-142
Number of pages: 11
Edition: PART 2
State: Published - 2011
Event: 2011 International Conference on Future Generation Communication and Networking, FGCN 2011, Held as Part of the 3rd International Mega-Conference on Future-Generation Information Technology, FGIT 2011, in Conjunction with GDC 2011 - Jeju Island, Korea, Republic of
Duration: 8 Dec 2011 to 10 Dec 2011

Publication series

Name: Communications in Computer and Information Science
Number: PART 2
Volume: 266 CCIS
ISSN (Print): 1865-0929

Conference

Conference: 2011 International Conference on Future Generation Communication and Networking, FGCN 2011, Held as Part of the 3rd International Mega-Conference on Future-Generation Information Technology, FGIT 2011, in Conjunction with GDC 2011
Country/Territory: Korea, Republic of
City: Jeju Island
Period: 8/12/11 to 10/12/11

Keywords

  • ITU-T G.729.1
  • ITU-T P.563
  • packet loss
  • perceived speech quality
  • redundant speech transmission
  • Scalable wideband speech streaming
