Adaptive redundant speech streaming with scalable speech coding based on speech quality estimation

Jin Ah Kang, Hong Kook Kim, Seung Ho Choi, Sang Ryong Kim

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

This paper proposes an adaptive redundant speech transmission (ARST) technique to improve the perceived speech quality (PSQ) of a speech streaming system over an IP network. To this end, the proposed technique estimates the PSQ from the received speech data under the current network condition. After that, it decides a suitable ARST mode in terms of the amount and bit rate of the redundant speech data (RSD) that can be used in a speech decoder to reconstruct lost speech signals for higher packet loss rates. On the basis of this decision, the proposed technique generates bitstreams of the current speech data (CSD) and the RSD using a scalable speech encoder to maintain the same transmission bandwidth. The effectiveness of the proposed technique is then demonstrated using ITU-T Recommendations G.729.1 and P.563 as a scalable speech coder and a PSQ estimator, respectively. It is shown from the experimental results that a speech streaming system employing the proposed ARST technique significantly improves the PSQ under various packet loss conditions.

Original languageEnglish
Pages (from-to)1921-1931
Number of pages11
JournalInformation (Japan)
Volume17
Issue number5
StatePublished - May 2014

Keywords

  • ITU-T Recommendation G.729.1
  • Perceived speech quality
  • Redundant speech transmission
  • Scalable speech coder
  • Speech quality estimation

Fingerprint

Dive into the research topics of 'Adaptive redundant speech streaming with scalable speech coding based on speech quality estimation'. Together they form a unique fingerprint.

Cite this