Speech recognition using quantized LSP parameters and their transformations in digital communication

Seung Ho Choi, Hong Kook Kim, Hwang Soo Lee

Research output: Contribution to journal › Article › peer-review

19 Scopus citations

Abstract

In digital communication networks, speech recognition systems conventionally first reconstruct speech and then extract feature parameters. In this paper, we instead consider incorporating speech coding parameters directly into the speech recognizer. Most speech coders employed in digital communication networks use line spectrum pairs (LSPs) as spectral parameters. We introduce two ways to improve the recognition performance of the LSP-based speech recognizer. One is to devise weighted distance measures of LSPs, and the other is to transform LSPs into a new feature set, named pseudo-cepstrum (PCEP). Speaker-independent connected-digit recognition experiments based on the discrete hidden Markov model showed that the weighted distance measures provide better recognition accuracy than unweighted ones. Additionally, a mel-scale PCEP performs even better than the weighted distance measures. To verify the performance improvement of the proposed methods, a significance test is introduced. The proposed methods achieved higher recognition accuracy than the conventional methods employing mel-frequency cepstral coefficients.
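As a rough illustration of what a weighted distance measure over LSP vectors can look like, the sketch below compares two LSP vectors with and without per-coefficient weights. The abstract does not state which weights the authors use, so the inverse-gap weighting here is an assumption chosen for illustration (it emphasizes LSPs that lie close to a neighbour, which tend to mark spectral peaks). The function names `inverse_gap_weights` and `weighted_lsp_distance` are hypothetical.

```python
import math


def inverse_gap_weights(lsp):
    """Illustrative weights (an assumption, not the paper's definition).

    Each LSP frequency (radians, strictly increasing, inside (0, pi)) gets
    the sum of the reciprocal gaps to its two neighbours, so closely spaced
    LSPs receive larger weights.
    """
    # Pad with the band edges so the first and last LSP also have two neighbours.
    padded = [0.0] + list(lsp) + [math.pi]
    weights = []
    for i in range(1, len(padded) - 1):
        left_gap = padded[i] - padded[i - 1]
        right_gap = padded[i + 1] - padded[i]
        weights.append(1.0 / left_gap + 1.0 / right_gap)
    return weights


def weighted_lsp_distance(x, y, weights=None):
    """Weighted squared Euclidean distance between two LSP vectors.

    With weights=None every coefficient counts equally (unweighted case).
    """
    if len(x) != len(y):
        raise ValueError("LSP vectors must have the same order")
    if weights is None:
        weights = [1.0] * len(x)
    return sum(w * (a - b) ** 2 for w, a, b in zip(weights, x, y))


# Toy 4th-order example (made-up values, not data from the paper).
reference = [0.30, 0.50, 1.20, 2.00]
shift_near_peak = [0.32, 0.50, 1.20, 2.00]  # perturb an LSP with a close neighbour
shift_in_valley = [0.30, 0.50, 1.22, 2.00]  # perturb an isolated LSP

w = inverse_gap_weights(reference)
d_peak = weighted_lsp_distance(reference, shift_near_peak, w)
d_valley = weighted_lsp_distance(reference, shift_in_valley, w)
```

In this toy case both perturbations have the same unweighted distance, while the weighted measure penalizes the shift next to the closely spaced pair more heavily. Whether that matches the behaviour of the paper's measures cannot be determined from the abstract alone.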

Original language: English
Pages (from-to): 223-233
Number of pages: 11
Journal: Speech Communication
Volume: 30
Issue number: 4
State: Published - Apr 2000
