Dynamic cepstral representations based on order-dependent windowing methods

Hong Kook Kim, Seung Ho Choi, Hwang Soo Lee

Research output: Contribution to journalArticlepeer-review

Abstract

In this paper, we propose dynamic cepstral representations to effectively capture the temporal information of cepstral coefficients. The number of speech frames for the regression analysis to extract a dynamic cepstral coefficient is inversely proportional to the cepstral order since the cepstral coefficients of higher orders are more fluctuating than those of lower orders. By exploiting the relationship between the window length for extracting a dynamic cepstral coefficient and the statistical variance of the cepstral coefficient, we propose three kinds of windowing methods in this work: an utterance-specific variance-ratio windowing method, a statistical variance-ratio windowing method, and an inverse-lifter windowing method. Intra-speaker, inter-speaker, and speaker-independent recognition tests on 100 phonetically balanced words are carried out to evaluate the performance of the proposed order-dependent windowing methods.

Original languageEnglish
Pages (from-to)434-440
Number of pages7
JournalIEICE Transactions on Information and Systems
VolumeE81-D
Issue number5
StatePublished - 1998

Keywords

  • Cepstrum
  • Dynamic cepstrum
  • Order-dependent windowing
  • Speech recognition

Fingerprint

Dive into the research topics of 'Dynamic cepstral representations based on order-dependent windowing methods'. Together they form a unique fingerprint.

Cite this