Abstract
This paper presents acoustic echo cancellation techniques for far-end (server-side) telephony speech recognition in barge-in situations. We develop a normalized least mean square (NLMS) algorithm for the adaptive filter of the echo canceller and a double-talk detector for online speech recognition services. In particular, we devise a voice activity detector for estimating the initial delay introduced by the communication network. In addition, we propose a hybrid method that uses both the log-spectral distance measure and the cross-correlation coefficients to estimate the initial delay. Based on simulations and experiments in real environments, we conclude that the developed techniques can be used successfully for far-end telephony speech recognition services.
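For readers unfamiliar with the normalized least mean square adaptive filter named in the abstract, the following is a minimal textbook sketch and not the authors' implementation. The function name `nlms_echo_cancel`, the parameters `num_taps`, `mu`, and `eps`, and the assumption that the far-end reference and the microphone signal are already time-aligned (that is, the initial network delay has been removed) are all illustrative choices that do not come from the paper. The sketch also omits double-talk detection, so the filter adapts on every sample.

```python
import numpy as np


def nlms_echo_cancel(far_end, mic, num_taps=64, mu=0.5, eps=1e-8):
    """Textbook NLMS echo canceller (illustrative sketch, hypothetical names).

    far_end : reference signal played toward the user (assumed time-aligned)
    mic     : signal picked up on the return path, containing the echo
    Returns the echo-suppressed error signal and the final filter weights.
    """
    far_end = np.asarray(far_end, dtype=float)
    mic = np.asarray(mic, dtype=float)

    w = np.zeros(num_taps)       # adaptive estimate of the echo path
    x_buf = np.zeros(num_taps)   # most recent far-end samples, newest first
    err = np.zeros(len(mic))

    for n in range(len(mic)):
        # Shift the reference buffer and insert the newest far-end sample.
        x_buf = np.roll(x_buf, 1)
        x_buf[0] = far_end[n]

        # Predicted echo and residual after subtracting it.
        echo_hat = w @ x_buf
        err[n] = mic[n] - echo_hat

        # NLMS update: step size normalized by the reference signal energy.
        w += mu * err[n] * x_buf / (x_buf @ x_buf + eps)

    return err, w
```

In a barge-in setting, a double-talk detector would typically freeze the weight update while the near-end talker is active, so that the user's speech does not corrupt the echo-path estimate; that gating is left out here for brevity.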
Original language | English |
---|---|
Pages (from-to) | 1113-1120 |
Number of pages | 8 |
Journal | Contemporary Engineering Sciences |
Volume | 7 |
Issue number | 21-24 |
State | Published - 2014 |
Keywords
- Acoustic echo cancellation
- Barge-in
- Delay estimation
- Double-talk detection
- Speech recognition
- Voice activity detector