장단기 메모리 순환신경망 기반의 비침입적 음성 명료도 추정 방법

Translated title of the contribution: A Non-Intrusive Speech Intelligibility Estimation Method Based on Recurrent Neural Network with Long Short-Term Memory

Research output: Contribution to journalArticlepeer-review

Abstract

This paper proposes a non-intrusive speech intelligibility estimation method with no reference speech signal, which is based on recurrent neural network (RNN) with long short-term memory (LSTM) structure. Conventional standard estimation method P.563 has poor estimation performance and lack of consistency especially in various noise and reverberation environments. The proposed method trains the LSTM RNN model parameters by utilizing the STOI that is the standard intelligibility estimation method with reference speech signal. The input and output of the LSTM RNN are the MFCC vector and the frame-wise STOI values. Experimental results show that the proposed intelligibility estimation method outperforms the conventional standard P.563 in various noise and reverberation environments.
Translated title of the contributionA Non-Intrusive Speech Intelligibility Estimation Method Based on Recurrent Neural Network with Long Short-Term Memory
Original languageKorean
Pages (from-to)1736-1738
Number of pages3
Journal한국통신학회논문지
Volume42
Issue number9
DOIs
StatePublished - Sep 2017

Fingerprint

Dive into the research topics of 'A Non-Intrusive Speech Intelligibility Estimation Method Based on Recurrent Neural Network with Long Short-Term Memory'. Together they form a unique fingerprint.

Cite this