Deep Learning-Based Sound Localization Using Stereo Signals Based on Synchronized ILD

Research output: Contribution to journalArticlepeer-review

Abstract

The interaural level difference (ILD) used for the sound localization using stereo signals is to find the difference in energy that the sound source reaches both ears. The conventional ILD does not consider the time difference of the stereo signals, which is a factor of lowering the accuracy. In this paper, we propose a synchronized ILD that obtains the ILD after synchronizing these time differences. This method uses the cross-correlation function (CCF) to calculate the time difference to reach both ears and use it to obtain synchronized ILD. In order to prove the performance of the proposed method, we conducted two sound localization experiments. In each experiment, the synchronized ILD and CCF or only the synchronized ILD were given as inputs of the deep neural networks (DNN), respectively. In this paper, we evaluate the performance of sound localization with mean error and accuracy of sound localization. Experimental results show that the proposed method has better performance than the conventional methods.
Original languageKorean
Pages (from-to)106-110
Number of pages5
JournalThe International Journal of Internet, Broadcasting and Communication
Volume11
Issue number3
DOIs
StatePublished - Aug 2019

Cite this