Interaural coherence induced ideal binary mask for binaural speech separation and dereverberation

Yi Ting Chen, Tzu Hao Chen, Mao Chang Huang, Tai-Shih Chi

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

We have proposed a spatial-cue based binaural noise reduction algorithm for hearing AIDS. However, in that algorithm, decision parameters are empirically selected. In this paper, we extend the work and propose a supervised classification algorithm for binaural speech enhancement/separation and dereverberation using a modified ideal binary mask (mIBM) as the training target and simple neural networks (NNs) as classifiers. The low complexity of the simple NNs makes the proposed algorithm practical for binaural hearing AIDS. The interaural time difference (ITD) and the interaural level difference (ILD) of each T-F unit are extracted as the basic binaural features. For the purpose of dereverberation, the interaural coherence (IC) is also considered when building the target mIBM and training the NNs. For separation evaluations, our method yields comparable performance to a more complicated benchmark system, which cannot de-reverb the signal. For concurrent separation and dereverberation, our method offers 4 to 5 dB improvement on the frequency-weighted segmental speech-to-noise ratio (SNRfw) over unprocessed speech.

Original languageEnglish
Title of host publicationProceedings of 2016 10th International Symposium on Chinese Spoken Language Processing, ISCSLP 2016
EditorsHsin-Min Wang, Qingzhi Hou, Yuan Wei, Tan Lee, Jianguo Wei, Lei Xie, Hui Feng, Jianwu Dang, Jianwu Dang
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781509042937
DOIs
StatePublished - 2 May 2017
Event10th International Symposium on Chinese Spoken Language Processing, ISCSLP 2016 - Tianjin, China
Duration: 17 Oct 201620 Oct 2016

Publication series

NameProceedings of 2016 10th International Symposium on Chinese Spoken Language Processing, ISCSLP 2016

Conference

Conference10th International Symposium on Chinese Spoken Language Processing, ISCSLP 2016
CountryChina
CityTianjin
Period17/10/1620/10/16

Keywords

  • Binaural speech separation
  • Coherence
  • Dereverberation
  • ILD
  • ITD
  • Neural network

Fingerprint Dive into the research topics of 'Interaural coherence induced ideal binary mask for binaural speech separation and dereverberation'. Together they form a unique fingerprint.

Cite this