TDOA information based vad for robust speech recognition in directional and diffuse noise field

Kuan Lang Huang*, Tai-Shih Chi

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

A two-microphone algorithm is proposed to improve automatic speech recognition (ASR) rates when target speech is corrupted by directional interferences and diffuse noise simultaneously. The algorithm adopts the time difference of arrival (TDOA) to suppress directional interferences and a TDOA-information based voice activity detector (VAD) to suppress diffuse noise. Simulation results show the proposed algorithm is effective in improving ASR rates in a sound field mixed with a directional interference and diffuse noise. Compared with the phase difference (PD) algorithm, the proposed method gives comparable recognition rates when facing a directional interference and much higher and more robust recognition rates when diffuse noise emerges.

Original languageEnglish
Title of host publication2012 8th International Symposium on Chinese Spoken Language Processing, ISCSLP 2012
Pages126-130
Number of pages5
DOIs
StatePublished - 1 Dec 2012
Event2012 8th International Symposium on Chinese Spoken Language Processing, ISCSLP 2012 - Hong Kong, China
Duration: 5 Dec 20128 Dec 2012

Publication series

Name2012 8th International Symposium on Chinese Spoken Language Processing, ISCSLP 2012

Conference

Conference2012 8th International Symposium on Chinese Spoken Language Processing, ISCSLP 2012
CountryChina
CityHong Kong
Period5/12/128/12/12

Keywords

  • Diffuse noise
  • directional interference
  • phase difference
  • time difference of arrival
  • voice activity detector

Fingerprint Dive into the research topics of 'TDOA information based vad for robust speech recognition in directional and diffuse noise field'. Together they form a unique fingerprint.

Cite this