A pitch based VAD adopting quasi-ANSI 1/3 octave filter bank with 11.3 ms latency for monosyllable hearing aids

Yi Cheng Huang, Yi Fan Chiang, Shyh-Jye Jou

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

This paper presents a pitch based voice activity detection (PBVAD) algorithm adopting a quasi-ANSI 1/3 octave filter bank which has low group delay for realistic implementation in hearing aids systems. For compensating the drawback of low resolution resulted from quasi-ASNI filter bank, this pitch based VAD algorithm integrals the features of monosyllable speech such as pitch and corresponding harmonics, onset and time of word length. Simulation results reveal that with more harmonics detection, the accuracy of the proposed PBVAD algorithm improves from 78.9% to 87.7%. Additionally, the proposed VAD algorithm is implemented in ANSI filter bank for comparisons. With the integration of features, the result shows the proposed algorithm can achieve similar VAD accuracy, less than 2.5%, in quasi-ANSI filter bank and ANSI filter bank. Thus, the proposed algorithm can tackle the drawback of quasi- ANSI filter bank and is also suitable for ANSI filter bank. Moreover, the latency incurred by quasi-ANSI filter bank and the proposed VAD algorithm is 11.3ms and this satisfies the requirement of HA systems for practical implementation.

Original languageEnglish
Title of host publication2013 IEEE Workshop on Signal Processing Systems, SiPS 2013
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages48-53
Number of pages6
ISBN (Print)9781467362382
DOIs
StatePublished - 1 Jan 2013
Event2013 IEEE Workshop on Signal Processing Systems, SiPS 2013 - Taipei, Taiwan
Duration: 16 Oct 201318 Oct 2013

Publication series

NameIEEE Workshop on Signal Processing Systems, SiPS: Design and Implementation
ISSN (Print)1520-6130

Conference

Conference2013 IEEE Workshop on Signal Processing Systems, SiPS 2013
CountryTaiwan
CityTaipei
Period16/10/1318/10/13

Keywords

  • Hearing aids
  • Mandarin
  • Non-stationary
  • Pitch
  • Voice activity detection

Fingerprint Dive into the research topics of 'A pitch based VAD adopting quasi-ANSI 1/3 octave filter bank with 11.3 ms latency for monosyllable hearing aids'. Together they form a unique fingerprint.

Cite this