Location classification of nonstationary sound sources using binaural room distribution patterns

Jwu-Sheng Hu*, Wei Han Liu

*Corresponding author for this work

Research output: Contribution to journalArticle

7 Scopus citations

Abstract

This paper discusses the relationships between the nonstationarity of sound sources and the distribution patterns of interaural phase differences (IPDs) and interaural level differences (ILDs) based on short-term frequency analysis. The amplitude variation of nonstationary sound sources is modeled by the exponent of polynomials from the concept of moving pole model. According to the model, the sufficient condition for utilizing the distribution patterns of IPDs and ILDs to localize a nonstationary sound source is suggested and the phenomena of multiple peaks in the distribution pattern can be explained. Simulation is performed to interpret the relation between the distribution patterns of IPD and ILD and the nonstationary sound source. Furthermore, a Gaussian-mixture binaural room distribution model (GMBRDM) is proposed to model distribution patterns of IPDs and ILDs for nonstationary sound source location classification. The effectiveness and performance of the proposed GMBRDM are demonstrated by experimental results.

Original languageEnglish
Article number4804952
Pages (from-to)682-692
Number of pages11
JournalIEEE Transactions on Audio, Speech and Language Processing
Volume17
Issue number4
DOIs
StatePublished - 1 May 2009

Keywords

  • Head-related transfer function (HRTF)
  • Interaural level difference (ILD)
  • Interaural phase difference (IPD)
  • Sound source localization

Fingerprint Dive into the research topics of 'Location classification of nonstationary sound sources using binaural room distribution patterns'. Together they form a unique fingerprint.

  • Cite this