Explicit exploitation of stochastic characteristics of test utterance for text-independent speaker identification

Wei Ho Tsai, Wen-Whei Chang, Chao Shih Huang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Scopus citations

Abstract

In this paper, a novel speaker-identification (speaker-ID) technique based on explicit exploitation of stochastic characteristics of test utterance is proposed. Unlike the conventional approach which hypothesizes the identity of a test speaker by determining which client's model maximizes the likelihood for the test utterance, it is aimed to bilaterally compare test speaker's voices with client speakers' voices instead of simply taking the unilateral likelihoods into account. We study two approaches respectively based on cross likelihood ratio and Bayesian information criterion to accomplish this aim. Performance of the proposed approaches was evaluated by close-set text-independent speaker ID experiments and was shown to be superior to that of the con-ventional approach based on maximum likelihood decision rule.

Original languageEnglish
Title of host publicationEUROSPEECH 2001 - SCANDINAVIA - 7th European Conference on Speech Communication and Technology
EditorsBorge Lindberg, Henrik Benner, Paul Dalsgaard, Zheng-Hua Tan
PublisherInternational Speech Communication Association
Pages771-774
Number of pages4
ISBN (Electronic)8790834100, 9788790834104
StatePublished - 1 Jan 2001
Event7th European Conference on Speech Communication and Technology - Scandinavia, EUROSPEECH 2001 - Aalborg, Denmark
Duration: 3 Sep 20017 Sep 2001

Publication series

NameEUROSPEECH 2001 - SCANDINAVIA - 7th European Conference on Speech Communication and Technology

Conference

Conference7th European Conference on Speech Communication and Technology - Scandinavia, EUROSPEECH 2001
CountryDenmark
CityAalborg
Period3/09/017/09/01

Fingerprint Dive into the research topics of 'Explicit exploitation of stochastic characteristics of test utterance for text-independent speaker identification'. Together they form a unique fingerprint.

  • Cite this

    Tsai, W. H., Chang, W-W., & Huang, C. S. (2001). Explicit exploitation of stochastic characteristics of test utterance for text-independent speaker identification. In B. Lindberg, H. Benner, P. Dalsgaard, & Z-H. Tan (Eds.), EUROSPEECH 2001 - SCANDINAVIA - 7th European Conference on Speech Communication and Technology (pp. 771-774). (EUROSPEECH 2001 - SCANDINAVIA - 7th European Conference on Speech Communication and Technology). International Speech Communication Association.