Unsupervised hierarchical adaptation using reliable selection of cluster-dependent parameters

Jen-Tzung Chien, Jean Claude Junqua

Research output: Contribution to journalArticle

17 Scopus citations

Abstract

Adaptation of speaker-independent hidden Markov models (HMMs) to a new speaker using speaker-specific data is an effective approach to improve speech recognition performance for the enrolled speaker. Practically, it is desirable to flexibly perform the adaptation without any prior knowledge or limitation on the enrolled adaptation data (e.g. data transcription, length and content). However, the inevitable transcription errors may cause unreliability in the model adaptation (or transformation). The variable length and content of adaptation data usually make it necessary to dynamically control the degree of sharing in transformation-based adaptation. This paper presents an unsupervised hierarchical adaptation algorithm for flexible speaker adaptation. We build a tree structure of HMMs such that the control of transformation sharing can be achieved. To perform the unsupervised learning, we apply Bayesian theory to estimate the transformation parameters and data transcription. To select the parameters for hierarchical model transformation, we developed a new algorithm based on the maximum confidence measure (MCM) and minimum description length (MDL) criteria. Experimental comparisons on unsupervised speaker adaptation show that the hybrid adaptation scheme based on MCM and MDL criteria achieves the best recognition results for any lengths of enrollment data.

Original languageEnglish
Pages (from-to)235-253
Number of pages19
JournalSpeech Communication
Volume30
Issue number4
DOIs
StatePublished - 1 Jan 2000

Fingerprint Dive into the research topics of 'Unsupervised hierarchical adaptation using reliable selection of cluster-dependent parameters'. Together they form a unique fingerprint.

  • Cite this