Improving GMM-based spectral conversion with optimal conversion function selection

Hsin Te Hwang*, Wen Liang Wu, Sin-Horng Chen

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

We address a problem in conventional Gaussian mixture model (GMM)-based spectral conversion from the viewpoint of optimal conversion function selection. The proposed method is motivated by the observation that if the optimal conversion function under the minimum mel-cepstral distortion (MMCD) criterion could be selected during the conversion stage, conversion performance in terms of mel-cepstral distortion (MCD) would improve dramatically. To this end, we aim to improve the accuracy of optimal conversion function selection by using MMCD-based data clustering with linear discriminant analysis (LDA). Experimental results confirm that the proposed method effectively improves on the conventional method.
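
The MMCD selection criterion mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes each candidate conversion function is an affine map y = Ax + b (the abstract does not state the form), that the 0th cepstral coefficient has already been dropped, and that a time-aligned target frame is available. That last assumption holds only for parallel training data, which is why the abstract frames selection accuracy at conversion time as the quantity to improve. All function and variable names are hypothetical.

```python
import math


def mcd_db(target, converted):
    """Mel-cepstral distortion in dB between two mel-cepstral vectors.

    Uses the common definition (10 / ln 10) * sqrt(2 * sum of squared differences).
    """
    squared_error = sum((t - c) ** 2 for t, c in zip(target, converted))
    return (10.0 / math.log(10.0)) * math.sqrt(2.0 * squared_error)


def apply_affine(A, b, x):
    """Hypothetical conversion function: y = A x + b (A given as a list of rows)."""
    return [sum(a * xi for a, xi in zip(row, x)) + bi for row, bi in zip(A, b)]


def select_mmcd(functions, source, target):
    """Return (index, mcd) of the candidate function with minimum MCD to the target."""
    scores = [mcd_db(target, apply_affine(A, b, source)) for A, b in functions]
    best = min(range(len(scores)), key=scores.__getitem__)
    return best, scores[best]


# Toy example with two candidate functions on 2-dimensional features.
functions = [
    ([[1.0, 0.0], [0.0, 1.0]], [0.0, 0.0]),   # identity map
    ([[1.0, 0.0], [0.0, 1.0]], [0.5, -0.5]),  # identity plus a constant shift
]
source = [1.0, 2.0]
target = [1.5, 1.5]
best, score = select_mmcd(functions, source, target)
print(best, score)
```

In this toy case the second candidate maps the source frame exactly onto the target, so it is the one selected. In practice the selected indices on training data could serve as class labels for a classifier, which is one plausible reading of the clustering-plus-LDA step described above.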

Original language: English
Title of host publication: 2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010 - Proceedings
Pages: 392-396
Number of pages: 5
DOIs: https://doi.org/10.1109/ISCSLP.2010.5684860
State: Published - 1 Dec 2010
Event: 2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010 - Tainan, Taiwan
Duration: 29 Nov 2010 → 3 Dec 2010

Publication series

Name: 2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010 - Proceedings

Conference

Conference: 2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010
Country: Taiwan
City: Tainan
Period: 29/11/10 → 3/12/10

Keywords

  • Gaussian mixture model (GMM)
  • Voice conversion (VC)

Cite this

    Hwang, H. T., Wu, W. L., & Chen, S-H. (2010). Improving GMM-based spectral conversion with optimal conversion function selection. In 2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010 - Proceedings (pp. 392-396). [5684860] (2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010 - Proceedings). https://doi.org/10.1109/ISCSLP.2010.5684860