Perception-based objective speech quality assessment

Ting Yu Yen*, Jian Hueng Chen, Tai-Shih Chi

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

1 Scopus citation

Abstract

A joint spectro-temporal auditory model is utilized to assess speech quality objectively. The model mimics early and central auditory functions and serves as a spectro-temporal modulation filterbank. Three perceptually relevant parameters (intelligibility, clarity, and naturalness) are addressed by the model and are combined to estimate the subjective mean opinion score (MOS) as a measure of speech quality. Through a simple multiple linear regression analysis, we demonstrate that the proposed perception-based objective speech quality measure performs better than the state-of-the-art P.563 standard in estimating the MOS of codec-distorted speech in the ITU-T Supp. 23 database.
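The regression step mentioned in the abstract can be illustrated with a minimal sketch. It assumes each utterance has already been reduced to three scores (intelligibility, clarity, naturalness); this record does not describe how the auditory model derives them, so the feature values, coefficients, and function names below are hypothetical placeholders. The fit is ordinary least squares via NumPy, shown only to make the idea concrete, not as the authors' implementation.

```python
import numpy as np


def fit_mos_regression(features, mos):
    """Least-squares fit of MOS ~ b0 + b1*intelligibility + b2*clarity + b3*naturalness."""
    X = np.asarray(features, dtype=float)
    y = np.asarray(mos, dtype=float)
    # Prepend a column of ones so the first coefficient is the intercept.
    A = np.column_stack([np.ones(len(X)), X])
    coeffs, *_ = np.linalg.lstsq(A, y, rcond=None)
    return coeffs


def predict_mos(coeffs, features):
    """Apply fitted coefficients to new three-column feature rows."""
    X = np.asarray(features, dtype=float)
    A = np.column_stack([np.ones(len(X)), X])
    return A @ coeffs


def pearson_correlation(a, b):
    """Correlation between predicted and subjective scores."""
    return float(np.corrcoef(a, b)[0, 1])


# Synthetic placeholder data (NOT from the paper or the ITU-T Supp. 23 database).
rng = np.random.default_rng(0)
features = rng.uniform(0.0, 1.0, size=(40, 3))   # columns: intelligibility, clarity, naturalness
true_coeffs = np.array([1.0, 1.5, 1.0, 1.2])     # arbitrary values chosen for the demo
mos = true_coeffs[0] + features @ true_coeffs[1:] + rng.normal(0.0, 0.1, size=40)

coeffs = fit_mos_regression(features, mos)
predicted = predict_mos(coeffs, features)
print("fitted coefficients:", coeffs)
print("correlation with subjective MOS:", pearson_correlation(predicted, mos))
```

In practice the correlation would be computed on held-out conditions rather than on the data used for fitting; the same-data evaluation here just keeps the sketch short.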

Original language: English
Title of host publication: 2009 IEEE International Conference on Acoustics, Speech, and Signal Processing - Proceedings, ICASSP 2009
Pages: 4521-4524
Number of pages: 4
DOIs: https://doi.org/10.1109/ICASSP.2009.4960635
State: Published - 23 Sep 2009
Event: 2009 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2009 - Taipei, Taiwan
Duration: 19 Apr 2009 to 24 Apr 2009

Publication series

Name: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISSN (Print): 1520-6149

Conference

Conference: 2009 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2009
Country: Taiwan
City: Taipei
Period: 19/04/09 to 24/04/09

Keywords

  • Auditory representation
  • Clarity
  • Intelligibility
  • Naturalness
  • Objective speech quality


Cite this

Yen, T. Y., Chen, J. H., & Chi, T-S. (2009). Perception-based objective speech quality assessment. In 2009 IEEE International Conference on Acoustics, Speech, and Signal Processing - Proceedings, ICASSP 2009 (pp. 4521-4524). [4960635] (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings). https://doi.org/10.1109/ICASSP.2009.4960635