Perceptual factor analysis for speech enhancement

Chuan Wei Ting, Jen-Tzung Chien

Research output: Contribution to conferencePaperpeer-review

Abstract

This paper presents a new speech enhancement approach originated from factor analysis (FA) framework. FA is a data analysis model where the relevant common factors can be extracted from observations. A factor loading matrix is found and a resulting model error is introduced for each observation. Interestingly, FA is a subspace approach properly representing the noisy speech. This approach partitions the space of noisy speech into a principal subspace containing clean speech and a complimentary (minor) subspace containing the residual speech and noise. We show that FA is a generalized data model compared to signal subspace approach. To perform FA speech enhancement, we present a perceptual optimization procedure that minimizes the signal distortion subject to the energies of residual speech and noise under a specified level. Importantly, we present a hypothesis testing approach to optimally perform subspace decomposition. In the experiments, we implement perceptual FA speech enhancement using Aurora2 corpus. We find that proposed approach achieves desirable speech recognition rates especially when signal-to-noise ratio is lower than 5 dB.

Original languageEnglish
StatePublished - 1 Dec 2005
Event17th Conference on Computational Linguistics and Speech Processing, ROCLING 2005 - Tainan, Taiwan
Duration: 15 Sep 200516 Sep 2005

Conference

Conference17th Conference on Computational Linguistics and Speech Processing, ROCLING 2005
CountryTaiwan
CityTainan
Period15/09/0516/09/05

Fingerprint Dive into the research topics of 'Perceptual factor analysis for speech enhancement'. Together they form a unique fingerprint.

Cite this