Recent Progress of Mandrain Spontaneous Speech Recognition on Mandrain Conversation Dialogue Corpus

Yu Chih Deng, Yih Ru Wang, Sin Horng Chen, Chen Yu Chiang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

This paper presents a progress report on a relatively difficult ASR task on a spontaneous speech corpus-Mandarin Conversational Dialogue Corpus (MCDC). A DNN-based acoustic model is constructed based on the CLDNN structure with a large dataset that comprises two spontaneous-speech corpora and one read-speech corpus. The study uses a large text dataset formed by seven corpora to train an efficient general language model (LM). Two adapted LMs specially for spontaneous speech recognition are also constructed. Experimental results showed that the best performances of 26.3% in character error rate (CER) and 32.5% in word error rate (WER) were reached on MCDC. They represented 27.9% and 22.2% of relative CER and WER reductions as compared with the performances by the previous best HMM-based method. This confirms that the proposed method is promising in tackling on Mandarin spontaneous speech recognition.

Original languageEnglish
Title of host publication2019 22nd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781728124490
DOIs
StatePublished - Oct 2019
Event22nd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2019 - Cebu, Philippines
Duration: 25 Oct 201927 Oct 2019

Publication series

Name2019 22nd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2019

Conference

Conference22nd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2019
CountryPhilippines
CityCebu
Period25/10/1927/10/19

Keywords

  • CLDNN
  • MCDC Corpus
  • Spontaneous Speech Recognition

Fingerprint Dive into the research topics of 'Recent Progress of Mandrain Spontaneous Speech Recognition on Mandrain Conversation Dialogue Corpus'. Together they form a unique fingerprint.

Cite this