A two-stage sample-based phone boundary detector using segmental similarity features

Yih-Ru Wang*

*Corresponding author for this work

Research output: Contribution to journal › Conference article

1 Scopus citations

Abstract

In this paper, a two-stage sample-based phone boundary detection algorithm is proposed. In the first stage, local sample-based acoustic parameters are used to pre-select phone boundary candidates. In the second stage, high-order statistics of the log-likelihood differences of the two adjacent speech segments around each boundary candidate are calculated to serve as a similarity measure for candidate verification. Experimental results on the TIMIT speech corpus showed that EERs of 8.6% and 7.6% were achieved for one-stage and two-stage sample-based phone boundary detection, respectively. Moreover, for the two-stage system, 42.1% and 81.9% of the detected boundaries were within 5- and 15-sample error tolerances of the manual labels.
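The tolerance figures quoted in the abstract can be illustrated with a minimal sketch. It is not the paper's evaluation code: the function name `fraction_within_tolerance`, the nearest-reference matching rule, and the inclusive tolerance check are all assumptions made for illustration, and the sample positions in the usage note are invented.

```python
import bisect


def fraction_within_tolerance(detected, reference, tolerance):
    """Fraction of detected boundaries within `tolerance` samples of a reference boundary.

    Assumption: each detected boundary is compared with its nearest
    reference (manually labelled) boundary, and the tolerance is inclusive.
    The paper's exact matching protocol is not stated in the abstract.
    """
    ref = sorted(reference)
    if not detected or not ref:
        return 0.0

    hits = 0
    for d in detected:
        # Locate where d would sit in the sorted reference list; the nearest
        # reference boundary is then the neighbour on either side.
        i = bisect.bisect_left(ref, d)
        neighbours = ref[max(i - 1, 0):i + 1]
        if min(abs(d - r) for r in neighbours) <= tolerance:
            hits += 1
    return hits / len(detected)
```

With hypothetical sample indices `detected = [100, 207, 330]` and `reference = [98, 200, 320, 400]`, calling the function with a tolerance of 5 and then 15 shows how the share of accepted boundaries grows as the tolerance widens.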

Original language: English
Pages (from-to): 413-416
Number of pages: 4
Journal: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
State: Published - 1 Dec 2011
Event: 12th Annual Conference of the International Speech Communication Association, INTERSPEECH 2011 - Florence, Italy
Duration: 27 Aug 2011 → 31 Aug 2011

Keywords

  • Phone boundary detection
  • Similarity measure
