A new approach for efficient text analyzer is proposed. The prosody generator driven method is employed to design an efficient text analyzer for Mandarin text-to-speech. Three heuristic and theoretical methods are used to examine the capability of each linguistic feature. Firstly, the contribution of each linguistic feature on prosody generator is examined experimentally. Secondly, the cross-influence of each linguistic feature on the prosody generator is analyzed. Thirdly, the problem of over- and under- classification on the linguistic feature will be inspected. Finally, these three analytic results are referenced to design an efficient text analyzer. More than 39,103 Chinese characters are employed to examine the performance of our text analyzer. Less than 78ms is need for word tagging under P4-1.4G PC. The correction rate with 97% is achieved. It confirms that the performance of our text analyzer is very good. Moreover, more natural and fluent speech is obtained under the lower computation.
|Number of pages||4|
|Journal||ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings|
|State||Published - 25 Sep 2003|
|Event||2003 IEEE International Conference on Accoustics, Speech, and Signal Processing - Hong Kong, Hong Kong|
Duration: 6 Apr 2003 → 10 Apr 2003