Text-independent speaker identification using an orthogonal pitch parameter is described; the method takes advantage of the four-tone structure in the pitch contour of Mandarin speech. The slopes, mean, and duration of the pitch contour of each word in an utterance are taken as recognition features. An 85% identification rate is achieved using pitch-contour parameters alone. Combining pitch-contour parameters with vocal-tract parameters outperforms using either set of parameters alone, reaching a recognition rate of 99.2%.
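As a rough illustration of the per-word features named above (slopes, mean, and duration of the pitch contour), the following sketch computes them from a list of pitch values for one word. It is not the paper's procedure: the function names, the 10 ms frame step, the split into two equal segments, and the least-squares line fit are all assumptions made for this example, and the orthogonal pitch parameter itself is not reproduced.

```python
from typing import Dict, List, Sequence


def ls_slope(times: Sequence[float], values: Sequence[float]) -> float:
    """Least-squares slope of values against times (0.0 if undefined)."""
    n = len(times)
    if n < 2:
        return 0.0
    t_mean = sum(times) / n
    v_mean = sum(values) / n
    denom = sum((t - t_mean) ** 2 for t in times)
    if denom == 0.0:
        return 0.0
    num = sum((t - t_mean) * (v - v_mean) for t, v in zip(times, values))
    return num / denom


def pitch_features(
    f0_hz: Sequence[float],
    frame_step_s: float = 0.01,  # assumed frame step, not from the paper
    n_segments: int = 2,         # assumed number of slope segments
) -> Dict[str, object]:
    """Slopes (Hz/s), mean (Hz), and duration (s) of one word's pitch contour."""
    n = len(f0_hz)
    if n == 0:
        raise ValueError("pitch contour must contain at least one frame")
    times: List[float] = [i * frame_step_s for i in range(n)]
    slopes: List[float] = []
    for k in range(n_segments):
        # Split the contour into roughly equal, consecutive segments.
        start = k * n // n_segments
        end = (k + 1) * n // n_segments
        slopes.append(ls_slope(times[start:end], f0_hz[start:end]))
    return {
        "slopes": slopes,
        "mean": sum(f0_hz) / n,
        "duration": n * frame_step_s,
    }


# Hypothetical rising-then-falling contour for a single word.
features = pitch_features([100, 110, 120, 130, 120, 110, 100, 90])
```

For the hypothetical contour shown, the first segment yields a positive slope and the second a negative one, which is the kind of tonal shape information such features are meant to capture.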
|Number of pages||4|
|Journal||ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings|
|State||Published - 1 Jan 1987|