Robust speaker adaptation using a piecewise linear acoustic mapping

J.R. Bellegarda; P.V. de Souza; A.J. Nadas; D. Nahamoo; M.A. Picheny; L.R. Bahl

doi:10.1109/ICASSP.1992.225876

Acoustics, Speech, and Signal Processing, IEEE International Conference on

Robust speaker adaptation using a piecewise linear acoustic mapping

Year: 1992, Volume: 1, Pages: 445-448

DOI Bookmark: 10.1109/ICASSP.1992.225876

Authors

J.R. Bellegarda, IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
P.V. de Souza, IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
A.J. Nadas, IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
D. Nahamoo, IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
M.A. Picheny, IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
L.R. Bahl, IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA

Abstract

In a large vocabulary speech recognition system, it is desirable to make use of previously acquired speech data when encountering new speakers. The authors describe an adaptation strategy based on a piecewise linear mapping between the feature space of a new speaker and that of a reference speaker. This speaker-normalizing mapping is used to transform the previously acquired parameters of the reference speaker onto the space of the new speaker. This results in a robust speaker adaptation procedure which allows for a drastic reduction in the amount of training data required from the new speaker. The performance of this method is illustrated on an isolated utterance speech recognition task with a vocabulary of 20000 words.

Like what you’re reading?

Already a member?Sign In

Member Price

$11

Non-Member Price

$21

Add to Cart Sign In

Get this article FREE with a new membership!