Acoustic Model Adaptation for Speech Recognition

作者:Shinoda Koichi*
来源:IEICE Transactions on Information and Systems, 2010, E93D(9): 2348-2362.
DOI:10.1587/transinf.E93.D.2348

摘要

Statistical speech recognition using continuous-density hidden Markov models (CDHMMs) has yielded many practical applications. However, in general, mismatches between the training data and input data significantly degrade recognition accuracy. Various acoustic model adaptation techniques using a few input utterances have been employed to overcome this problem. In this article, we survey these adaptation techniques, including maximum a posteriori (MAP) estimation, maximum likelihood linear regression (MLLR), and eigenvoice. We also present a schematic view called the adaptation pyramid to illustrate how these methods relate to each other.

  • 出版日期2010-9