Active Learning Using Phone-Error Distribution for Speech Modeling

Murakami Hiroko<sup>*</sup>; Shinoda Koichi; Furui Sadaoki

doi:10.1587/transinf.E95.D.2486

摘要

We propose an active learning framework for speech recognition that reduces the amount of data required for acoustic modeling. This framework consists of two steps. We first obtain a phone-error distribution using an acoustic model estimated from transcribed speech data. Then, from a text corpus we select a sentence whose phone-occurrence distribution is close to the phone-error distribution and collect its speech data. We repeat this process to increase the amount of transcribed speech data. We applied this framework to speaker adaptation and acoustic model training. Our evaluation results showed that it significantly reduced the amount of transcribed data while maintaining the same level of accuracy.

出版日期2012-10

全文

访问全文

收藏分享被引浏览

更新时间：2018-04-10 19:52

Active Learning Using Phone-Error Distribution for Speech Modeling

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友