Abstract

Large vocabulary continuous speech recognition is particularly difficult for low-resource languages. In the scenario we focus on here, there is a very limited amount of acoustic training data in the target language, but more plentiful data in other languages. We investigate both feature-level and model-level approaches. The first is based on the MLP framework, in which we train multiple feature streams individually using the Automatic Speech Attribute Transcription (ASAT) strategy and a data sampling method, and we present a multilingual training mode that uses data from the non-target languages to obtain more discriminative features. At the model level, we apply the recently proposed Subspace Gaussian Mixture Model (SGMM) to obtain further gains. Finally, by combining the two strategies in the multilingual training mode, we achieve a large improvement of more than 13% absolute over a conventional baseline.

Full text