Abstract

Batch normalization has shown success in image classification and other image processing tasks by reducing internal covariate shift during the training of deep network models. In this paper, we propose applying batch normalization to speech recognition within the hybrid NN-HMM framework. We evaluate the performance of this method in the acoustic model of the hybrid system on a speaker-independent speech recognition task using several Chinese datasets. Compared with the previous best model we used on these Chinese datasets, batch normalization achieves a relative word error rate (WER) reduction of 8%-13%, while requiring only 60% of the training iterations of the original model.