Application of Morphosyntactic and Class-Based Language Models in Automatic Speech Recognition of Polish

Smywinski Pohl Alexsander; Ziolko Bartosz

doi:10.1142/S0218213016500068

摘要

In this paper we investigate the usefulness of morphosyntactic information as well as clustering in modeling Polish for automatic speech recognition. Polish is an inflectional language, thus we investigate the usefulness of an N-gram model based on morphosyntactic features. We present how individual types of features influence the model and which types of features are best suited for building a language model for automatic speech recognition. We compared the results of applying them with a class-based model that is automatically derived from the training corpus. We show that our approach towards clustering performs significantly better than frequently used SRI LM clustering method. However, this difference is apparent only for smaller corpora.

出版日期2016-4

全文

访问全文

收藏分享被引浏览

更新时间：2021-03-25 23:39

Application of Morphosyntactic and Class-Based Language Models in Automatic Speech Recognition of Polish

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友