A Sparse Plus Low-Rank Exponential Language Model for Limited Resource Scenarios

Hutchinson Brian<sup>*</sup>; Ostendorf Mari; Fazel Maryam

doi:10.1109/TASLP.2014.2379593

摘要

This paper describes a new exponential language model that decomposes the model parameters into one or more low-rank matrices that learn regularities in the training data and one or more sparse matrices that learn exceptions (e.g., keywords). The low-rank matrices induce continuous-space representations of words and histories. The sparse matrices learn multi-word lexical items and topic/domain idiosyncrasies. This model generalizes the standard l(1)-regularized exponential language model, and has an efficient accelerated first-order training algorithm. Language modeling experiments show that the approach is useful in scenarios with limited training data, including low resource languages and domain adaptation.

出版日期2015-3

全文

访问全文

收藏分享被引(7) 浏览

更新时间：2021-04-10 06:54

A Sparse Plus Low-Rank Exponential Language Model for Limited Resource Scenarios

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友