A novel density-based clustering method using word embedding features for dialogue intention recognition

Jang Jungsun; Lee Yeonsoo; Lee Seolhwa; Shin Dongwon; Kim Dongjun; Rim Haechang

doi:10.1007/s10586-016-0649-7

摘要

In dialogue systems, understanding user utterances is crucial for providing appropriate responses. Various classification models have been proposed to deal with natural language understanding tasks related to user intention analysis, such as dialogue acts or emotion recognition. However, models that use original lexical features without any modifications encounter the problem of data sparseness, and constructing sufficient training data to overcome this problem is labor-intensive, time-consuming, and expensive. To address this issue, word embedding models that can learn lexical synonyms using vast raw corpora have recently been proposed. However, the analysis of embedding features is not yet sufficient to validate the efficiency of such models. Specifically, using the cosine similarity score as a feature in the embedding space neglects the skewed nature of the word frequency distribution, which can affect the improvement of model performance. This paper describes a novel density-based clustering method that efficiently integrates word embedding vectors into dialogue intention recognition. Experimental results show that our proposed model helps overcome the data sparseness problem seen in previous classification models and can assist in improving the classification performance.

出版日期2016-12

全文

访问全文

收藏分享被引(4) 浏览

更新时间：2021-03-21 03:31

A novel density-based clustering method using word embedding features for dialogue intention recognition

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友