A non-negative tensor factorization model for selectional preference induction

作者:Van de Cruys Tim*
来源:Natural Language Engineering, 2010, 16(4): 417-437.
DOI:10.1017/S1351324910000148

摘要

The distributional similarity methods have proven to be a valuable tool for the induction of semantic similarity. Until now, most algorithms use two-way co-occurrence data to compute the meaning of words. Co-occurrence frequencies, however, need not be pairwise. One can easily imagine situations where it is desirable to investigate co-occurrence frequencies of three modes and beyond. This paper will investigate tensor factorization methods to build a model of three-way co-occurrences. The approach is applied to the problem of selectional preference induction, and automatically evaluated in a pseudo-disambiguation task. The results show that tensor factorization, and non-negative tensor factorization in particular, is a promising tool for Natural Language Processing (NLP).

  • 出版日期2010-10
  • 单位INRIA