An analysis of NP-completeness in novelty and diversity ranking

Carterette Ben<sup>*</sup>

doi:10.1007/s10791-010-9157-1

摘要

A useful ability for search engines is to be able to rank objects with novelty and diversity: the top k documents retrieved should cover possible intents of a query with some distribution, or should contain a diverse set of subtopics related to the user's information need, or contain nuggets of information with little redundancy. Evaluation measures have been introduced to measure the effectiveness of systems at this task, but these measures have worst-case NP-hard computation time. The primary consequence of this is that there is no ranking principle akin to the Probability Ranking Principle for document relevance that provides uniform instruction on how to rank documents for novelty and diversity. We use simulation to investigate the practical implications of this for optimization and evaluation of retrieval systems.

出版日期2011-2

全文

访问全文

收藏分享被引(7) 浏览

更新时间：2018-02-09 22:19

An analysis of NP-completeness in novelty and diversity ranking

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友