Processing top-N relational queries by learning

作者:Zhu Liang*; Meng Weiyi; Liu Chunnian; Yang Wenzhu; Liu Dazhong
来源:Journal of Intelligent Information Systems, 2010, 34(1): 21-55.
DOI:10.1007/s10844-009-0078-7

摘要

A top-N selection query against a relation is to find the N tuples that satisfy the query condition the best but not necessarily completely. In this paper, we propose a new method for evaluating top-N queries against a relation. This method employs a learning-based strategy. Initially, this method finds and saves the optimal search spaces for a small number of random top-N queries. The learned knowledge is then used to evaluate new queries. Extensive experiments are carried out to measure the performance of this strategy and the results indicate that it is highly competitive with existing techniques for both low-dimensional and high-dimensional data. Furthermore, the knowledge base can be updated based on new user queries to reflect new query patterns so that frequently submitted queries can be processed most efficiently. The maintenance and stability of the knowledge base are also addressed in the paper.