Adapting ADtrees for improved performance on large datasets with high-arity features

Van Dam Robert; Langkilde Geary Irene; Ventura Dan*

doi:10.1007/s10115-012-0510-0

Abstract

The ADtree, a data structure useful for caching sufficient statistics, has been successfully adapted to grow lazily when memory is limited and to update sequentially as a dataset is incrementally updated. However, even these modified forms of the ADtree still exhibit inefficiencies in both space usage and query time, particularly on datasets with very high dimensionality and with high-arity features. We propose four modifications to the ADtree, each of which can be used to reduce size and query time for specific types of datasets and features. These modifications also provide an increased ability to precisely control how an ADtree is built and to tune its size given external memory or speed requirements.

Publication date: 2013-06


Last updated: 2018-04-21 04:50
