Approximate similarity search for online multimedia services on distributed CPU-GPU platforms

Teodoro George<sup>*</sup>; Valle Eduardo; Mariano Nathan; Torres Ricardo; Meira Wagner Jr; Saltz Joel H

doi:10.1007/s00778-013-0329-7

摘要

Similarity search in high-dimensional spaces is a pivotal operation for several database applications, including online content-based multimedia services. With the increasing popularity of multimedia applications, these services are facing new challenges regarding (1) the very large and growing volumes of data to be indexed/searched and (2) the necessity of reducing the response times as observed by end-users. In addition, the nature of the interactions between users and online services creates fluctuating query request rates throughout execution, which requires a similarity search engine to adapt to better use the computation platform and minimize response times. In this work, we address these challenges with Hypercurves, a flexible framework for answering approximate k-nearest neighbor (kNN) queries for very large multimedia databases. Hypercurves executes in hybrid CPU-GPU environments and is able to attain massive query-processing rates through the cooperative use of these devices. Hypercurves also changes its CPU-GPU task partitioning dynamically according to the observed load, aiming for optimal response times. In our empirical evaluation, dynamic task partitioning reduced query response times by approximately 50 % compared to the best static task partition. Due to a probabilistic proof of equivalence to the sequential kNN algorithm, the CPU-GPU execution of Hypercurves in distributed (multi-node) environments can be aggressively optimized, attaining superlinear scalability while still guaranteeing, with high probability, results at least as good as those from the sequential algorithm.

出版日期2014-6

全文

访问全文

收藏分享被引(6) 浏览

更新时间：2019-03-28 03:14

Approximate similarity search for online multimedia services on distributed CPU-GPU platforms

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友