摘要

We study data structures in the presence of adversarial noise. We want to encode a given object in a succinct data structure that enables us to efficiently answer specific queries about the object, even if the data structure has been corrupted by a constant fraction of errors. We measure the efficiency of a data structure in terms of its length (the number of bits in its representation) and query-answering time, measured by the number of bit-probes to the (possibly corrupted) representation. The main issue is the trade-off between these two. This new model is the common generalization of (static) data structures and locally decodable error-correcting codes (LDCs). We prove a number of upper and lower bounds on various natural error-correcting data structure problems. In particular, we show that the optimal length of t-probe error-correcting data structures for the MEMBERSHIP problem (where we want to store subsets of size s from a universe of size n such that membership queries can be answered efficiently) is approximately the optimal length of t-probe LDCs that encode strings of length s. It has been conjectured that LDCs with small t must be superpolynomially long. This bad probes-versus-length trade-off carries over to error-correcting data structures for MEMBERSHIP and many other data structure problems. We then circumvent this problem by defining so-called relaxed error-correcting data structures, inspired by the notion of %26quot;relaxed locally decodable codes%26quot; developed in the PCP literature. Here the decoder is required to answer most queries correctly with high probability, and for the remaining queries the decoder with high probability either answers correctly or declares %26quot;don%26apos;t know.%26quot; Furthermore, if there is no noise on the data structure, it answers all queries correctly with high probability. We obtain positive results for the following two data structure problems: (1) MEMBERSHIP. We construct a relaxed error-correcting data structure for this problem with length nearly linear in s log n that answers membership queries with O(1) bit-probes. This nearly matches the asymptotically optimal parameters for the noiseless case: length O(s log n) and one bit-probe, due to Buhrman et al. (2) UNIVARIATE POLYNOMIAL EVALUATION (namely, we want to store a univariate polynomial g of degree deg(g) %26lt;= s over the integers modulo n such that evaluation queries can be answered efficiently; i.e., we can evaluate the output of g on a given integer modulo n). We construct a relaxed error-correcting data structure for this problem with length nearly linear in s log n that answers evaluation queries with polylog(s) . log(1+o(1)) (n) bit-probes. This nearly matches the parameters of the best known noiseless construction due to Kedlaya and Umans.

  • 出版日期2013