A memory path index for DOM tree queries

Authors: Yang Qing*; Zhang Huibing; Zhang Jingwei
Source: Information Technology Journal, 2011, 10(9): 1830-1835.
DOI: 10.3923/itj.2011.1830.1835

Abstract

Large amounts of content with similar structure are increasingly integrated into a single large XML document, for example product records crawled from the Web, on which users require fast retrieval. The DOM tree is an in-memory model for HTML and XML documents. Three basic queries arise on this model: (1) retrieving the content located by a given absolute path, (2) retrieving the content located by a given relative path, and (3) obtaining the path that locates a given piece of content. For an XML document of great width and depth, these queries often have long response times because of the time spent on tree traversal. Improving query efficiency on DOM trees would therefore benefit Web applications and XML query processing. This study proposes three index structures, one for each basic query; each answers its query quickly by using a backward search strategy. Experiments on different XML documents show that these index structures reduce query time effectively.
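
The three basic queries can be illustrated with a minimal Python sketch. It is not the index design from the paper, which the abstract does not describe; it only assumes that one traversal of the tree can fill three in-memory dictionaries, after which each query is a lookup rather than a new traversal. The names `build_indexes`, `query_relative`, `by_path`, `by_last_tag`, and `by_content`, as well as the sample document, are hypothetical. The relative-path lookup shows one possible reading of "backward search": it starts from the last step of the path and compares path suffixes.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict


def build_indexes(xml_text):
    """Walk the tree once and fill three dictionaries (illustrative only)."""
    root = ET.fromstring(xml_text)
    by_path = defaultdict(list)      # absolute path -> text values
    by_last_tag = defaultdict(list)  # last tag -> (path steps, text)
    by_content = defaultdict(list)   # text value -> absolute paths

    def walk(node, prefix):
        steps = prefix + (node.tag,)
        text = (node.text or "").strip()
        if text:
            path = "/" + "/".join(steps)
            by_path[path].append(text)
            by_last_tag[node.tag].append((steps, text))
            by_content[text].append(path)
        for child in node:
            walk(child, steps)

    walk(root, ())
    return by_path, by_last_tag, by_content


def query_relative(by_last_tag, relative_path):
    """Start from the last step of the path, then compare suffixes backward."""
    wanted = tuple(relative_path.strip("/").split("/"))
    candidates = by_last_tag.get(wanted[-1], [])
    return [text for steps, text in candidates
            if steps[-len(wanted):] == wanted]


# Hypothetical sample document with two similar product records.
doc = (
    "<catalog>"
    "<product><name>pen</name><price>2</price></product>"
    "<product><name>ink</name><price>5</price></product>"
    "</catalog>"
)
by_path, by_last_tag, by_content = build_indexes(doc)

absolute_hits = by_path["/catalog/product/name"]              # query (1)
relative_hits = query_relative(by_last_tag, "product/price")  # query (2)
paths_for_ink = by_content["ink"]                             # query (3)
```

In this sketch the paths carry no positional indexes, so sibling records share one path key; a real index would likely need to distinguish them.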

  • Publication date: 2011
