摘要

Existing work which aims to provide a ground on Frequently Changing Structures from XML data could be discovered mainly devoted to the discovery of structural changes, while neglected the changes of content. Therefore, an improved approach, i.e. SC-Mining, is proposed in this paper to determine a better way to mine Frequently Changing Sections from XML documents considering changes of both structure and content. In order to reduce the times of scanning documents and make the discovering process efficient, a data model, Historical Structure and Content-Document Object Model (HSC-DOM), is proposed, together with some optimization techniques. After a little modification, HSC-DOM can be used to maintain versions of dynamic XML data. Experimental results show that our algorithms are efficient.

  • 出版日期2011

全文