Density link-based methods for clustering web pages

Authors: Chehreghani Morteza Haghir; Abolhassani Hassan; Chehreghani Mostafa Haghir
Source: Decision Support Systems, 2009, 47(4): 374-382.
DOI: 10.1016/j.dss.2009.04.002

Abstract

The World Wide Web is a huge information space, making it a valuable resource for decision making. However, it must be effectively managed for that purpose. One important management technique is clustering web data. In this paper, we propose some developments in clustering methods to achieve higher quality. First, we study a new density-based method adapted for hierarchical clustering of web documents. Then, utilizing the hyperlink structure of the web, we propose a new method that combines density concepts with the web graph. These algorithms have the advantage of low complexity and, as the experimental results reveal, the resulting clusters are of high quality.

  • Publication date: 2009-11