An Algorithm of Semi-supervised Web-page Classification Based on Fuzzy Clustering

Authors: Chen Geng*; Zhu Yuquan; Tan Jianing; Hu Tianhan
Source: International Forum on Information Technology and Applications (IFITA 2009), Chengdu, Sichuan, China, 2009-05-15 to 2009-05-17.
DOI: 10.1109/IFITA.2009.490

Abstract

Labeled training samples are very difficult to obtain, whereas unlabeled samples are very easy to obtain. How to classify Web pages using these training samples is therefore an important task. An algorithm called FC-TSVM, based on fuzzy clustering, is proposed. FC-TSVM uses a fuzzy clustering algorithm to determine the number of positive-label samples and adds homepage hyperlink information as part of the classification. The experiments show that FC-TSVM can effectively improve the accuracy and stability of Web-page classification.
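
The abstract does not say how the fuzzy clustering step yields the number of positive samples, so the following is only a minimal sketch under stated assumptions: standard fuzzy c-means with two clusters is run on the unlabeled feature vectors, the cluster whose center lies nearest the centroid of the labeled positives is treated as the positive cluster, and the unlabeled points with membership above 0.5 in that cluster are counted. The function names, the farthest-point initialisation, and the 0.5 threshold are all hypothetical and are not taken from the paper.

```python
import math


def memberships(points, centers, m):
    """Fuzzy c-means membership of every point in every cluster."""
    rows = []
    for p in points:
        # Clamp distances so a point sitting on a center does not divide by zero.
        d = [max(math.dist(p, c), 1e-12) for c in centers]
        rows.append([
            1.0 / sum((d[j] / d[k]) ** (2.0 / (m - 1.0)) for k in range(len(centers)))
            for j in range(len(centers))
        ])
    return rows


def fuzzy_c_means(points, c=2, m=2.0, iters=100, tol=1e-6):
    """Standard fuzzy c-means; returns (centers, memberships)."""
    # Assumption: deterministic farthest-point initialisation of the centers.
    centers = [list(points[0])]
    while len(centers) < c:
        far = max(points, key=lambda p: min(math.dist(p, q) for q in centers))
        centers.append(list(far))
    dim = len(points[0])
    for _ in range(iters):
        u = memberships(points, centers, m)
        new_centers = []
        for j in range(c):
            # Each center is the mean of the points weighted by membership^m.
            w = [row[j] ** m for row in u]
            total = sum(w)
            new_centers.append([
                sum(wi * p[d] for wi, p in zip(w, points)) / total
                for d in range(dim)
            ])
        shift = max(math.dist(a, b) for a, b in zip(centers, new_centers))
        centers = new_centers
        if shift < tol:
            break
    return centers, memberships(points, centers, m)


def estimate_num_positive(unlabeled, labeled_positive):
    """Count unlabeled points that mostly belong to the 'positive' cluster."""
    centers, u = fuzzy_c_means(unlabeled, c=2)
    dim = len(labeled_positive[0])
    pos_centroid = [
        sum(p[d] for p in labeled_positive) / len(labeled_positive)
        for d in range(dim)
    ]
    # Assumption: the positive cluster is the one nearest the labeled positives.
    pos = min(range(2), key=lambda j: math.dist(centers[j], pos_centroid))
    return sum(1 for row in u if row[pos] > 0.5)


# Toy 2-D feature vectors standing in for Web-page features.
unlabeled = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1),
             (5.0, 5.0), (5.1, 5.0), (5.0, 5.1), (4.9, 5.0)]
labeled_positive = [(5.0, 5.05)]
num_positive = estimate_num_positive(unlabeled, labeled_positive)
```

In a transductive SVM, an estimate of this kind could replace a hand-chosen count of positive samples among the unlabeled data; how FC-TSVM combines it with the hyperlink information is not stated in the abstract.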

  • Publication date: 2009
  • Affiliation: Nanjing Audit University
