A Multi-layer Representation Model for Multimedia Web Page and Topic

作者:Shi Peng*; Hu Changjun; Ni Yuemin; Ding Lianhong
来源:Pacific/Asia Workshop on Computational Intelligence and Industrial Application, 2008-12-19 to 2008-12-20.

摘要

Web topic indicates the topic from Web pages on the Internet. Traditional methods to describe a Web topic come from text mining. However, Web page consists of not only text but also multimedia contents, such as image, audio, video and so on. The multimedia contents of Web topic can't be denoted by text-based representation methods. This paper proposes a new approach, named multi-layer representation model, to represent Web topic with all the contents that Web page contains. The model is composed of several semantic layers, including text layer, image layer, audio layer, video layer and other extensible layers. Web resources are located on different layers according to their types. Their relationships within one layer and between layers are represented by inner-layer links and cross-layer links respectively. This method can exactly describe Web topic with richer resource semantics and bring benefits for the similarity computing between Web topics.