A class of neural-network-based transducers for web information extraction

Sleiman Hassan A<sup>*</sup>; Corchuelo Rafael

doi:10.1016/j.neucom.2013.05.057

摘要

The Web is a huge and still growing information repository that has attracted the attention of many companies. Many such companies rely on information extractors to integrate information that is buried into semi-structured web documents into automatic business processes. Many information extractors build on extraction rules, which can be handcrafted or learned using supervised or unsupervised techniques. The literature provides a variety of techniques to learn information extraction rules that build on ad hoc machine learning techniques. In this paper, we propose a hybrid approach that explores the use of standard machine-learning techniques to extract web information. We have specifically explored using neural networks; our results show that our proposal outperforms three state-of-the-art techniques in the literature, which opens up quite a new approach to information extraction.

出版日期2014-7-5

全文

访问全文

收藏分享被引(15) 浏览

更新时间：2024-03-31 23:43

A class of neural-network-based transducers for web information extraction

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友