Network analysis of narrative content in large corpora

作者:Sudhahar Saatviga*; De Fazio Gianluca; Franzosi Roberto; Cristianini Nello
来源:Natural Language Engineering, 2015, 21(1): 81-112.
DOI:10.1017/S1351324913000247

摘要

We present a methodology for the extraction of narrative information from a large corpus. The key idea is to transform the corpus into a network, formed by linking the key actors and objects of the narration, and then to analyse this network to extract information about their relations. By representing information into a single network it is possible to infer relations between these entities, including when they have never been mentioned together. We discuss various types of information that can be extracted by our method, various ways to validate the information extracted and two different application scenarios. Our methodology is very scalable, and addresses specific research needs in social sciences.

  • 出版日期2015-1