A soft computing approach to big data summarization

Smits Gregory<sup>*</sup>; Pivert Olivier; Yager Ronald R; Nerzic Pierre

doi:10.1016/j.fss.2018.02.017

摘要

The added value of a dataset lies in the knowledge a domain expert can extract from it. Considering the continuously increasing volume and velocity of these datasets, efficient tools have to be defined to generate meaningful, condensed and human-interpretable representations of big datasets. In the proposed approach, soft computing techniques are used to define an interface between the numerical and categorical space of data definition and the linguistic space of human reasoning. Based on the expert's own vocabulary about the data, a personal summary composed of linguistic terms is efficiently generated and graphically displayed as a term cloud offering a synthetic view of the data properties. Using dedicated indexing strategies linking data and their subjective linguistic rewritings, exploration functionalities are provided on top of the summary to let the user browse the data. Experimentations confirm that the space change operates in linear time wrt. the size of the dataset making the approach tractable on large scale data.

出版日期2018-10-1

全文

访问全文

收藏分享被引浏览

更新时间：2021-01-10 19:45

A soft computing approach to big data summarization

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友