An Ontology-Based Text Mining Method to Develop D-Matrix from Unstructured Text

作者:Rajpathak Dnyanesh G*; Singh Satnam
来源:IEEE Transactions on Systems, Man, and Cybernetics: Systems , 2014, 44(7): 966-977.
DOI:10.1109/TSMC.2013.2281963

摘要

Fault dependency (D)-matrix is a systematic diagnostic model [7] to capture the hierarchical system-level fault diagnostic information consisting of dependencies between observable symptoms and failure modes associated with a system. Constructing a D-matrix from first principles and updating it using the domain knowledge is a labor intensive and time consuming task. Further, in-time augmentation of D-matrix through the discovery of new symptoms and failure modes observed for the first time is a challenging task. Here, we describe an ontology-based text mining method for automatically constructing and updating a D-matrix by mining hundreds of thousands of repair verbatim (typically written in unstructured text) collected during the diagnosis episodes. In our approach, we first construct the fault diagnosis ontology consisting of concepts and relationships commonly observed in the fault diagnosis domain. Next, we employ the text mining algorithms that make use of this ontology to identify the necessary artifacts, such as parts, symptoms, failure modes, and their dependencies from the unstructured repair verbatim text. The proposed method is implemented as a prototype tool and validated by using real-life data collected from the automobile domain.

  • 出版日期2014-7