An Algorithm for Pattern Extraction in Fingerprints

作者:Palacios Bejarano Bernardo; Cerruela Garcia Gonzalo; Luque Ruiz Irene; Angel Gomez Nieto Miguel
来源:Chemometrics and Intelligent Laboratory Systems, 2013, 125: 87-100.
DOI:10.1016/j.chemolab.2013.04.003

摘要

In this paper, we describe an algorithm devised to be used for the extraction of patterns existing in fingerprints. The algorithm takes as input data a data set of molecules represented by its corresponding fingerprints, and generates a set of disjoint patterns also consisting of binary arrays satisfying to subsets, not necessarily disjoint, of the input data set. The algorithm has been developed in Java, allowing its integration in free and proprietary computational chemistry software due to acceptable performance. Fingerprint patterns extracted by the algorithm from molecule data sets are organized in a hierarchical structure where the nodes at each level can be used as a cluster for the classification of the data set. In this paper, we analyze the usefulness of this tree structure and we describe the advantages in the analysis of data sets regarding MCS-based tree structure. Moreover, the use of fingerprint patterns is compared with other representational spaces in the building of QSAR models, showing better results.

  • 出版日期2013-6-15

全文