A hybrid approach to increase the informedness of CE-based data using locus-specific thresholding and machine learning

Marciano Michael A<sup>*</sup>; Williamson Victoria R; Adelman Jonathan D

doi:10.1016/j.fsigen.2018.03.017

Abstract

The interpretation of genetic profiles requires a robust and reliable method to discriminate true allelic information from noise, regardless of the instrumentation or methods used. Traditionally, static peak detection thresholds (analytical thresholds) have been applied to capillary electrophoresis (CE) data to distinguish true allelic peaks from noise. Although these rigid thresholds attempt to account conservatively for baseline variability across instrument runs, samples, capillaries, dye channels, injection times, and voltages, a static threshold cannot adapt, so allelic information that falls below it is lost. The method described herein accounts for this variability by jointly minimizing the incorrect detection of non-allelic artifacts (false positives) and the threshold-induced dropout of true allelic information (false negatives). This is accomplished by using a dynamic locus- and sample-specific analytical threshold together with a machine learning-derived probabilistic artifact detection model. The system produced an allele detection accuracy of 97.2%, an 11.4% increase over the lowest static threshold (50 RFU), with a low incidence of incorrectly identified artifacts (0.79%). This adaptive method outperformed static thresholds in the retention of allelic information content at minimal cost.
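
The idea of a locus- and sample-specific threshold can be illustrated with a minimal sketch. The abstract does not give the authors' formula, so the code below assumes a common stand-in: the threshold for each locus is the mean baseline noise plus `k` standard deviations, computed from that sample's own noise. The function names, the value of `k`, the locus noise values, and the peak heights are all hypothetical, and the machine learning artifact model is not represented.

```python
from statistics import mean, stdev

def locus_threshold(noise_rfu, k=3.0):
    """Assumed rule (not the paper's): mean + k * SD of baseline noise in RFU."""
    if len(noise_rfu) < 2:
        raise ValueError("need at least two noise observations")
    return mean(noise_rfu) + k * stdev(noise_rfu)

def call_peaks(peaks, noise_by_locus, k=3.0):
    """Keep peaks whose height exceeds the threshold of their own locus."""
    thresholds = {
        locus: locus_threshold(noise, k)
        for locus, noise in noise_by_locus.items()
    }
    return [p for p in peaks if p["height"] > thresholds[p["locus"]]]

# Hypothetical baseline noise (RFU) for two loci in one sample.
noise_by_locus = {
    "D3S1358": [4, 5, 6, 5, 4, 6],
    "FGA": [10, 12, 14, 11, 13, 12],
}

# Hypothetical candidate peaks, all below a static 50 RFU threshold.
peaks = [
    {"locus": "D3S1358", "height": 30},
    {"locus": "D3S1358", "height": 7},
    {"locus": "FGA", "height": 15},
    {"locus": "FGA", "height": 40},
]

called = call_peaks(peaks, noise_by_locus)
print([p["height"] for p in called])  # [30, 40]
```

In this toy data a static 50 RFU threshold would discard all four peaks, while the per-locus thresholds (about 7.7 RFU and 16.2 RFU) retain the two peaks that stand clearly above their local baseline.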

Publication date: 2018-07

