摘要

In recent years, HMP has provided analytical insights into the human microbiome to gain better insights into its effects on human health. Applying insights for therapeutic and biotechnological applications requires researchers efficiently classify the vast amount of microbes which comprise the microbiome. Since these datasets are sparse and complex in nature, the application of dimensionality reduction algorithms is a popular way for extracting latent phylogenetic themes. We introduce an Augmented Barnes-Hut t-SNE method, which is both more efficient in processing time and sensitive to subtle albeit meaningful variations in microbial classifications based on 5 high-level anatomical regions. We demonstrate that our method not only separates the microbiome into these regions, but can further elucidate them into 18 separate sample sites. It is contended that this approach can accurately resolve phylogenetic themes at varying levels of granularity, and anticipate its application in other research domains where complex high-dimensional datasets are prevalent.

全文