A case study for cloud based high throughput analysis of NGS data using the globus genomics system

作者:Bhuvaneshwar Krithika; Sulakhe Dinanath; Gauba Robinder; Rodriguez Alex; Madduri Ravi; Dave Utpal; Lacinski Lukasz; Foster Ian; Gusev Yuriy; Madhavan Subha*
来源:Computational and Structural Biotechnology Journal, 2015, 13: 64-74.
DOI:10.1016/j.csbj.2014.11.001

摘要

Next generation sequencing (NGS) technologies produce massive amounts of data requiring a powerful computational infrastructure, high quality bioinformatics software, and skilled personnel to operate the tools. We present a case study of a practical solution to this data management and analysis challenge that simplifies terabyte scale data handling and provides advanced tools for NGS data analysis. These capabilities are implemented using the "Globus Genomics" system, which is an enhanced Galaxy workflow system made available as a service that offers users the capability to process and transfer data easily, reliably and quickly to address end-to-endNGS analysis requirements. The Globus Genomics system is built on Amazon's cloud computing infrastructure. The system takes advantage of elastic scaling of compute resources to run multiple workflows in parallel and it also helps meet the scale-out analysis needs of modern translational genomics research. (C) 2014 Bhuvaneshwar et al. Published by Elsevier B.V. on behalf of the Research Network of Computational and Structural Biotechnology. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/3.0/).

  • 出版日期2015