Archiving genome data

作者:Gruetz R*; Mathieu N; Loehnhardt B; Weil P; Krawczak M
来源:Medizinische Genetik, 2013, 25(3): 388-394.
DOI:10.1007/s11825-013-0403-y

摘要

In view of the increasing amount of data arising from genome research, efficient research data management is becoming increasingly important in this domain. The third, and last, article of the series on "Research data management for genome data" describes the general lifecycle of research data-from their planning via the selection and inclusion into storage facilities to preservation measures and final user access. Archives play an important role in nearly all phases of this life cycle, which renders them an important component of genome data processing. Three exemplary public archives for genome data are introduced: the European Molecular Biology Laboratory (EMBL) databank, the Sequence Read Archive, and the Trace Archive. Owing to the high level of specialization of these institutions, however, additional archives are required that allow more generic data storage or, alternatively, easy extension to other genome data types. A generic concept for such archives will be described and recommendations given for their practical implementation.

全文