Manually curated database of rice proteins

作者:Gour Pratibha; Garg Priyanka; Jain Rashmi; Joseph Shaji V; Tyagi Akhilesh K; Raghuvanshi Saurabh*
来源:Nucleic Acids Research, 2014, 42(D1): D1214-D1221.
DOI:10.1093/nar/gkt1072

摘要

'Manually Curated Database of Rice Proteins' (MCDRP) available at http://www.genomeindia.org/ biocuration is a unique curated database based on published experimental data. Semantic integration of scientific data is essential to gain a higher level of understanding of biological systems. Since the majority of scientific data is available as published literature, text mining is an essential step before the data can be integrated and made available for computer-based search in various databases. However, text mining is a tedious exercise and thus, there is a large gap in the data available in curated databases and published literature. Moreover, data in an experiment can be perceived from several perspectives, which may not reflect in the text-based curation. In order to address such issues, we have demonstrated the feasibility of digitizing the experimental data itself by creating a database on rice proteins based on in-house developed data curation models. Using these models data of individual experiments have been digitized with the help of universal ontologies. Currently, the database has data for over 1800 rice proteins curated from >4000 different experiments of over 400 research articles. Since every aspect of the experiment such as gene name, plant type, tissue and developmental stage has been digitized, experimental data can be rapidly accessed and integrated.

  • 出版日期2014-1