Analysis of 2,297 expressed sequence tags (ESTs) from a cDNA library of flax (Linum ustitatissimum L.) bark tissue

作者:Long, Song-Hua; Deng, Xin; Wang, Yu-Fu; Li, Xiang; Qiao, Rui-Qing; Qiu, Cai-Sheng; Guo, Yuan; Hao, Dong-Mei; Jia, Wan-Qi; Chen, Xin-Bo*
来源:Molecular Biology Reports, 2012, 39(5): 6289-6296.
DOI:10.1007/s11033-012-1450-1

摘要

Bast fibre crops are the second most important natural fibre crops following cotton. Of these, flax (Linum ustitatissimum L.) is the most widely planted in the world, with its fibre used for high quality linen textile. A cDNA library of flax bark tissues was constructed with the purpose of identifying genes involved in the Bast fibre development. A total of 2,297 unigene sequences were obtained from 3,200 randomly selected clones of the cDNA library. These sequences were grouped into 155 clusters and 2,142 singletons, which have been submitted to the GenBank databases. By putative functional annotation, 23.3% of these sequences were similar to known proteins in GenBank, 44.0% of these sequences were similar to unknown proteins, and 32.7% of these sequences showed no significant similarity to any other protein sequences in existing databases. Classified by the Gene Ontology, 24.8, 23.1 and 14.3% were assigned to molecular function, biological process, and cellular component GO terms, respectively. By further bioinformatics approaches, about 110 ESTs matched cell wall related genes in the MAIZEWALL database, representing 16 functional categories of all 19 categories, of which, the most abundant category was protein synthesis. Based on the PlantTFDB database, 39 of the 64 transcription factor families in the Arabidopsis thaliana genome were identified as being involved in flax cell wall formation. The sequences and bioinformatics analysis data generated in this paper will be useful for gene expression, cloning and genetic engineering studies to characterize bast fibre development and improve the properties of the bast fibres.