摘要

Because of the advantages of EST-SSR markers, it has been employed as powerful markers for genetic diversity analysis, comparative mapping and phylogenetic studies. In this study, a total of 429,869 tobacco (Nicotiana tabacum L.) ESTs were downloaded from the public databases, which offers an opportunity to identify SSRs in ESTs by data mining, and 38,165 SSRs were identified from 379,967 uni-ESTs with the frequency of one SSR per 5.52 kb. Mono- and tri-nucleotide repeat motifs were the dominant repeat types, accounting for 40.53 and 34.51% of all SSRs, respectively. After eliminating mononucleotide-containing sequences, 86 pairs of primers were designed to amplify in four tobacco accessions. Only 15 primers (17.44%) showed polymorphism, and then they were further used to assess genetic diversity of 20 tobacco accessions. Unweighted pair-group method with arithmetic average dendrograms (UPGMA) and principal coordinates analysis plots (PCA) revealed genetic differentiation between N. rustica and N. tabacum, and between oriental tobacco and other accessions of N. tabacum. The present study reported the development of EST-SSR markers in tobacco by exploiting EST databases, and confirmed the effective way to develop markers. These EST-SSRs can serve in studies on cultivar identification, genetic diversity analysis, and genetics in tobacco.