摘要

Background: Variation in the genomes of single-stranded RNA viruses affects their infectivity and pathogenicity in two ways. First, viral genome sequence variations lead to changes in viral protein sequences and activities. Second, viral genome sequence variation produces diversity at the level of nucleotide composition and diversity in the interactions between viral RNAs and host toll-like receptors (TLRs). A viral genome-typing method based on this type of diversity has not yet been established. %26lt;br%26gt;Methodology/Principal Findings: In this study, we propose a novel genomic trait called the %26quot;TLR stimulatory trimer composition%26quot; (TSTC) and two quantitative indicators, Score S and Score N, named %26quot;TLR stimulatory scores%26quot; (TSS). Using the complete genome sequences of 10,994 influenza A viruses (IAV) and 251 influenza B viruses, we show that TSTC analysis reveals the diversity of Score S and Score N among the IAVs isolated from various hosts. In addition, we show that low values of Score S are correlated with high pathogenicity and pandemic potential in IAVs. Finally, we use Score S and Score N to construct a logistic regression model to recognize IAV strains that are highly pathogenic or have high pandemic potential. %26lt;br%26gt;Conclusions/Significance: Results from the TSTC analysis indicate that there are large differences between human and avian IAV genomes (except for segment 3), as illustrated by Score S. Moreover, segments 1, 2, 3 and 4 may be major determinants of the stimulatory activity exerted on human TLRs 7 and 8. We also find that a low Score S value is associated with high pathogenicity and pandemic potential in IAV. The p value from the TSS-derived logistic regression model is useful for recognizing emerging IAVs that have high pathogenicity and pandemic potential.

全文