A comprehensive bioinformatics analysis on multiple Gene Expression Omnibus datasets of nonalcoholic fatty liver disease and nonalcoholic steatohepatitis

作者:Huang, Shanzhou; Sun, Chengjun; Hou, Yuchen; Tang, Yunhua; Zhu, Zebin; Zhang, Zhiheng; Zhang, Yixi; Wang, Linhe; Zhao, Qiang; Chen, Mao-Gen; Guo, Zhiyong; Wang, Dongping; Ju, Weiqiang; Zhou, Qi; Wu, Linwei*; He, Xiaoshun*
来源:Scientific Reports, 2018, 8(1): 7630.
DOI:10.1038/s41598-018-25658-4

摘要

Fatty liver disease is one of the leading causes of chronic damage in western countries. Approximately 25% of adults in the United States have fatty livers in the absence of excessive alcohol consumption, a condition termed nonalcoholic fatty liver disease (NAFLD). Little is known about the prevalence and genetic background of NAFLD or the factors that determine its development. In this study, we used the Gene-Cloud of Biotechnology Information bioinformatics platform to carry out a comprehensive bioinformatics analysis identifying differentially expressed genes (DEGs), key biological processes and intersecting pathways. We imported 3 Gene Expression Omnibus datasets (GSE31803, GSE49541, and GSE63067). Then, we assessed the expression of the DEGs in clinical samples. We found that CD24 was the only gene co-expressed in all 3 datasets. "Glycolysis/gluconeogenesis", "p53 signaling pathway" and "glycine, serine and threonine metabolism" were 3 common pathways related to the fatty liver process. In NAFLD tissues, CD24, COL1A1, LUM, THBS2 and EPHA3 were upregulated, and PZP was downregulated. CD24 is a core gene among these DEGs and have not yet been studied of its impact on NAFLD. Co-expressed genes, common biological processes and intersecting pathways identified in the study might play an important role in NAFLD progression. Further studies are needed to elucidate the mechanism of these potential genes and pathways in NAFLD.