摘要

Histone modifications play a crucial role in regulating gene expression and cell lineage determination and maintenance at the epigenetic level. To systematically investigate this phenomenon, this paper presented a statistical hybrid clustering algorithm to identify common combinatorial histone modification patterns. We applied the algorithm to 39 histone modification marks in human CD4+ T cells and detected 854 common combinatorial histone modification patterns. Our results could cover 211 (76.17%) patterns among 277 patterns identified by the tandem mass spectrometry experiments. Based on the frequency statistical analysis, it was found that the co-occurrence frequencies of 20 backbone modifications are greater than or close to 0.2 in the 854 patterns, we also found that 15 modifications (H2BK120ac, H4K91ac, H2BK20ac, etc.), three histone acetylations (H2AK9ac, H4K16ac, and H4K12ac) and five histone methylations (H3K79me1, H3K79me2, 3K79me3, H4K20me1, and H2BK5me1) were most likely prone to coexist respectively in these patterns. In addition, we found that DNA methylation tends to combine with histone acetylation rather than histone methylation.