Identification of alternative topological domains in chromatin - PubMed (original) (raw)
Identification of alternative topological domains in chromatin
Darya Filippova et al. Algorithms Mol Biol. 2014.
Abstract
Chromosome conformation capture experiments have led to the discovery of dense, contiguous, megabase-sized topological domains that are similar across cell types and conserved across species. These domains are strongly correlated with a number of chromatin markers and have since been included in a number of analyses. However, functionally-relevant domains may exist at multiple length scales. We introduce a new and efficient algorithm that is able to capture persistent domains across various resolutions by adjusting a single scale parameter. The ensemble of domains we identify allows us to quantify the degree to which the domain structure is hierarchical as opposed to overlapping, and our analysis reveals a pronounced hierarchical structure in which larger stable domains tend to completely contain smaller domains. The identified novel domains are substantially different from domains reported previously and are highly enriched for insulating factor CTCF binding and histone marks at the boundaries.
Keywords: Alternative topological domains; Chromatin conformation capture; Dynamic programming.
Figures
Figure 1
Interaction matrix for a portion of human chromosome 1 from a recent Hi-C experiment by Dixon et al. [5]. Each axis represents a location on the chromosome with a step of 40kbp. Densely interacting domains identified by the method of Dixon et al. are shown in red. Alternative domains are shown as dotted black lines on the upper triangular portion of the matrix. Visual inspection of the lower triangular portion suggests domains could be completely nested within another and highly overlapping when compared to Dixon et al.’s domains. This motivates the problem of identifying alternative domains across length scales.
Figure 2
Our approach identifies densely interacting domains across scales. (a) Our algorithm discovers domains with mean frequency value for inter- and intra-domain interactions (solid lines) at or better than that of Dixon et al. domains (dotted lines). Each solid line represents domains at different resolution γ in human fibroblast cells. (b) Multiscale domains identified in human fibroblast cells by our dynamic program tend to have higher mean frequency than those of Dixon et al. (distributions are plotted after outliers >μ+4_σ_ were removed).
Figure 3
Domain sizes and count across resolutions. The domain sizes increase and the domain count decreases as the resolution parameter drops. Above: plotted are maximum (red), average (blue), and minimum (green) domain size averaged over all chromosomes for the domains on human fibroblast cells (IMR90). The magenta line shows the average domain size for domains reported by Dixon et al. Below: the number of domains increases for higher values of resolution parameter. The magenta line displays domain count for Dixon et al.
Figure 4
Domain persistence across scales. (a) Domains identified by our algorithm (black) are smaller at higher resolutions and merge to form larger domains at γ close to 0. Visual inspection shows qualitative differences between consensus domains (red) and domains reported by Dixon et al. (green). Data shown for the first 4Mb of chromosome 1. (b) Variation of information for domains identified by our algorithm across different resolutions for chromosome 1 in human fibroblast cells.
Figure 5
Comparison of Dixon et al.’s domain set with the multiscale consensus set for chromosomes 1–22 ( x -axis). We used the variation of information (VI) (_y_-axis) to compute distances between domain sets for the multiscale consensus set vs. Dixon et al. (blue dots) and the multiscale consensus vs. randomly shuffled domains (red diamonds).
Figure 6
Enrichment for chromatin marks and histone modifications in domain boundaries. Enrichment of CTCF binding (a) in IMR90 and (b) in mESC and histone modifications (c), (d) in mESC around domain boundaries for our consensus set of persistent domains (left, blue), and for those identified by Dixon et al. (right, blue). Green lines represent the presence of CTCF at the midpoint of the topological domains.
Figure 7
Domain sets at various resolutions. 10 best optimal and near-optimal solutions for resolutions _γ_=0.5,0.35,0.15,0.10 for a portion of human fibroblast chromosome 20 (IMR90). Variations in the domain assignments within a single γ and across resolutions correspond with visually identifiable, hierarchical regions of dense Hi-C interactions. All histone mark tracks were obtained from IMR90 cells. Plotted with WashU EpiGenome Browser [23].
Similar articles
- Rich Chromatin Structure Prediction from Hi-C Data.
Malik L, Patro R. Malik L, et al. IEEE/ACM Trans Comput Biol Bioinform. 2019 Sep-Oct;16(5):1448-1458. doi: 10.1109/TCBB.2018.2851200. Epub 2018 Jun 28. IEEE/ACM Trans Comput Biol Bioinform. 2019. PMID: 29994683 - Topological domains in mammalian genomes identified by analysis of chromatin interactions.
Dixon JR, Selvaraj S, Yue F, Kim A, Li Y, Shen Y, Hu M, Liu JS, Ren B. Dixon JR, et al. Nature. 2012 Apr 11;485(7398):376-80. doi: 10.1038/nature11082. Nature. 2012. PMID: 22495300 Free PMC article. - Insulator function and topological domain border strength scale with architectural protein occupancy.
Van Bortle K, Nichols MH, Li L, Ong CT, Takenaka N, Qin ZS, Corces VG. Van Bortle K, et al. Genome Biol. 2014 Jun 30;15(6):R82. doi: 10.1186/gb-2014-15-5-r82. Genome Biol. 2014. PMID: 24981874 Free PMC article. - Contribution of Topological Domains and Loop Formation to 3D Chromatin Organization.
Ea V, Baudement MO, Lesne A, Forné T. Ea V, et al. Genes (Basel). 2015 Jul 27;6(3):734-50. doi: 10.3390/genes6030734. Genes (Basel). 2015. PMID: 26226004 Free PMC article. Review. - Making Sense of the Tangle: Insights into Chromatin Folding and Gene Regulation.
Chung IM, Ketharnathan S, Kim SH, Thiruvengadam M, Rani MK, Rajakumar G. Chung IM, et al. Genes (Basel). 2016 Sep 23;7(10):71. doi: 10.3390/genes7100071. Genes (Basel). 2016. PMID: 27669308 Free PMC article. Review.
Cited by
- Machine and Deep Learning Methods for Predicting 3D Genome Organization.
Wall BPG, Nguyen M, Harrell JC, Dozmorov MG. Wall BPG, et al. Methods Mol Biol. 2025;2856:357-400. doi: 10.1007/978-1-0716-4136-1_22. Methods Mol Biol. 2025. PMID: 39283464 Review. - Nucleoporin Elys attaches peripheral chromatin to the nuclear pores in interphase nuclei.
Doronin SA, Ilyin AA, Kononkova AD, Solovyev MA, Olenkina OM, Nenasheva VV, Mikhaleva EA, Lavrov SA, Ivannikova AY, Simonov RA, Fedotova AA, Khrameeva EE, Ulianov SV, Razin SV, Shevelyov YY. Doronin SA, et al. Commun Biol. 2024 Jun 29;7(1):783. doi: 10.1038/s42003-024-06495-w. Commun Biol. 2024. PMID: 38951619 Free PMC article. - Enhancing Hi-C contact matrices for loop detection with Capricorn: a multiview diffusion model.
Fang T, Liu Y, Woicik A, Lu M, Jha A, Wang X, Li G, Hristov B, Liu Z, Xu H, Noble WS, Wang S. Fang T, et al. Bioinformatics. 2024 Jun 28;40(Suppl 1):i471-i480. doi: 10.1093/bioinformatics/btae211. Bioinformatics. 2024. PMID: 38940142 Free PMC article. - A comprehensive benchmarking with interpretation and operational guidance for the hierarchy of topologically associating domains.
Xu J, Xu X, Huang D, Luo Y, Lin L, Bai X, Zheng Y, Yang Q, Cheng Y, Huang A, Shi J, Bo X, Gu J, Chen H. Xu J, et al. Nat Commun. 2024 May 23;15(1):4376. doi: 10.1038/s41467-024-48593-7. Nat Commun. 2024. PMID: 38782890 Free PMC article. - Genome structural dynamics: insights from Gaussian network analysis of Hi-C data.
Banerjee A, Zhang S, Bahar I. Banerjee A, et al. Brief Funct Genomics. 2024 Sep 27;23(5):525-537. doi: 10.1093/bfgp/elae014. Brief Funct Genomics. 2024. PMID: 38654598 Free PMC article.
References
LinkOut - more resources
Full Text Sources
Other Literature Sources