Sequence comparison of human and mouse genes reveals a homologous block structure in the promoter regions - PubMed (original) (raw)
Comparative Study
Sequence comparison of human and mouse genes reveals a homologous block structure in the promoter regions
Yutaka Suzuki et al. Genome Res. 2004 Sep.
Abstract
Comparative sequence analysis was carried out for the regions adjacent to experimentally validated transcriptional start sites (TSSs), using 3324 pairs of human and mouse genes. We aligned the upstream putative promoter sequences over the 1-kb proximal regions and found that the sequence conservation could not be further extended at, on average, 510 bp upstream positions of the TSSs. This discontinuous manner of the sequence conservation revealed a "block" structure in about one-third of the putative promoter regions. Consistently, we also observed that G+C content and CpG frequency were significantly different inside and outside the blocks. Within the blocks, the sequence identity was uniformly 65% regardless of their length. About 90% of the previously characterized transcription factor binding sites were located within those blocks. In 46% of the blocks, the 5' ends were bounded by interspersed repetitive elements, some of which may have nucleated the genomic rearrangements. The length of the blocks was shortest in the promoters of genes encoding transcription factors and of genes whose expression patterns are brain specific, which suggests that the evolutional diversifications in the transcriptional modulations should be the most marked in these populations of genes.
Figures
Figure 1
Sequence identity between human and mouse PPRs. Sequence alignments were calculated using LALIGN with the default parameters. The sequence identity was evaluated as the number of aligned nucleotides in the regions of -1000 to +200 (TSS: 0). The average sequence identities were calculated for each region (A). (B) The PPRs were separated into the 200-bp windows at the positions indicated in the inset. Sequence identity was calculated for each of the windows. Frequency as to which of the windows belong to which of the sequence identity groups represented on the horizontal axis is plotted.
Figure 2
Sequence alignments of the block structure in PPRs and nongenic regions. (A) Frequency of the blocks belonging to each population is shown. (B) Relation between length of the block and the average sequence identity within it. (C) Relation between percentile position within the block and the average sequence identity. (D) Alignment of the nongenic sequences using LALIGN. The sequences ranging from -1 kb to + 200 bp of the putative syntenic regions located in nongenic regions as in UCSC genome browser were aligned and the frequencies of the aligned nucleotides were calculated at each of the positions. Vertical line represents the frequency of the nucleotide at the indicated position being located within the block. (Note that the vertical axis in Figure 1 represents the frequency of the sequence “identity”).
Figure 3
Sequence alignments around the boundary of the block and that of the first intron and the second exon using SSEARCH. (A) Sequences of human and mouse PPRs were aligned using SSEARCH with a 50-bp moving window around the boundary of the block. The broken line represents the boundary of the block calculated using LALIGN. The vertical axis represents the average score of the SSEARCH calculated for the corresponding position. The horizontal axis represents the relative position to the boundary. (B) Result of an analysis similar to that shown in A, using the proximal sequences of the 5′ end of the second exons. The broken line represents the exon-intron boundary. The horizontal axis represents the relative position to the exon-intron boundary.
Similar articles
- Large-scale collection and characterization of promoters of human and mouse genes.
Suzuki Y, Yamashita R, Shirota M, Sakakibara Y, Chiba J, Mizushima-Sugano J, Kel AE, Arakawa T, Carninci P, Kawai J, Hayashizaki Y, Takagi T, Nakai K, Sugano S. Suzuki Y, et al. In Silico Biol. 2004;4(4):429-44. In Silico Biol. 2004. PMID: 15506993 - Identification and functional characterization of the human and murine polo-like kinase (Plk) promoter.
Bräuninger A, Strebhardt K, Rübsamen-Waigmann H. Bräuninger A, et al. Oncogene. 1995 Nov 2;11(9):1793-800. Oncogene. 1995. PMID: 7478607 - Transcriptional regulation of the mouse PNRC2 promoter by the nuclear factor Y (NFY) and E2F1.
Zhou D, Masri S, Ye JJ, Chen S. Zhou D, et al. Gene. 2005 Nov 21;361:89-100. doi: 10.1016/j.gene.2005.07.012. Epub 2005 Sep 21. Gene. 2005. PMID: 16181749 - Systematic analysis of alternative promoters correlated with alternative splicing in human genes.
Ma X, Li-Ling J, Huang Q, Chen X, Hou L, Ma F. Ma X, et al. Genomics. 2009 May;93(5):420-5. doi: 10.1016/j.ygeno.2009.01.008. Epub 2009 Feb 11. Genomics. 2009. PMID: 19442634 - Identification of highly specific localized sequence motifs in human ribosomal protein gene promoters.
Roepcke S, Zhi D, Vingron M, Arndt PF. Roepcke S, et al. Gene. 2006 Jan 3;365:48-56. doi: 10.1016/j.gene.2005.09.033. Epub 2005 Dec 15. Gene. 2006. PMID: 16343812
Cited by
- The Identification of Large Rearrangements Involving Intron 2 of the CDH1 Gene in BRCA1/2 Negative and Breast Cancer Susceptibility.
Ben Aissa-Haj J, Pinheiro H, Cornelis F, Sebai M, Meseure D, Briaux A, Berteaux P, Lefol C, Des Guetz G, Trassard M, Stevens D, Vialard F, Bieche I, Noguès C, Tang R, Oliveira C, Stoppat-Lyonnet D, Lidereau R, Rouleau E. Ben Aissa-Haj J, et al. Genes (Basel). 2022 Nov 25;13(12):2213. doi: 10.3390/genes13122213. Genes (Basel). 2022. PMID: 36553480 Free PMC article. - On the NF-Y regulome as in ENCODE (2019).
Ronzio M, Bernardini A, Pavesi G, Mantovani R, Dolfini D. Ronzio M, et al. PLoS Comput Biol. 2020 Dec 28;16(12):e1008488. doi: 10.1371/journal.pcbi.1008488. eCollection 2020 Dec. PLoS Comput Biol. 2020. PMID: 33370256 Free PMC article. - Magnetic Resonance Imaging for Translational Research in Oncology.
Fiordelisi MF, Cavaliere C, Auletta L, Basso L, Salvatore M. Fiordelisi MF, et al. J Clin Med. 2019 Nov 6;8(11):1883. doi: 10.3390/jcm8111883. J Clin Med. 2019. PMID: 31698697 Free PMC article. Review. - Isoforms of Base Excision Repair Enzymes Produced by Alternative Splicing.
Boldinova EO, Khairullin RF, Makarova AV, Zharkov DO. Boldinova EO, et al. Int J Mol Sci. 2019 Jul 3;20(13):3279. doi: 10.3390/ijms20133279. Int J Mol Sci. 2019. PMID: 31277343 Free PMC article. Review. - Characterization of gene promoters in pig: conservative elements, regulatory motifs and evolutionary trend.
Wei K, Ma L, Zhang T. Wei K, et al. PeerJ. 2019 Jun 25;7:e7204. doi: 10.7717/peerj.7204. eCollection 2019. PeerJ. 2019. PMID: 31275764 Free PMC article.
References
- Batzer, M.A. and Deininger, P.L. 2002. Alu repeats and human genomic diversity. Nat. Rev. Genet. 3: 370-379. - PubMed
- Boguski, M.S. 2002. Comparative genomics: The mouse that roared. Nature 420: 515-516. - PubMed
- Carninci, P. and Hayashizaki, Y. 1999. High-efficiency full-length cDNA cloning. Methods Enzymol. 303: 19-44. - PubMed
- Cross, S.H. and Bird, A.P. 1995. CpG islands and genes. Curr. Opin. Genet. Dev. 5: 309-314. - PubMed
- Deininger, P.L. and Batzer, M.A. 2002. Mammalian retroelements. Genome Res. 12: 1455-1465. - PubMed
WEB SITE REFERENCES
- http://dbtss.hgc.jp/; DBTSS.
- http://fantom.gsc.riken.go.jp/; FANTOM.
- ftp://ftp.virginia.edu/pub/fasta/; SSEARCH.
- http://genome.ucsc.edu/downloads.html; UCSC Genome Browser.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases