Short blocks from the noncoding parts of the human genome have instances within nearly all known genes and relate to biological processes - PubMed (original) (raw)
Comparative Study
. 2006 Apr 25;103(17):6605-10.
doi: 10.1073/pnas.0601688103. Epub 2006 Apr 24.
Affiliations
- PMID: 16636294
- PMCID: PMC1447521
- DOI: 10.1073/pnas.0601688103
Comparative Study
Short blocks from the noncoding parts of the human genome have instances within nearly all known genes and relate to biological processes
Isidore Rigoutsos et al. Proc Natl Acad Sci U S A. 2006.
Abstract
Using an unsupervised pattern-discovery method, we processed the human intergenic and intronic regions and catalogued all variable-length patterns with identically conserved copies and multiplicities above what is expected by chance. Among the millions of discovered patterns, we found a subset of 127,998 patterns, termed pyknons, which have additional nonoverlapping instances in the untranslated and protein-coding regions of 30,675 transcripts from 20,059 human genes. The pyknons arrange combinatorially in the untranslated and coding regions of numerous human genes where they form mosaics. Consecutive instances of pyknons in these regions show a strong bias in their relative placement, favoring distances of approximately 22 nucleotides. We also found pyknons to be enriched in a statistically significant manner in genes involved in specific processes, e.g., cell communication, transcription, regulation of transcription, signaling, transport, etc. For approximately 1/3 of the pyknons, the intergenic/intronic instances of their reverse complement lie within 380,084 nonoverlapping regions, typically 60-80 nucleotides long, which are predicted to form double-stranded, energetically stable, hairpin-shaped RNA secondary structures; additionally, the pyknons subsume approximately 40% of the known microRNA sequences, thus suggesting a possible link with posttranscriptional gene silencing and RNA interference. Cross-genome comparisons reveal that many of the pyknons have instances in the 3' UTRs of genes from other vertebrates and invertebrates where they are overrepresented in similar biological processes, as in the human genome. These unexpected findings suggest potential unique functional connections between the coding and noncoding parts of the human genome.
Conflict of interest statement
Conflict of interest statement: No conflicts declared.
Figures
Fig. 1.
Pyknons in the 3′ UTRs of the apoptosis inhibitor birc4 (shown above the horizontal line) and nine other genes. The sequences below the line contain some of birc4's pyknons, but in different arrangements; they also contain instances of other pyknons that are not present in birc4's 3′ UTR. The 10 3′ UTRs are pyknon mosaics. The shown pyknons, whether highlighted or in dark gray, have 40 or more instances in the genome's intergenic/intronic regions and additional copies in the untranslated and coding regions of these and other genes. We highlight only those pyknons that appear two or more times in the shown 3′ UTRs. The light gray string -(xx)- indicates that xx nucleotides separate the pyknons that surround it. To appreciate the importance of this picture, it suffices to track the number of copies and relative position of TGCACTCCAGCCTGGG, TAATCCCAGCACTTTGGGA, GGCTGAGGCAGGAGAAT, and GAGGTTGCAGTGAGCC.
Fig. 2.
Probability density functions for the distance between the starting points of consecutive instances of pyknons, shown separately for 5′ UTRs, CRs, and 3′ UTRs. The distributions have long tails, and only a portion is shown. Note the peaks at x = 18, 22, 24, 26, 29, 30, and 31.
Similar articles
- Coding DNA repeated throughout intergenic regions of the Arabidopsis thaliana genome: evolutionary footprints of RNA silencing.
Feng J, Naiman DQ, Cooper B. Feng J, et al. Mol Biosyst. 2009 Dec;5(12):1679-87. doi: 10.1039/b903031j. Epub 2009 Apr 30. Mol Biosyst. 2009. PMID: 19452047 - Human genome connectivity code links disease-associated SNPs, microRNAs and pyknons.
Glinsky GV. Glinsky GV. Cell Cycle. 2009 Mar 15;8(6):925-30. doi: 10.4161/cc.8.6.7937. Epub 2009 Mar 26. Cell Cycle. 2009. PMID: 19229130 - Intron size, abundance, and distribution within untranslated regions of genes.
Hong X, Scofield DG, Lynch M. Hong X, et al. Mol Biol Evol. 2006 Dec;23(12):2392-404. doi: 10.1093/molbev/msl111. Epub 2006 Sep 15. Mol Biol Evol. 2006. PMID: 16980575 - Picking pyknons out of the human genome.
Meynert A, Birney E. Meynert A, et al. Cell. 2006 Jun 2;125(5):836-8. doi: 10.1016/j.cell.2006.05.019. Cell. 2006. PMID: 16751093 Review. - Variation in the Untranslated Genome and Susceptibility to Infections.
Ramsuran V, Ewy R, Nguyen H, Kulkarni S. Ramsuran V, et al. Front Immunol. 2018 Sep 7;9:2046. doi: 10.3389/fimmu.2018.02046. eCollection 2018. Front Immunol. 2018. PMID: 30245696 Free PMC article. Review.
Cited by
- Regulation of Immune Cells by microRNAs and microRNA-Based Cancer Immunotherapy.
Veiga RN, Zambalde ÉP, Cox L, Jucoski TS, Kohler AF, Carvalho TM, Rodrigues AC, Ludwig B, Crowley K, de Oliveira JC, Cortez MA. Veiga RN, et al. Adv Exp Med Biol. 2022;1385:75-108. doi: 10.1007/978-3-031-08356-3_3. Adv Exp Med Biol. 2022. PMID: 36352211 - Identifying common transcriptome signatures of cancer by interpreting deep learning models.
Jha A, Quesnel-Vallières M, Wang D, Thomas-Tikhonenko A, Lynch KW, Barash Y. Jha A, et al. Genome Biol. 2022 May 17;23(1):117. doi: 10.1186/s13059-022-02681-3. Genome Biol. 2022. PMID: 35581644 Free PMC article. - Targeting non-coding RNAs to overcome cancer therapy resistance.
Chen B, Dragomir MP, Yang C, Li Q, Horst D, Calin GA. Chen B, et al. Signal Transduct Target Ther. 2022 Apr 13;7(1):121. doi: 10.1038/s41392-022-00975-3. Signal Transduct Target Ther. 2022. PMID: 35418578 Free PMC article. Review. - The transcriptional trajectories of pluripotency and differentiation comprise genes with antithetical architecture and repetitive-element content.
Telonis AG, Rigoutsos I. Telonis AG, et al. BMC Biol. 2021 Mar 25;19(1):60. doi: 10.1186/s12915-020-00928-8. BMC Biol. 2021. PMID: 33765992 Free PMC article. - Pyknon-Containing Transcripts Are Downregulated in Colorectal Cancer Tumors, and Loss of PYK44 Is Associated With Worse Patient Outcome.
Evangelista AF, de Menezes WP, Berardinelli GN, Dos Santos W, Scapulatempo-Neto C, Guimarães DP, Calin GA, Reis RM. Evangelista AF, et al. Front Genet. 2020 Nov 12;11:581454. doi: 10.3389/fgene.2020.581454. eCollection 2020. Front Genet. 2020. PMID: 33304384 Free PMC article.
References
- Mattick J. S. Nat. Rev. Genet. 2004;5:316–323. - PubMed
- Ruvkun G. Science. 2001;294:797–799. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources