Post-transcriptional processing generates a diversity of 5′-modified long and short RNAs (original) (raw)

Nature volume 457, pages 1028–1032 (2009)Cite this article

Abstract

The transcriptomes of eukaryotic cells are incredibly complex. Individual non-coding RNAs dwarf the number of protein-coding genes, and include classes that are well understood as well as classes for which the nature, extent and functional roles are obscure1. Deep sequencing of small RNAs (<200 nucleotides) from human HeLa and HepG2 cells revealed a remarkable breadth of species. These arose both from within annotated genes and from unannotated intergenic regions. Overall, small RNAs tended to align with CAGE (cap-analysis of gene expression) tags2, which mark the 5′ ends of capped, long RNA transcripts. Many small RNAs, including the previously described promoter-associated small RNAs3, appeared to possess cap structures. Members of an extensive class of both small RNAs and CAGE tags were distributed across internal exons of annotated protein coding and non-coding genes, sometimes crossing exon–exon junctions. Here we show that processing of mature mRNAs through an as yet unknown mechanism may generate complex populations of both long and short RNAs whose apparently capped 5′ ends coincide. Supplying synthetic promoter-associated small RNAs corresponding to the c-MYC transcriptional start site reduced MYC messenger RNA abundance. The studies presented here expand the catalogue of cellular small RNAs and demonstrate a biological impact for at least one class of non-canonical small RNAs.

This is a preview of subscription content, access via your institution

Access options

Subscribe to this journal

Receive 51 print issues and online access

$199.00 per year

only $3.90 per issue

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Additional access options:

Similar content being viewed by others

Accession codes

Primary accessions

Gene Expression Omnibus

Data deposits

Sequences generated during this study have been deposited in GEO under accession number GSE14362.

References

  1. Birney, E. et al. Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature 447, 799–816 (2007)
    ADS CAS Google Scholar
  2. Shiraki, T. et al. Cap analysis gene expression for high-throughput analysis of transcriptional starting point and identification of promoter usage. Proc. Natl Acad. Sci. USA 100, 15776–15781 (2003)
    Article ADS CAS Google Scholar
  3. Kapranov, P., Willingham, A. T. & Gingeras, T. R. Genome-wide transcription and the implications for genomic organization. Nature Rev. Genet. 8, 413–423 (2007)
    Article CAS Google Scholar
  4. Kapranov, P. et al. RNA maps reveal new RNA classes and a possible function for pervasive transcription. Science 316, 1484–1488 (2007)
    Article ADS CAS Google Scholar
  5. Mardis, E. R. The impact of next-generation sequencing technology on genetics. Trends Genet. 24, 133–141 (2008)
    Article CAS Google Scholar
  6. Eddy, S. The UCSC Genome Browser <http://www.genome.ucsc.edu/cgi-bin/hgTables?db=hg18&hgta_group=genes&hgta_track=rnaGene&hgta_table=rnaGene&hgta_doSchema=describe+table+schema> (2006)
  7. Yang, J. H. et al. snoSeeker: an advanced computational package for screening of guide and orphan snoRNA genes in the human genome. Nucleic Acids Res. 34, 5112–5123 (2006)
    Article CAS Google Scholar
  8. Huttenhofer, A. et al. RNomics: an experimental approach that identifies 201 candidates for novel, small, non-messenger RNAs in mouse. EMBO J. 20, 2943–2953 (2001)
    Article CAS Google Scholar
  9. Kawaji, H. et al. CAGE Basic/Analysis Databases: the CAGE resource for comprehensive promoter analysis. Nucleic Acids Res. 34, D632–D636 (2006)
    Article CAS Google Scholar
  10. Carninci, P. et al. Genome-wide analysis of mammalian promoter architecture and evolution. Nature Genet. 38, 626–635 (2006)
    Article CAS Google Scholar
  11. Berget, S. M. Exon recognition in vertebrate splicing. J. Biol. Chem. 270, 2411–2414 (1995)
    Article CAS Google Scholar
  12. Olofsson, S. O. & Boren, J. Apolipoprotein B: a clinically important apolipoprotein which assembles atherogenic lipoproteins and promotes the development of atherosclerosis. J. Intern. Med. 258, 395–410 (2005)
    Article CAS Google Scholar
  13. Rada-Iglesias, A. et al. Whole-genome maps of USF1 and USF2 binding and histone H3 acetylation reveal new aspects of promoter structure and candidate genes for common human disorders. Genome Res. 18, 380–392 (2008)
    Article CAS Google Scholar
  14. Bochnig, P., Reuter, R., Bringmann, P. & Luhrmann, R. A monoclonal antibody against 2,2,7-trimethylguanosine that reacts with intact, class U, small nuclear ribonucleoproteins as well as with 7-methylguanosine-capped RNAs. Eur. J. Biochem. 168, 461–467 (1987)
    Article CAS Google Scholar
  15. Morris, K. V., Chan, S. W., Jacobsen, S. E. & Looney, D. J. Small interfering RNA-induced transcriptional gene silencing in human cells. Science 305, 1289–1292 (2004)
    ADS CAS Google Scholar
  16. Ting, A. H., Schuebel, K. E., Herman, J. G. & Baylin, S. B. Short double-stranded RNA induces transcriptional gene silencing in human cancer cells in the absence of DNA methylation. Nature Genet. 37, 906–910 (2005)
    Article CAS Google Scholar
  17. Janowski, B. A. et al. Activating gene expression in mammalian cells with promoter-targeted duplex RNAs. Nature Chem. Biol. 3, 166–173 (2007)
    Article CAS Google Scholar
  18. Mattick, J. S. RNA regulation: a new genetics? Nature Rev. Genet. 5, 316–323 (2004)
    Article CAS Google Scholar
  19. Mattick, J. S. Challenging the dogma: the hidden layer of non-protein-coding RNAs in complex organisms. Bioessays 25, 930–939 (2003)
    Article CAS Google Scholar
  20. Willingham, A. T. et al. A strategy for probing the function of noncoding RNAs finds a repressor of NFAT. Science 309, 1570–1573 (2005)
    Article ADS CAS Google Scholar

Download references

Acknowledgements

We thank L. Cardone, D. Rebolini, M. Kramer, and W. R. McCombie for Illumina sequencing. We wish to thank J. Brosius, J. Schmitz and T. Rozhdestvensky for their help with the small RNA cloning protocol and J. Dumais for technical assistance. K.F.-T. was in part supported by the Schering Foundation. This work was supported in part by grants from the NIH and was performed as part of the ENCODE consortium (G.J.H. and T.R.G.). G.J.H is an investigator of the Howard Hughes Medical Institute.

Author Contributions K.F.-T. and P.K. performed experiments in collaboration with E.D., V.S., R.D. and A.T.W. P.K., S.F., R.S. and G.A. performed data analysis. G.J.H. and T.R.G. planned experiments and wrote the paper.

Author information

Author notes

  1. Katalin Fejes-Toth and Philipp Kapranov: These authors contributed equally to this work.

Authors and Affiliations

  1. Watson School of Biological Sciences,,
    Katalin Fejes-Toth, Vihra Sotirova, Ravi Sachidanandam, Gordon Assaf, Gregory J. Hannon & Thomas R. Gingeras
  2. Howard Hughes Medical Institute, Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, New York 11724, USA.,
    Katalin Fejes-Toth, Vihra Sotirova, Gordon Assaf & Gregory J. Hannon
  3. Affymetrix, Inc. Santa Clara, California 95051, USA.,
    Philipp Kapranov, Sylvain Foissac, Aarron T. Willingham, Radha Duttagupta, Erica Dumais & Thomas R. Gingeras

Consortia

Affymetrix/Cold Spring Harbor Laboratory ENCODE Transcriptome Project

Corresponding authors

Correspondence toGregory J. Hannon or Thomas R. Gingeras.

Ethics declarations

Competing interests

R.D. is an employee of Affymetrix, which manufactures the tiling arrays used in the study.

Additional information

Lists of participants and their affiliations appear at the end of the paper.

Supplementary information

Supplementary Information

This file contains Supplementary Methods, Supplementary References, Supplementary Figures S1-S3 with Legends and Supplementary Tables S1-S2 (PDF 1208 kb)

PowerPoint slides

Rights and permissions

About this article

Cite this article

Affymetrix/Cold Spring Harbor Laboratory ENCODE Transcriptome Project. Post-transcriptional processing generates a diversity of 5′-modified long and short RNAs.Nature 457, 1028–1032 (2009). https://doi.org/10.1038/nature07759

Download citation

This article is cited by

Editorial Summary

Small RNAs: mining the gaps

The transcriptomes of eukaryotic cells are unexpectedly complex, with virtually the entire non-repeat portions of many genomes being transcribed. This paper from the Affymetrix/Cold Spring Harbor Laboratory ENCODE Transcriptome Project reports a deep sequencing study revealing that a remarkable breadth of RNA species is produced both from within annotated genes and from unannotated intergenic regions in human cells. Importantly, many of these small RNAs possess cap structures and appear to be processed from mature mRNAs resulting in populations of long and short RNAs with capped 5 ends that coincide. Transfecting synthetic PASRs (promoter-associated short RNAs) corresponding to the c-MYC transcriptional start site reduced MYC mRNA abundance, demonstrating a biological impact for at least one class of these small RNAs.

Associated content