Post-transcriptional processing generates a diversity of 5′-modified long and short RNAs (original) (raw)
- Letter
- Published: 25 January 2009
Nature volume 457, pages 1028–1032 (2009)Cite this article
- 4217 Accesses
- 4 Altmetric
- Metrics details
Abstract
The transcriptomes of eukaryotic cells are incredibly complex. Individual non-coding RNAs dwarf the number of protein-coding genes, and include classes that are well understood as well as classes for which the nature, extent and functional roles are obscure1. Deep sequencing of small RNAs (<200 nucleotides) from human HeLa and HepG2 cells revealed a remarkable breadth of species. These arose both from within annotated genes and from unannotated intergenic regions. Overall, small RNAs tended to align with CAGE (cap-analysis of gene expression) tags2, which mark the 5′ ends of capped, long RNA transcripts. Many small RNAs, including the previously described promoter-associated small RNAs3, appeared to possess cap structures. Members of an extensive class of both small RNAs and CAGE tags were distributed across internal exons of annotated protein coding and non-coding genes, sometimes crossing exon–exon junctions. Here we show that processing of mature mRNAs through an as yet unknown mechanism may generate complex populations of both long and short RNAs whose apparently capped 5′ ends coincide. Supplying synthetic promoter-associated small RNAs corresponding to the c-MYC transcriptional start site reduced MYC messenger RNA abundance. The studies presented here expand the catalogue of cellular small RNAs and demonstrate a biological impact for at least one class of non-canonical small RNAs.
This is a preview of subscription content, access via your institution
Access options
Subscribe to this journal
Receive 51 print issues and online access
$199.00 per year
only $3.90 per issue
Buy this article
- Purchase on SpringerLink
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout
Additional access options:
Similar content being viewed by others
Accession codes
Primary accessions
Gene Expression Omnibus
Data deposits
Sequences generated during this study have been deposited in GEO under accession number GSE14362.
References
- Birney, E. et al. Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature 447, 799–816 (2007)
ADS CAS Google Scholar - Shiraki, T. et al. Cap analysis gene expression for high-throughput analysis of transcriptional starting point and identification of promoter usage. Proc. Natl Acad. Sci. USA 100, 15776–15781 (2003)
Article ADS CAS Google Scholar - Kapranov, P., Willingham, A. T. & Gingeras, T. R. Genome-wide transcription and the implications for genomic organization. Nature Rev. Genet. 8, 413–423 (2007)
Article CAS Google Scholar - Kapranov, P. et al. RNA maps reveal new RNA classes and a possible function for pervasive transcription. Science 316, 1484–1488 (2007)
Article ADS CAS Google Scholar - Mardis, E. R. The impact of next-generation sequencing technology on genetics. Trends Genet. 24, 133–141 (2008)
Article CAS Google Scholar - Eddy, S. The UCSC Genome Browser <http://www.genome.ucsc.edu/cgi-bin/hgTables?db=hg18&hgta_group=genes&hgta_track=rnaGene&hgta_table=rnaGene&hgta_doSchema=describe+table+schema> (2006)
- Yang, J. H. et al. snoSeeker: an advanced computational package for screening of guide and orphan snoRNA genes in the human genome. Nucleic Acids Res. 34, 5112–5123 (2006)
Article CAS Google Scholar - Huttenhofer, A. et al. RNomics: an experimental approach that identifies 201 candidates for novel, small, non-messenger RNAs in mouse. EMBO J. 20, 2943–2953 (2001)
Article CAS Google Scholar - Kawaji, H. et al. CAGE Basic/Analysis Databases: the CAGE resource for comprehensive promoter analysis. Nucleic Acids Res. 34, D632–D636 (2006)
Article CAS Google Scholar - Carninci, P. et al. Genome-wide analysis of mammalian promoter architecture and evolution. Nature Genet. 38, 626–635 (2006)
Article CAS Google Scholar - Berget, S. M. Exon recognition in vertebrate splicing. J. Biol. Chem. 270, 2411–2414 (1995)
Article CAS Google Scholar - Olofsson, S. O. & Boren, J. Apolipoprotein B: a clinically important apolipoprotein which assembles atherogenic lipoproteins and promotes the development of atherosclerosis. J. Intern. Med. 258, 395–410 (2005)
Article CAS Google Scholar - Rada-Iglesias, A. et al. Whole-genome maps of USF1 and USF2 binding and histone H3 acetylation reveal new aspects of promoter structure and candidate genes for common human disorders. Genome Res. 18, 380–392 (2008)
Article CAS Google Scholar - Bochnig, P., Reuter, R., Bringmann, P. & Luhrmann, R. A monoclonal antibody against 2,2,7-trimethylguanosine that reacts with intact, class U, small nuclear ribonucleoproteins as well as with 7-methylguanosine-capped RNAs. Eur. J. Biochem. 168, 461–467 (1987)
Article CAS Google Scholar - Morris, K. V., Chan, S. W., Jacobsen, S. E. & Looney, D. J. Small interfering RNA-induced transcriptional gene silencing in human cells. Science 305, 1289–1292 (2004)
ADS CAS Google Scholar - Ting, A. H., Schuebel, K. E., Herman, J. G. & Baylin, S. B. Short double-stranded RNA induces transcriptional gene silencing in human cancer cells in the absence of DNA methylation. Nature Genet. 37, 906–910 (2005)
Article CAS Google Scholar - Janowski, B. A. et al. Activating gene expression in mammalian cells with promoter-targeted duplex RNAs. Nature Chem. Biol. 3, 166–173 (2007)
Article CAS Google Scholar - Mattick, J. S. RNA regulation: a new genetics? Nature Rev. Genet. 5, 316–323 (2004)
Article CAS Google Scholar - Mattick, J. S. Challenging the dogma: the hidden layer of non-protein-coding RNAs in complex organisms. Bioessays 25, 930–939 (2003)
Article CAS Google Scholar - Willingham, A. T. et al. A strategy for probing the function of noncoding RNAs finds a repressor of NFAT. Science 309, 1570–1573 (2005)
Article ADS CAS Google Scholar
Acknowledgements
We thank L. Cardone, D. Rebolini, M. Kramer, and W. R. McCombie for Illumina sequencing. We wish to thank J. Brosius, J. Schmitz and T. Rozhdestvensky for their help with the small RNA cloning protocol and J. Dumais for technical assistance. K.F.-T. was in part supported by the Schering Foundation. This work was supported in part by grants from the NIH and was performed as part of the ENCODE consortium (G.J.H. and T.R.G.). G.J.H is an investigator of the Howard Hughes Medical Institute.
Author Contributions K.F.-T. and P.K. performed experiments in collaboration with E.D., V.S., R.D. and A.T.W. P.K., S.F., R.S. and G.A. performed data analysis. G.J.H. and T.R.G. planned experiments and wrote the paper.
Author information
Author notes
- Katalin Fejes-Toth and Philipp Kapranov: These authors contributed equally to this work.
Authors and Affiliations
- Watson School of Biological Sciences,,
Katalin Fejes-Toth, Vihra Sotirova, Ravi Sachidanandam, Gordon Assaf, Gregory J. Hannon & Thomas R. Gingeras - Howard Hughes Medical Institute, Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, New York 11724, USA.,
Katalin Fejes-Toth, Vihra Sotirova, Gordon Assaf & Gregory J. Hannon - Affymetrix, Inc. Santa Clara, California 95051, USA.,
Philipp Kapranov, Sylvain Foissac, Aarron T. Willingham, Radha Duttagupta, Erica Dumais & Thomas R. Gingeras
Consortia
Affymetrix/Cold Spring Harbor Laboratory ENCODE Transcriptome Project
Cold Spring Harbor Laboratory
- Katalin Fejes-Toth
- , Vihra Sotirova
- , Ravi Sachidanandam
- , Gordon Assaf
- & Gregory J. Hannon
Affymetrix
- Philipp Kapranov
- , Sylvain Foissac
- , Aarron T. Willingham
- , Radha Duttagupta
- , Erica Dumais
- & Thomas R. Gingeras
Corresponding authors
Correspondence toGregory J. Hannon or Thomas R. Gingeras.
Ethics declarations
Competing interests
R.D. is an employee of Affymetrix, which manufactures the tiling arrays used in the study.
Additional information
Lists of participants and their affiliations appear at the end of the paper.
Supplementary information
Supplementary Information
This file contains Supplementary Methods, Supplementary References, Supplementary Figures S1-S3 with Legends and Supplementary Tables S1-S2 (PDF 1208 kb)
PowerPoint slides
Rights and permissions
About this article
Cite this article
Affymetrix/Cold Spring Harbor Laboratory ENCODE Transcriptome Project. Post-transcriptional processing generates a diversity of 5′-modified long and short RNAs.Nature 457, 1028–1032 (2009). https://doi.org/10.1038/nature07759
- Received: 16 September 2008
- Accepted: 02 January 2009
- Published: 25 January 2009
- Issue Date: 19 February 2009
- DOI: https://doi.org/10.1038/nature07759
This article is cited by
Editorial Summary
Small RNAs: mining the gaps
The transcriptomes of eukaryotic cells are unexpectedly complex, with virtually the entire non-repeat portions of many genomes being transcribed. This paper from the Affymetrix/Cold Spring Harbor Laboratory ENCODE Transcriptome Project reports a deep sequencing study revealing that a remarkable breadth of RNA species is produced both from within annotated genes and from unannotated intergenic regions in human cells. Importantly, many of these small RNAs possess cap structures and appear to be processed from mature mRNAs resulting in populations of long and short RNAs with capped 5 ends that coincide. Transfecting synthetic PASRs (promoter-associated short RNAs) corresponding to the c-MYC transcriptional start site reduced MYC mRNA abundance, demonstrating a biological impact for at least one class of these small RNAs.