The Landscape of long noncoding RNA classification - PubMed (original) (raw)
Review
The Landscape of long noncoding RNA classification
Georges St Laurent et al. Trends Genet. 2015 May.
Abstract
Advances in the depth and quality of transcriptome sequencing have revealed many new classes of long noncoding RNAs (lncRNAs). lncRNA classification has mushroomed to accommodate these new findings, even though the real dimensions and complexity of the noncoding transcriptome remain unknown. Although evidence of functionality of specific lncRNAs continues to accumulate, conflicting, confusing, and overlapping terminology has fostered ambiguity and lack of clarity in the field in general. The lack of fundamental conceptual unambiguous classification framework results in a number of challenges in the annotation and interpretation of noncoding transcriptome data. It also might undermine integration of the new genomic methods and datasets in an effort to unravel the function of lncRNA. Here, we review existing lncRNA classifications, nomenclature, and terminology. Then, we describe the conceptual guidelines that have emerged for their classification and functional annotation based on expanding and more comprehensive use of large systems biology-based datasets.
Keywords: annotation of long non-coding RNAs; classification of long non-coding RNAs; function of long non-coding RNAs; lincRNA; lncRNA; long non-coding RNA; systems biology; transcriptome; vlincRNA.
Copyright © 2015 Elsevier Ltd. All rights reserved.
Figures
Figure 1. Schematic diagram illustrating various classes of ncRNAs
Three hypothetical loci are shown. Protein coding exons are shown as green (locus 1) or yellow boxes (locus 3). Locus 2 signifies a pseudogene of locus 1. Regulatory regions of locus 1 are shown in purple (promoter) and magenta (enhancer). Repeats are denoted by brown boxes. Lines with arrows represent ncRNAs. CAR: chromatin-associated RNA. ceRNA: Competing endogenous. RNA ciRNA: chromatin-interlinking RNA (grey) or circular intronic RNA (green). ecircRNA: exonic circular RNA. eRNA: enhancer-associated RNA. lincRNA: long intervening non-coding RNA. ncRNA-a: activating non-coding RNAs. PALR: promoter-associated long RNA. PIN: partially intronic RNA. TIN: totally intronic RNA. TSSa-RNA: transcription start site-associated RNA. T-UCR: Transcribed Ultraconserved Regions. uaRNA: 3′UTR-derived RNAs. vlincRNA: very long intergenic non-coding RNA. The role depicted here for CARs and ciRNAs in stabilizing a chromatin loop is hypothetical.
Figure 2. Properties of different published lists of human transcripts representing various classes of ncRNAs
Sequence conservation was defined by the conserved elements from the Vertebrate Multiz Alignment & Conservation (100 Species) from the UCSC Browser [147]. Relative conservation represents the fraction of conserved bases relative to the total lengths for each list of ncRNAs. Relative mass and expression levels represent averages of several malignant and normal tissues profiled using single-molecule RNA-seq analysis [5, 29]. Only uniquely aligning non-rRNA and non-chrM reads were considered. Relative mass represents proportion of reads mapping to a particular genomic element relative to all reads. The relative expression is the relative mass divided by the total length of each list and normalized to the relative expression of coding exons (defined by UCSC Genes). Promoter-associated RNAs were defined by the regions 3 kb upstream of annotated start sites of UCSC Genes. Given the lack of a comprehensive list of standalone human intronic RNAs, we extrapolated the relative mass of those based on mouse data [50]. The GENCODE annotations [33] are based on v19.
Figure 3. Outline of the consolidated conceptual framework of ncRNA classification
Highly accurate empirical RNA-seq data drives both annotation and quantification of the longest ncRNA (Tier 1) and of processed ncRNA species (Tier 2) across the entire genome. The quantitation data serves as the basis for the combined global matrix of knowledge of expression of each (coding and non-coding) RNA gene and transcript across multiple biological sources (Tier 3). This information provides the input for the functional annotation of non-coding transcripts using systems biology approaches. Mapping of RNA modifications provides the final layer of knowledge in this scheme.
Figure 4. A genomic view of the 8q24 region upstream of the human MYC gene
This clinically-important locus containing many GWAS hits associated with several cancers represents an example of a genomic region that could clearly benefit from the new annotation scheme. The RNAseq analysis reveals fairly strong signal on both strands covering most of this >1Mbp region. Yet, the known lncRNA annotations represent only a small fraction of this locus and judging by the distribution of the RNAseq signal and known promoters, are likely part of much larger transcript units (for example vlincRNAs shown on the figure). Transcriptome RNA-seq data is represented by the polyA- nuclear RNA from normal epidermal keratinocytes (NHEK) and embryonic stem cells (H1) generated by the ENCODE consortium [3]. In addition, vlincRNAs [29], promoters [32], and disease-associated variants from genome-wide association studies [148] (GWAS) are shown. Reproduced with permission from [12].
Similar articles
- Evolutionary analysis across mammals reveals distinct classes of long non-coding RNAs.
Chen J, Shishkin AA, Zhu X, Kadri S, Maza I, Guttman M, Hanna JH, Regev A, Garber M. Chen J, et al. Genome Biol. 2016 Feb 2;17:19. doi: 10.1186/s13059-016-0880-9. Genome Biol. 2016. PMID: 26838501 Free PMC article. - Structural and Functional Annotation of Long Noncoding RNAs.
Smith MA, Mattick JS. Smith MA, et al. Methods Mol Biol. 2017;1526:65-85. doi: 10.1007/978-1-4939-6613-4_4. Methods Mol Biol. 2017. PMID: 27896736 - Evolutionary annotation of conserved long non-coding RNAs in major mammalian species.
Bu D, Luo H, Jiao F, Fang S, Tan C, Liu Z, Zhao Y. Bu D, et al. Sci China Life Sci. 2015 Aug;58(8):787-98. doi: 10.1007/s11427-015-4881-9. Epub 2015 Jun 27. Sci China Life Sci. 2015. PMID: 26117828 - Strategies to Annotate and Characterize Long Noncoding RNAs: Advantages and Pitfalls.
Cao H, Wahlestedt C, Kapranov P. Cao H, et al. Trends Genet. 2018 Sep;34(9):704-721. doi: 10.1016/j.tig.2018.06.002. Epub 2018 Jul 17. Trends Genet. 2018. PMID: 30017313 Review. - Evolution to the rescue: using comparative genomics to understand long non-coding RNAs.
Ulitsky I. Ulitsky I. Nat Rev Genet. 2016 Oct;17(10):601-14. doi: 10.1038/nrg.2016.85. Epub 2016 Aug 30. Nat Rev Genet. 2016. PMID: 27573374 Review.
Cited by
- Long non‑coding RNA HCG11 suppresses the malignant phenotype of non‑small cell lung cancer cells by targeting a miR‑875/SATB2 axis.
Su Z, Chen M, Ding R, Shui L, Zhao Q, Luo W. Su Z, et al. Mol Med Rep. 2021 Aug;24(2):552. doi: 10.3892/mmr.2021.12191. Epub 2021 Jun 3. Mol Med Rep. 2021. PMID: 34080031 Free PMC article. - The Tetraodon nigroviridis reference transcriptome: developmental transition, length retention and microsynteny of long non-coding RNAs in a compact vertebrate genome.
Basu S, Hadzhiev Y, Petrosino G, Nepal C, Gehrig J, Armant O, Ferg M, Strahle U, Sanges R, Müller F. Basu S, et al. Sci Rep. 2016 Sep 15;6:33210. doi: 10.1038/srep33210. Sci Rep. 2016. PMID: 27628538 Free PMC article. - Landscape of associations between long non-coding RNAs and infiltrating immune cells in liver hepatocellular carcinoma.
Li L, Song X, Lv Y, Jiang Q, Fan C, Huang D. Li L, et al. J Cell Mol Med. 2020 Oct;24(19):11243-11253. doi: 10.1111/jcmm.15690. Epub 2020 Sep 10. J Cell Mol Med. 2020. PMID: 32910548 Free PMC article. - FOXK2 transcription factor and its roles in tumorigenesis (Review).
Wang Z, Liu X, Wang Z, Hu Z. Wang Z, et al. Oncol Lett. 2022 Nov 3;24(6):461. doi: 10.3892/ol.2022.13581. eCollection 2022 Dec. Oncol Lett. 2022. PMID: 36380871 Free PMC article. Review. - Approaches to Identify and Characterise the Post-Transcriptional Roles of lncRNAs in Cancer.
Carter JM, Ang DA, Sim N, Budiman A, Li Y. Carter JM, et al. Noncoding RNA. 2021 Mar 9;7(1):19. doi: 10.3390/ncrna7010019. Noncoding RNA. 2021. PMID: 33803328 Free PMC article. Review.
References
- Kapranov P, et al. Large-scale transcriptional activity in chromosomes 21 and 22. Science. 2002;296:916–919. - PubMed
- Okazaki Y, et al. Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs. Nature. 2002;420:563–573. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
- R01 NS081208/NS/NINDS NIH HHS/United States
- RC2 AG036596/AG/NIA NIH HHS/United States
- MH084880/MH/NIMH NIH HHS/United States
- R21 DA035592/DA/NIDA NIH HHS/United States
- R01 MH084880/MH/NIMH NIH HHS/United States
- R01 NS063974/NS/NINDS NIH HHS/United States
- NS071674/NS/NINDS NIH HHS/United States
- R01 MH083733/MH/NIMH NIH HHS/United States
- DA035592/DA/NIDA NIH HHS/United States
- P50 NS071674/NS/NINDS NIH HHS/United States
LinkOut - more resources
Full Text Sources
Other Literature Sources