Darned in 2013: inclusion of model organisms and linking with Wikipedia (original) (raw)

Journal Article

,

1Biochemistry Department, University College Cork, Cork, Ireland and 2School of Biomedical Engineering, Institute of Technology-Banaras Hindu University, Varanasi, 221005, India

Search for other works by this author on:

,

1Biochemistry Department, University College Cork, Cork, Ireland and 2School of Biomedical Engineering, Institute of Technology-Banaras Hindu University, Varanasi, 221005, India

Search for other works by this author on:

,

1Biochemistry Department, University College Cork, Cork, Ireland and 2School of Biomedical Engineering, Institute of Technology-Banaras Hindu University, Varanasi, 221005, India

Search for other works by this author on:

1Biochemistry Department, University College Cork, Cork, Ireland and 2School of Biomedical Engineering, Institute of Technology-Banaras Hindu University, Varanasi, 221005, India

*To whom correspondence should be addressed. Tel: +353 21 4205419; Fax:

+353 21 4205462

; Email: p.baranov@ucc.ie

Search for other works by this author on:

Received:

07 September 2012

Accepted:

23 September 2012

Published:

15 October 2012

Cite

Anmol M. Kiran, John J. O'Mahony, Komal Sanjeev, Pavel V. Baranov, Darned in 2013: inclusion of model organisms and linking with Wikipedia, Nucleic Acids Research, Volume 41, Issue D1, 1 January 2013, Pages D258–D261, https://doi.org/10.1093/nar/gks961
Close

Navbar Search Filter Mobile Enter search term Search

Abstract

DARNED (DAtabase of RNa EDiting, available at http://darned.ucc.ie) is a centralized repository of reference genome coordinates corresponding to RNA nucleotides having altered templated identities in the process of RNA editing. The data in DARNED are derived from published datasets of RNA editing events. RNA editing instances have been identified with various methods, such as bioinformatics screenings, deep sequencing and/or biochemical techniques. Here we report our current progress in the development and expansion of the DARNED. In addition to novel database features the DARNED update describes inclusion of Drosophila melanogaster and Mus musculus RNA editing events and the launch of a community-based annotation in the RNA WikiProject.

INTRODUCTION

The Database of RNA editing (DARNED) was created in 2010 as an effort to provide a centralized repository for observable RNA editing events in the Homo sapiens transcriptome. The initial release of the database provided access to ∼42 000 coordinates in the human reference genome which have been identified or predicted to undergo RNA editing process upon transcription (1).

RNA editing results in localized alterations in RNA sequences relative to their genomic templates. It is ubiquitous to almost all known life forms (2–8). A-to-I RNA editing (Deamination of Adenosine to Inosine) is the most common type of editing in organisms with a developed central nervous system (CNS) (2,9–13). Inosine is recognized as Guanosine by ribosomes and reverse transcriptase due to its base pairing with Cytidine (14). In H. sapiens, A-to-I editing dominates in transcripts from the repetitive regions of the genome, especially from Alu elements (15–17). However, it also occurs in non-repetitive regions and affects sequences of small non-coding RNAs (18–20) and pre-mRNAs. The latter event can generate alternative transcript variants by affecting splice junctions (21,22) or synthesize alternative protein isoforms by non-synonymous codon substitutions in coding regions of mRNA (23–25). In addition to A-to-I editing, a few examples of C-to-U RNA editing (Deamination Cytidine to Uridine) are also well-established in humans (26–28). Recently the existence of all other 10 possible discrepancies between RNA and DNA has also been reported (28). This study generated a major controversy, as many follow-up analyses demonstrated that the dataset contains large number of false positives (29–32). Therefore, these data are not included in the current update of DARNED. Nonetheless, RNA:DNA discrepancies (other than A-to-G and C-to-U) can be found within more confident datasets (33–35) which are included into the update. The absence of plausible mechanisms for RNA editing corresponding to these discrepancies combined with the presence of false positives even in high-confident datasets suggests that corresponding editing events are spurious. However, because we cannot exclude a possibility that these events are real, we felt that their inclusion in DARNED is important and appropriate. We hope that this will stimulate exploration of the true nature of these discrepancies.

In addition to improving DARNED interface, we expanded the database by including RNA editing data from mouse and flies (36,37). Inequalities in the functional significance of particular RNA editing events prompted us to start detailed annotation of events whose functions are well-established. Wikipedia was chosen as a platform for this purpose due to its popularity and convenience for community-based annotations (38). Integration of Wikipedia with DARNED through bidirectional hyperlinking is another novel feature of DARNED.

NOVEL DATABASE INTERFACE

Now the user can choose whether to search RNA editing in H. sapiens, in Drosophila melanogaster or in M us musculus. As in the original version of the database, the search can be performed either by specifying the genomic coordinate range or the RefGene/RefSeq IDs (1). To facilitate new sequence-based search, we integrated the BLAST engine (39) which allows the retrieval of the genomic regions based on its similarity to a user-specified query sequence. The BLAST output is parsed and combined with the data on known RNA editing sites (Figure 1). We presume that this feature is particularly useful for researchers who are interested in exploring RNA editing in species closely related to those that are in DARNED. The sequence-based search is also useful when a user explores known RNA editing sites in transcripts whose genomic templates are unknown or difficult to map, e.g., when the sequence shares similarity with multiple regions of the genome.

Preview of sequence-based search results with highlighted known RNA editing locations in the RNA–DNA alignment.

Figure 1.

Preview of sequence-based search results with highlighted known RNA editing locations in the RNA–DNA alignment.

The DARNED database is linked to the UCSC Genome Browser (40) and ENSEMBL (except for hg18 and mm9 assemblies) (41), so that each RNA editing event could be explored in conjunction with annotation tracks within these genome browsers. The DARNED data from the previous update are also directly available in the UCSC browser. ENSEMBL receives up-to-date date DARNED information through a dedicated Distributed Annotation System (DAS) server (42) for all assemblies, see AVAILABILITY AND LICENSE section.

Past experiences have demonstrated that available EST data are not strongly associated with RNA editing events and are often misleading—for most coordinates in DARNED, EST data are not available. ESTs derived from ultra-edited RNAs could not be aligned to the genome without special pre-processing (43). Not surprisingly, no correlation was found between occurrence of bona fide RNA editing sited and mismatches in EST alignments (44). Our aim is to provide users with the most relevant information in the simplest form as possible, therefore, EST data have been removed. For this reason, EST ID-based searches cannot be performed in a current version of DARNED. However, users can still obtain information on potential RNA editing sites matching particular EST sequences by performing a sequence-based search in DARNED.

NEW DATA

The newly available human-related data were collected from literature-published since the original launch of the DARNED database (33–36,43–48). As described in the next paragraphs, the data were generated with a variety of methods. At present, we do not provide any information on reliability of the methods, current version of DARNED provides access for the reported sites without estimation of their trustworthiness.

Sakurai et al. (44) developed a new technique named Inosine Chemical Erasing (ICE) to convert Inosines to N1-cyanoethylinosines in RNAs by treating it with acrylonitril using Michael addition reaction. This modification inhibits Watson–Crick base pairing of Inosine with Cytidine, which leads to truncated cDNA synthesis from edited RNA by reverse transcriptase. Comparison of the chromatograms for cDNA from acrylonitrile-treated RNA pool with normal RNA provides visual evidence of RNA editing. He et al. (45) used a novel computational approach to predict possible A-to-I editing by finding mRNA/EST and reference genome discrepancies in humans for selected tissues. Carmi et al. (43) also used a novel computational approach to predict possible RNA editing cases. They used >250 bp mRNA/ESTs high-quality sequences which were not aligned to the human reference genome. These unaligned sequences were realigned with the genomic sequence after masking EST and genomic sequences (by converting all A–G in genomic sequence) using MEGABLAST for all possible strand combinations. Original sequences were recovered for masked ESTs with good alignment, and mismatches were examined with statistical filters (43). Ramaswami et al. (35) applied a computational pipeline to predict RNA editing using RNA and DNA deep sequencing data from the same samples. Close to 150 000 RNA editing sites were predicted for lymphoblastoid cell line GM12878 and close to 500 000 RNA editing sites were predicted as a result of re-examination of Peng et al. data (34) who originally reported about 22 000 cases. Graveley et al. (36) and Rodriguez et al. (49) reported ∼2300 RNA editing sites in D. melanogaster. In mice, several studies (37,48,50,51) reported ∼8500 sites, combined.

LAUNCH OF A COMMUNITY-BASED ANNOTATION

Like a mutation, an RNA editing event may have different effects on the fitness of the organism. In H. sapiens, most of the known RNA editing events occur in repetitive regions of the genome and presumably have little or no effect on fitness. Such sites are often termed promiscuous RNA editing sites (13,52). However, a number of RNA editing events are known to play distinctive functional roles by generating protein isoforms that cannot be directly derived from the genomic sequences by the means of standard transcription, processing and translation (53,54). Apparent difference in the significance of promiscuous and recoding RNA editing sites justifies unequal attention to these sites by researchers and consequently requires differential annotation of such edits. Recoding RNA editing demands highly elaborated annotations. However, the incorporation of such annotations directly into DARNED would lead to substantial non-uniformity. To solve the problem we decided to use Wikipedia (38) as an external media for the annotation of recoding RNA editing sites with bidirectional links to DARNED. We have generated Wikipedia subsection entries on RNA editing for 16 genes and for Alu repeats, see Table 1. Given the success of community-based annotation in other databases, such as Rfam (55) and the overall growth of the RNA WikiProject (56), we hope that RNA editing representation on Wikipedia will expand and improve.

Table 1.

Annotated genes and corresponding hyperlinks to Wikipedia

Table 1.

Annotated genes and corresponding hyperlinks to Wikipedia

IMPLEMENTATION

The updated DARNED is designed using Python–Django web framework, with MySQL as the back-end database and published using Apache web server.

FUTURE PLANS

RNA editing datasets are highly heterogeneous due to the diversity of the methods used for the identification of RNA editing sites. The degree of confidence for particular RNA editing events varies not only between datasets, but also within datasets, e.g. due to differences in sequence depth. Besides incorporation of novel data into DARNED, the most important future goal of DARNED is the development of a grading scheme to measure confidence of RNA editing sites that will display the confidence rating of RNA editing sites. Other plans include annotating functional RNA editing cases through Wikipedia and engaging the scientific community in this process.

AVAILABILITY AND LICENSE

DARNED usage and redistribution is governed by Creative Commons Attribute-Non-commercial-Share Alike License. The Database is freely accessible at http://darned.ucc.ie. DARNED DAS servers for all species and assemblies are available at http://darned.ucc.ie:8000/.

FUNDING

Science Foundation Ireland Principle Investigator Award [06/IN.1/B81 to P.V.B.]; Wellcome Trust [094423 to P.V.B.]. Funding for open access charge: Wellcome Trust [094423 to P.V.B.].

Conflict of interest statement. None declared.

ACKNOWLEDGEMENTS

We are grateful to Paul Gardner for introducing us to WikiProject RNA and to all DARNED users for their critical suggestions and encouragement. We also thank Audrey Michel and Patrick O’Connor for their help during preparation of this manuscript, as well as many DARNED users for their useful suggestions. The development of the database was supported by Science Foundation Ireland.

REFERENCES

1

DARNED: a DAtabase of RNa EDiting in humans

,

Bioinformatics

,

2010

, vol.

26

(pg.

1772

-

1776

)

2

Functions and regulation of RNA editing by ADAR deaminases

,

Ann. Rev. Biochem.

,

2010

, vol.

79

(pg.

321

-

349

)

3

A pilot study of bacterial genes with disrupted ORFs reveals a surprising profusion of protein sequence recoding mediated by ribosomal frameshifting and transcriptional realignment

,

Mol. Biol. Evol.

,

2011

, vol.

28

(pg.

3195

-

3211

)

4

The process of RNA editing in plant mitochondria

,

Mitochondrion

,

2008

, vol.

8

(pg.

35

-

46

)

5

Uridine insertion/deletion editing in trypanosomes: a playground for RNA-guided information transfer

,

Wiley Interdiscip. Rev. RNA

,

2011

, vol.

2

(pg.

669

-

685

)

6

Complex management: RNA editing in trypanosomes

,

Trends Biochem. Sci.

,

2005

, vol.

30

(pg.

97

-

105

)

7

A new Ebola virus nonstructural glycoprotein expressed through RNA editing

,

J. Virol.

,

2011

, vol.

85

(pg.

5406

-

5414

)

8

Non-templated addition of nucleotides to the 3′ end of nascent RNA during RNA editing in Physarum

,

EMBO J.

,

2001

, vol.

20

(pg.

1405

-

1414

)

9

The many roles of an RNA editor

,

Nat. Rev. Genet.

,

2001

, vol.

2

(pg.

869

-

878

)

10

Enhancing non-coding RNA information content with ADAR editing

,

Neurosci. Lett.

,

2009

, vol.

466

(pg.

89

-

98

)

11

Adenosine-to-inosine RNA editing shapes transcriptome diversity in primates

,

Proc. Natl Acad. Sci. USA

,

2010

, vol.

107

(pg.

12174

-

12179

)

12

RNA editing, DNA recoding and the evolution of human cognition

,

Trends Neurosci.

,

2008

, vol.

31

(pg.

227

-

233

)

13

Identification of A-to-I RNA editing: dotting the i's in the human transcriptome

,

Biochem. Biokhimiia

,

2011

, vol.

76

(pg.

915

-

923

)

14

Studies on the base pairing properties of deoxyinosine by solid phase hybridisation to oligonucleotides

,

Nucleic Acids Res.

,

1994

, vol.

22

(pg.

131

-

136

)

15

Widespread A-to-I RNA editing of Alu-containing mRNAs in the human transcriptome

,

PLoS Biol.

,

2004

, vol.

2

pg.

e391

16

Widespread RNA editing of embedded Alu elements in the human transcriptome

,

Genome Res.

,

2004

, vol.

14

(pg.

1719

-

1725

)

17

et al.

Systematic identification of abundant A-to-I editing sites in the human transcriptome

,

Nat. Biotechnol.

,

2004

, vol.

22

(pg.

1001

-

1005

)

18

RNA editing of human microRNAs

,

Genome Biol.

,

2006

, vol.

7

pg.

R27

19

Frequency and fate of microRNA editing in human brain

,

Nucleic Acids Res.

,

2008

, vol.

36

(pg.

5270

-

5280

)

20

Modulation of microRNA processing and expression through RNA editing by ADAR deaminases

,

Nat. Struct. Mol. Biol.

,

2006

, vol.

13

(pg.

13

-

21

)

21

RNA editing and alternative splicing: the importance of co-transcriptional coordination

,

EMBO Rep.

,

2006

, vol.

7

(pg.

303

-

307

)

22

Regulation of alternative splicing by RNA editing

,

Nature

,

1999

, vol.

399

(pg.

75

-

80

)

23

Evolutionarily conserved human targets of adenosine to inosine RNA editing

,

Nucleic Acids Res.

,

2005

, vol.

33

(pg.

1162

-

1168

)

24

Genome-wide identification of human RNA editing sites by parallel DNA capturing and sequencing

,

Science

,

2009

, vol.

324

(pg.

1210

-

1213

)

25

Screening of human SNP database identifies recoding sites of A-to-I RNA editing

,

RNA

,

2008

, vol.

14

(pg.

2074

-

2085

)

26

A novel form of tissue-specific RNA processing produces apolipoprotein-B48 in intestine

,

Cell

,

1987

, vol.

50

(pg.

831

-

840

)

27

et al.

Apolipoprotein B-48 is the product of a messenger RNA with an organ-specific in-frame stop codon

,

Science

,

1987

, vol.

238

(pg.

363

-

366

)

28

C-to-U RNA editing: mechanisms leading to genetic diversity

,

J. Biol. Chem.

,

2003

, vol.

278

(pg.

1395

-

1398

)

29

Comment on “Widespread RNA and DNA sequence differences in the human transcriptome”

,

Science

,

2012

, vol.

335

1302; author reply 1302

30

Comment on “Widespread RNA and DNA sequence differences in the human transcriptome”

,

Science

,

2012

, vol.

335

1302; author reply 1302

31

Comment on “Widespread RNA and DNA sequence differences in the human transcriptome”

,

Science

,

2012

, vol.

335

1302; author reply 1302

32

Very few RNA and DNA sequence differences in the human transcriptome

,

PLoS One

,

2011

, vol.

6

pg.

e25842

33

et al.

Extensive genomic and transcriptional diversity identified through massively parallel DNA and RNA sequencing of eighteen Korean individuals

,

Nat. Genet.

,

2011

, vol.

43

(pg.

745

-

752

)

34

et al.

Comprehensive analysis of RNA-Seq data reveals extensive RNA editing in a human transcriptome

,

Nat. Biotechnol.

,

2012

, vol.

30

(pg.

253

-

260

)

35

Accurate identification of human Alu and non-Alu RNA editing sites

,

Nat. Methods

,

2012

, vol.

9

(pg.

579

-

581

)

36

et al.

The developmental transcriptome of Drosophila melanogaster

,

Nature

,

2011

, vol.

471

(pg.

473

-

479

)

37

High levels of RNA-editing site conservation amongst 15 laboratory mouse strains

,

Genome Biol.

,

2012

, vol.

13

pg.

r26

38

Making your database available through Wikipedia: the pros and cons

,

Nucleic Acids Res.

,

2012

, vol.

40

(pg.

D9

-

D12

)

39

Basic local alignment search tool

,

J. Mol. Biol.

,

1990

, vol.

215

(pg.

403

-

410

)

40

et al.

The UCSC Genome Browser database: extensions and updates 2011

,

Nucleic Acids Res.

,

2012

, vol.

40

(pg.

D918

-

D923

)

41

et al.

Ensembl 2012

,

Nucleic Acids Res.

,

2012

, vol.

40

(pg.

D84

-

D90

)

42

ProServer: a simple, extensible Perl DAS server

,

Bioinformatics

,

2007

, vol.

23

(pg.

1568

-

1570

)

43

Identification of widespread ultra-edited human RNAs

,

PLoS Genet.

,

2011

, vol.

7

pg.

e1002317

44

Inosine cyanoethylation identifies A-to-I RNA editing sites in the human transcriptome

,

Nat. Chem. Biol.

,

2010

, vol.

6

(pg.

733

-

740

)

45

Computational detection and functional analysis of human tissue-specific A-to-I RNA editing

,

PLoS One

,

2011

, vol.

6

pg.

e18129

46

et al.

Altered adenosine-to-inosine RNA editing in human cancer

,

Genome Res.

,

2007

, vol.

17

(pg.

1586

-

1595

)

47

Accurate identification of A-to-I RNA editing in human by transcriptome sequencing

,

Genome Res.

,

2012

, vol.

22

(pg.

142

-

150

)

48

RNA editing level in the mouse is determined by the genomic repeat repertoire

,

RNA

,

2006

, vol.

12

(pg.

1802

-

1809

)

49

Nascent-seq indicates widespread cotranscriptional RNA editing in Drosophila

,

Mol. Cell

,

2012

, vol.

47

(pg.

27

-

37

)

50

Canonical A-to-I and C-to-U RNA editing is enriched at 3′UTRs and microRNA target sites in multiple mouse tissues

,

PLoS One

,

2012

, vol.

7

pg.

e33720

51

Transcriptome-wide sequencing reveals numerous APOBEC1 mRNA-editing targets in transcript 3′ UTRs

,

Nat. Struct. Mol. Biol.

,

2011

, vol.

18

(pg.

230

-

236

)

52

Elucidating the inosinome: global approaches to adenosine-to-inosine RNA editing

,

Nat. Rev. Genet.

,

2011

, vol.

12

(pg.

81

-

85

)

53

Breaking the central dogma by RNA editing

,

Chem. Rev.

,

2006

, vol.

106

(pg.

3397

-

3411

)

54

Posttranscriptional recoding by RNA editing

,

Adv. Prot. Chem. Struct. Biol.

,

2012

, vol.

86

(pg.

193

-

224

)

55

et al.

Rfam: Wikipedia, clans and the “decimal” release

,

Nucleic Acids Res.

,

2011

, vol.

39

(pg.

D141

-

D145

)

56

The RNA WikiProject: community annotation of RNA families

,

RNA

,

2008

, vol.

14

(pg.

2462

-

2464

)

© The Author(s) 2012. Published by Oxford University Press.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial reuse, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com.

I agree to the terms and conditions. You must accept the terms and conditions.

Submit a comment

Name

Affiliations

Comment title

Comment

You have entered an invalid code

Thank you for submitting a comment on this article. Your comment will be reviewed and published at the journal's discretion. Please check for further notifications by email.

Citations

Views

Altmetric

Metrics

Total Views 1,674

1,340 Pageviews

334 PDF Downloads

Since 12/1/2016

Month: Total Views:
December 2016 2
January 2017 2
February 2017 8
March 2017 1
May 2017 2
June 2017 6
July 2017 9
August 2017 5
September 2017 3
October 2017 8
November 2017 20
December 2017 31
January 2018 10
February 2018 20
March 2018 23
April 2018 15
May 2018 9
June 2018 8
July 2018 31
August 2018 18
September 2018 8
October 2018 15
November 2018 28
December 2018 11
January 2019 17
February 2019 15
March 2019 25
April 2019 26
May 2019 16
June 2019 27
July 2019 18
August 2019 27
September 2019 29
October 2019 28
November 2019 20
December 2019 19
January 2020 27
February 2020 16
March 2020 19
April 2020 16
May 2020 12
June 2020 30
July 2020 20
August 2020 28
September 2020 23
October 2020 17
November 2020 30
December 2020 23
January 2021 9
February 2021 23
March 2021 23
April 2021 19
May 2021 20
June 2021 13
July 2021 9
August 2021 19
September 2021 17
October 2021 20
November 2021 15
December 2021 18
January 2022 10
February 2022 7
March 2022 13
April 2022 28
May 2022 7
June 2022 20
July 2022 22
August 2022 33
September 2022 22
October 2022 36
November 2022 22
December 2022 18
January 2023 18
February 2023 16
March 2023 16
April 2023 10
May 2023 11
June 2023 25
July 2023 20
August 2023 9
September 2023 22
October 2023 20
November 2023 8
December 2023 20
January 2024 16
February 2024 22
March 2024 13
April 2024 24
May 2024 30
June 2024 13
July 2024 25
August 2024 19
September 2024 23
October 2024 30

Citations

86 Web of Science

×

Email alerts

Citing articles via

More from Oxford Academic