Sequence Read Archive

The Sequence Read Archive (SRA, previously known as the Short Read Archive) is a bioinformatics database that provides a public repository for DNA sequencing data, especially the "short reads" generated by high-throughput sequencing, which are typically less than 1,000 base pairs in length. The archive is part of the International Nucleotide Sequence Database Collaboration (INSDC) and is run as a collaboration between the National Center for Biotechnology Information (NCBI), the European Bioinformatics Institute (EBI), and the DNA Data Bank of Japan (DDBJ).

Property	Value
dbo:abstract	The Sequence Read Archive (SRA, previously known as the Short Read Archive) is a bioinformatics database that provides a public repository for DNA sequencing data, especially the "short reads" generated by high-throughput sequencing, which are typically less than 1,000 base pairs in length. The archive is part of the International Nucleotide Sequence Database Collaboration (INSDC) and is run as a collaboration between the National Center for Biotechnology Information (NCBI), the European Bioinformatics Institute (EBI), and the DNA Data Bank of Japan (DDBJ). The archive was established by the NCBI in 2007 to provide a repository for data produced by RNA-Seq and ChIP-Seq studies, as well as by large-scale studies including the Human Microbiome Project and the 1000 Genomes Project. Originally called the Short Read Archive, it was renamed in anticipation of future sequencing technologies being able to produce longer sequence reads. The volume of data deposited in the Sequence Read Archive has grown rapidly. As of September 2010, 65% of the SRA was human genomic sequence, with another 16% relating to human metagenome sequence reads. Much of this data was deposited through the 1000 Genomes Project. In June 2011, the data contained within the SRA passed 100 terabases of DNA in volume. The preferred data format for files submitted to the SRA is the BAM format, which is capable of storing both aligned and unaligned reads. Internally, the SRA relies on the NCBI SRA Toolkit, used at all three INSDC member databases, to provide flexible data compression, API access, and conversion to other formats such as FASTQ. In February 2011, NCBI announced a plan to close the NCBI SRA because of a funding reduction. However, EBI and DDBJ announced that they would continue to support the SRA. In October 2011, NCBI announced continuation of funding for the SRA. Deposition of data in the SRA is mandated by most funding agencies and open access journals. Nature Publishing Group journals require that DNA and RNA sequencing data be made available through the SRA. (en)
dbo:thumbnail	wiki-commons:Special:FilePath/Database.png?width=300
dbo:title	Sequence Read Archive (en)
dbo:wikiPageExternalLink	http://trace.ddbj.nig.ac.jp/dra/index_e.html http://trace.ddbj.nig.ac.jp/dra/index_e.shtml http://www.ebi.ac.uk/ena/ http://www.ebi.ac.uk/ena/about/sra_submissions https://www.ncbi.nlm.nih.gov/Traces/sra/ https://www.ncbi.nlm.nih.gov/sra/
dbo:wikiPageID	31713909 (xsd:integer)
dbo:wikiPageLength	7197 (xsd:nonNegativeInteger)
dbo:wikiPageRevisionID	1064268529 (xsd:integer)
dbo:wikiPageWikiLink	dbr:List_of_biological_databases dbr:Nature_Publishing_Group dbr:Base_pairs dbr:Human_Microbiome_Project dbr:DNA_Data_Bank_of_Japan dbr:International_Nucleotide_Sequence_Database_Collaboration dbr:1000_Genomes_Project dbc:Genetics_in_the_United_Kingdom dbr:Application_programming_interface dbr:Data_compression dbr:RNA-Seq dbc:Science_and_technology_in_Cambridgeshire dbr:DNA_sequencing dbr:Database dbc:Genetics_databases dbr:European_Bioinformatics_Institute dbr:File_format dbc:South_Cambridgeshire_District dbr:Binary_Alignment_Map dbr:Bioinformatics dbr:Human_genome dbr:Metagenome dbr:National_Center_for_Biotechnology_Information dbr:ChIP-Seq dbr:FASTQ_format dbr:File:Database.png dbr:BAM_format dbr:Open_access_journals dbr:High-throughput_sequencing dbr:File:History_(and_predicted_future)_size_of_the_Sequence_Read_Archive.svg
dbp:center	dbr:European_Bioinformatics_Institute dbr:National_Center_for_Biotechnology_Information DNA Data Bank of Japan (en)
dbp:description	dbr:FASTQ_format dbr:BAM_format
dbp:logo	dbr:File:Database.png
dbp:organism	all (en)
dbp:title	Sequence Read Archive (en)
dbp:url	http://www.ebi.ac.uk/ena/ https://www.ncbi.nlm.nih.gov/sra/
dbp:wikiPageUsesTemplate	dbt:Infobox_biodatabase dbt:Reflist
dc:description	BAM data FASTQ Sequences
dct:subject	dbc:Genetics_in_the_United_Kingdom dbc:Science_and_technology_in_Cambridgeshire dbc:Genetics_databases dbc:South_Cambridgeshire_District
gold:hypernym	dbr:Database
rdf:type	owl:Thing schema:CreativeWork dbo:Work wikidata:Q386724 dbo:Database dul:InformationObject dbo:BiologicalDatabase
rdfs:comment	The Sequence Read Archive (SRA, previously known as the Short Read Archive) is a bioinformatics database that provides a public repository for DNA sequencing data, especially the "short reads" generated by high-throughput sequencing, which are typically less than 1,000 base pairs in length. The archive is part of the International Nucleotide Sequence Database Collaboration (INSDC) and is run as a collaboration between the National Center for Biotechnology Information (NCBI), the European Bioinformatics Institute (EBI), and the DNA Data Bank of Japan (DDBJ). (en)
rdfs:label	Sequence Read Archive (en)
owl:sameAs	freebase:Sequence Read Archive wikidata:Sequence Read Archive dbpedia-fa:Sequence Read Archive https://global.dbpedia.org/id/4v3Z3
prov:wasDerivedFrom	wikipedia-en:Sequence_Read_Archive?oldid=1064268529&ns=0
foaf:depiction	wiki-commons:Special:FilePath/Database.png wiki-commons:Special:FilePath/History_(and_predicte...size_of_the_Sequence_Read_Archive.svg
foaf:homepage	http://trace.ddbj.nig.ac.jp/dra/index_e.html http://www.ebi.ac.uk/ena/ https://www.ncbi.nlm.nih.gov/sra/
foaf:isPrimaryTopicOf	wikipedia-en:Sequence_Read_Archive
is dbo:wikiPageDisambiguates of	dbr:SRA
is dbo:wikiPageRedirects of	dbr:Short_Read_Archive
is dbo:wikiPageWikiLink of	dbr:List_of_biological_databases dbr:Monoraphidium_neglectum dbr:Investigations_into_the_origin_of_COVID-19 dbr:AMPHORA dbr:BioSamples dbr:Bioinformatics dbr:Bloom_filters_in_bioinformatics dbr:National_Database_for_Autism_Research dbr:SRA dbr:European_Nucleotide_Archive dbr:FASTQ_format dbr:Transcriptomics_technologies dbr:Short_Read_Archive
is foaf:primaryTopic of	wikipedia-en:Sequence_Read_Archive
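
The abstract above notes that archived reads are typically less than 1,000 base pairs long and that data can be converted to FASTQ. As a rough illustration only, the following Python sketch reads FASTQ text and checks read lengths against that threshold. It assumes the common four-line FASTQ record layout (header starting with `@`, sequence, separator starting with `+`, quality string) and is not how the SRA Toolkit itself works. The names `FastqRecord`, `parse_fastq`, and `is_short_read`, and the two sample reads, are hypothetical and exist only for this example.

```python
import io
from typing import Iterator, NamedTuple, TextIO


class FastqRecord(NamedTuple):
    """One read from a FASTQ file (hypothetical helper type)."""
    read_id: str
    sequence: str
    quality: str


def parse_fastq(handle: TextIO) -> Iterator[FastqRecord]:
    """Yield records from FASTQ text, assuming four lines per record."""
    while True:
        header = handle.readline()
        if not header:
            return  # end of input
        header = header.rstrip("\r\n")
        if not header:
            continue  # tolerate blank lines between records
        sequence = handle.readline().rstrip("\r\n")
        separator = handle.readline().rstrip("\r\n")
        quality = handle.readline().rstrip("\r\n")
        if not header.startswith("@") or not separator.startswith("+"):
            raise ValueError(f"malformed FASTQ record near {header!r}")
        if len(sequence) != len(quality):
            raise ValueError(f"sequence/quality length mismatch in {header!r}")
        yield FastqRecord(header[1:], sequence, quality)


def is_short_read(record: FastqRecord, limit: int = 1000) -> bool:
    """Return True if the read is shorter than `limit` base pairs.

    The default of 1,000 follows the abstract's description of short reads.
    """
    return len(record.sequence) < limit


# Made-up sample data, not taken from any real SRA accession.
EXAMPLE = "@read1\nACGT\n+\nIIII\n@read2\nGGCCAATT\n+\nIIIIIIII\n"

records = list(parse_fastq(io.StringIO(EXAMPLE)))
for record in records:
    print(record.read_id, len(record.sequence), is_short_read(record))
```

Real FASTQ files are often compressed and very large, so a streaming parser like this one (which never holds the whole file in memory) is the usual shape, but an established library would normally be preferable to hand-written parsing.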