Accounting for technical noise in single-cell RNA-seq experiments (original) (raw)

Accession codes

Primary accessions

ArrayExpress

Change history

In the version of this article initially published online, the dilution for the ERCC Spike-In Control Mix added to the lysis mix was given as 1:40,000 in the Online Methods. The actual dilution used was 1:400. The error has been corrected for the PDF and HTML versions of this article.

References

  1. Hashimshony, T., Wagner, F., Sher, N. & Yanai, I. Cell Rep. 2, 666–673 (2012).
    Article CAS Google Scholar
  2. Islam, S. et al. Genome Res. 21, 1160–1167 (2011).
    Article CAS Google Scholar
  3. Ramsköld, D. et al. Nat. Biotechnol. 30, 777–782 (2012).
    Article Google Scholar
  4. Tang, F. et al. Nat. Protoc. 5, 516–535 (2010).
    Article CAS Google Scholar
  5. Tang, F. et al. Nat. Methods 6, 377–382 (2009).
    Article CAS Google Scholar
  6. Chambers, I. et al. Nature 450, 1230–1234 (2007).
    Article CAS Google Scholar
  7. Reynolds, N. et al. Cell Stem Cell 10, 583–594 (2012).
    Article CAS Google Scholar
  8. Chang, H.H., Hemberg, M., Barahona, M., Ingber, D.E. & Huang, S. Nature 453, 544–547 (2008).
    Article CAS Google Scholar
  9. Toyooka, Y., Shimosato, D., Murakami, K., Takahashi, K. & Niwa, H. Development 135, 909–918 (2008).
    Article CAS Google Scholar
  10. Shalek, A.K. et al. Nature 498, 236–240 (2013).
    Article CAS Google Scholar
  11. Marioni, J.C., Mason, C.E., Mane, S.M., Stephens, M. & Gilad, Y. Genome Res. 18, 1509–1517 (2008).
    Article CAS Google Scholar
  12. Brady, S.M. et al. Science 318, 801–806 (2007).
    Article CAS Google Scholar
  13. Jiang, L. et al. Genome Res. 21, 1543–1551 (2011).
    Article CAS Google Scholar
  14. Benjamini, Y. & Hochberg, Y. Stat. Soc. Series B Stat. Methodol. 57, 289–300 (1995).
    Google Scholar
  15. Clough, S.J. & Bent, A.F. Plant J. 16, 735–743 (1998).
    Article CAS Google Scholar
  16. Birnbaum, K. et al. Nat. Methods 2, 615–619 (2005).
    Article CAS Google Scholar
  17. Wu, T.D. & Nacu, S. Bioinformatics 26, 873–881 (2010).
    Article CAS Google Scholar
  18. Irizarry, R.A. et al. Biostatistics 4, 249–264 (2003).
    Article Google Scholar
  19. Anders, S. & Huber, W. Genome Biol. 11, R106 (2010).
    Article CAS Google Scholar
  20. Alexa, A., Rahnenfuhrer, J. & Lengauer, T. Bioinformatics 22, 1600–1607 (2006).
    Article CAS Google Scholar

Download references

Acknowledgements

We thank E. Furlong and W. Huber for helpful discussions. We also acknowledge K. Birnbaum (New York University) for kindly providing pWOX5::GFP and pGl2::GFP seed. S.A. acknowledges partial funding from the European Union (FP7-Health, project Radiant); M.G.H. acknowledges the Australian Research Council for present funding. The EMBL Genomics Core Facility provided technical support for this work. We acknowledge A. Surani for the use of the C1 Single-Cell Auto Prep System in his lab and B. Jones for performing the experiment. We also acknowledge A. McKenzie (Medical Research Council Laboratory of Molecular Biology) for the _Il13_-GFP reporter mice and the Sanger-EBI Single Cell Centre for technical support. We acknowledge the support of European Research Council Starting Grant no. 260507, ThSWITCH.

Author information

Author notes

  1. Philip Brennecke, Simon Anders and Jong Kyoung Kim: These authors contributed equally to this work.

Authors and Affiliations

  1. European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
    Philip Brennecke, Simon Anders, Bianka Baying, Vladimir Benes & Marcus G Heisler
  2. EMBL, European Bioinformatics Institute (EBI), Hinxton, UK
    Jong Kyoung Kim, Aleksandra A Kołodziejczyk, Xiuwei Zhang, Sarah A Teichmann & John C Marioni
  3. Wellcome Trust Sanger Institute, Hinxton, UK
    Aleksandra A Kołodziejczyk & Sarah A Teichmann
  4. Medical Research Council Laboratory of Molecular Biology, Cambridge, UK
    Valentina Proserpio
  5. University of Sydney, Sydney, Australia
    Marcus G Heisler

Authors

  1. Philip Brennecke
    You can also search for this author inPubMed Google Scholar
  2. Simon Anders
    You can also search for this author inPubMed Google Scholar
  3. Jong Kyoung Kim
    You can also search for this author inPubMed Google Scholar
  4. Aleksandra A Kołodziejczyk
    You can also search for this author inPubMed Google Scholar
  5. Xiuwei Zhang
    You can also search for this author inPubMed Google Scholar
  6. Valentina Proserpio
    You can also search for this author inPubMed Google Scholar
  7. Bianka Baying
    You can also search for this author inPubMed Google Scholar
  8. Vladimir Benes
    You can also search for this author inPubMed Google Scholar
  9. Sarah A Teichmann
    You can also search for this author inPubMed Google Scholar
  10. John C Marioni
    You can also search for this author inPubMed Google Scholar
  11. Marcus G Heisler
    You can also search for this author inPubMed Google Scholar

Contributions

P.B. designed plant cell experiments, carried out experiments, interpreted results and wrote the paper; S.A. developed the statistical method, performed bioinformatics analyses and wrote the paper; J.K.K. performed bioinformatics analyses and helped write the paper; A.A.K. designed and carried out mouse cell experiments and helped write the paper; X.Z. designed and analyzed mouse cell experiments and helped write the paper; V.P. designed and carried out mouse cell experiments and helped write the paper; B.B. adapted an Illumina sequencing library preparation protocol; V.B. contributed to adapting the Illumina sequencing library preparation protocol and gave advice; S.A.T. designed mouse cell experiments and helped write the paper; J.C.M. contributed to the development of the statistical method, performed bioinformatics analyses, supervised the project and wrote the paper; M.G.H. initiated the project, designed plant cell experiments, interpreted results, supervised the project and wrote the paper.

Corresponding authors

Correspondence toJohn C Marioni or Marcus G Heisler.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Text and Figures

Supplementary Figures 1–14 and Supplementary Notes 1–9 (PDF 8682 kb)

Supplementary Software

Accounting for technical noise in single-cell RNA-seq experiments – Supplement II. This file contains the R code used to perform the analysis described in the manuscript. (PDF 1938 kb)

Supplementary Table 1

Estimated amounts of total RNA in single plant cells. Content of total RNA from single cells was estimated based on the 50 pg HeLa total RNA spike-in. The calculation is based on the assumption that the fraction of polyadenylated RNA is comparable between HeLa and A. thaliana input material. For detailed description refer to the Methods section. (XLSX 8 kb)

Supplementary Table 2

List of highly variable genes in GL2 cells. The columns give the gene ID, the gene name, the normalized average read count, the cell with the strongest expression, and, for each cell, the log2 ratio of the cell's expression to the average. (XLSX 134 kb)

Supplementary Table 3

List of highly variable genes in QC cells. Columns are the same as in Supplementary Table 2. (XLSX 16 kb)

Supplementary Table 4

List of GO categories that are significantly enriched for highly variable genes (Online Methods). (XLSX 10 kb)

Supplementary Table 5

Read counts for the 91 mouse immune cells spiked with ERCC spike-ins. Each column corresponds to a cell, each row to gene or an ERCC spike-in molecule. Mouse gene names have been replaces by randomized identifiers (column 1). The second column contains the transcript lengths used for the analysis in Supplementary Note 5. The transcript lengths are computed from Ensembl annotation by taking the union of all exons within a gene, where the exons annotated as “retained introns” and “nonsense mediated decay” are excluded. (XLSX 14402 kb)

Supplementary Table 6

Full list of barcoded Illumina PE adapters used for multiplexing of cDNA libraries. The set of adapters used depends on the degree of multiplexing applied to the samples. 4-plex is the lower limit for multiplexing. (XLSX 10 kb)

Supplementary Table 7

List of qPCR primers used (XLSX 8 kb)

Supplementary Table 8

Read counts for A. thaliana experiments. Raw number of reads mapped to each gene for all samples. (XLSX 6101 kb)

Supplementary Table 9

Transcript lengths for the human genome used for the analysis described in Supplementary Note 5, computed as described above, in the legend for Supplementary Table 5. (XLSX 1227 kb)

Rights and permissions

About this article

Cite this article

Brennecke, P., Anders, S., Kim, J. et al. Accounting for technical noise in single-cell RNA-seq experiments.Nat Methods 10, 1093–1095 (2013). https://doi.org/10.1038/nmeth.2645

Download citation