Single-neuron sequencing analysis of L1 retrotransposition and somatic mutation in the human brain - PubMed (original) (raw)

. 2012 Oct 26;151(3):483-96.

doi: 10.1016/j.cell.2012.09.035.

Xuyu Cai, Eunjung Lee, L Benjamin Hills, Princess C Elhosary, Hillel S Lehmann, J J Parker, Kutay D Atabay, Edward C Gilmore, Annapurna Poduri, Peter J Park, Christopher A Walsh

Affiliations

Single-neuron sequencing analysis of L1 retrotransposition and somatic mutation in the human brain

Gilad D Evrony et al. Cell. 2012.

Abstract

A major unanswered question in neuroscience is whether there exists genomic variability between individual neurons of the brain, contributing to functional diversity or to an unexplained burden of neurological disease. To address this question, we developed a method to amplify genomes of single neurons from human brains. Because recent reports suggest frequent LINE-1 (L1) retrotransposition in human brains, we performed genome-wide L1 insertion profiling of 300 single neurons from cerebral cortex and caudate nucleus of three normal individuals, recovering >80% of germline insertions from single neurons. While we find somatic L1 insertions, we estimate <0.6 unique somatic insertions per neuron, and most neurons lack detectable somatic insertions, suggesting that L1 is not a major generator of neuronal diversity in cortex and caudate. We then genotyped single cortical cells to characterize the mosaicism of a somatic AKT3 mutation identified in a child with hemimegalencephaly. Single-neuron sequencing allows systematic assessment of genomic diversity in the human brain.

Copyright © 2012 Elsevier Inc. All rights reserved.

PubMed Disclaimer

Figures

Figure 1

Figure 1. Isolation and genome amplification of single human neuronal nuclei

(A) Schematic of the method. (B) Fluorescence-activated cell sorting of cortical nuclei stained with NeuN shows two separable populations: NeuN+ (population I) and NeuN− (population II). A subset of population I (Ia) consisting of large neuronal nuclei was sorted and reanalyzed, confirming sort purity. Two populations of nuclei are sometimes apparent without NeuN staining, due to the increased background staining of the larger population I nuclei. Fluorescence decrease of the sorted population on reanalysis is always observed due to photobleaching and washing of non-specific staining in the first sort. (C) RT-PCR confirming the neuronal and non-neuronal identities of populations Ia and II, respectively, by assaying for expression of nuclear RNA for two neuronal (SNAP25 and SYT1), two astroglial (GFAP and AQP4), and input control (RPL37A) genes. RT-PCR and western blot experiments (Figures 1C and 1D) were performed with NeuN/Mef2c double labeling in which all NeuN+ nuclei were Mef2c+ (data not shown). (D) Western blot analysis of NeuN and Olig2 (an oligodendrocyte marker), confirming neuronal and non-neuronal identity, respectively, of populations Ia and II. (E) Quantitative MDA reactions monitored in real-time confirm accurate sorting of the desired number of nuclei. The time to amplify to a threshold above background (TimeT, analogous to qPCR CT value) is plotted on the y-axis (error bars ±1SD, n=7 or 8 reactions per condition). Points were fit to a semi-log line of slope −4.3, corresponding to 1.7-fold amplification per unit time. See also Figure S1.

Figure 2

Figure 2. Single-neuron genome-wide coverage, amplification bias, and identity fingerprinting

(A) Schematic of the low-coverage genome sequencing method. (B) Chromosome copy numbers of single cortical neurons from normal (UMB1465, 46XY) and trisomy 18 (UMB866, 47XY,+18) individuals. Copy numbers are normalized to the median copy number of each chromosome across the 8 single neurons, with autosomes adjusted to a median copy number of 2. Orange lines denote ±1 copy. (C) Higher-resolution copy number profiling in 6,000 equal-read bins of ~500kb in size shows that MDA bias can be corrected by normalization to an MDA-amplified reference. Orange lines denote ±1 copy, and purple points indicate off-scale bins. (D) Identifiler fingerprinting confirms the single neurons derive from the correct individuals, and measures allele preferential amplification (PA), low amplification (LA), allele dropout (AD), and discordant allele (DA) rates. (E) Fraction of genotypes by SNP microarray that are concordant between 3 single neurons and bulk DNA confirms the single neurons derive from the correct individual. See also Figure S2 and Table S1.

Figure 3

Figure 3. Genome-wide L1Hs insertion profiling (L1-IP) in single neurons

(A) Schematic of the L1-IP method. Primers 1 and 3 (L1Hs-AC and ILMN-Adaptor1_L1Hs-G, respectively) are specific to L1Hs diagnostic nucleotides. Primer 2 represents 8 different 5bp arbitrary seed primers, each containing the same barcode. Primer 4 (ILMN-SeqAdaptor2) incorporates an Illumina adaptor. See Table S3 for primer sequences. (B) L1-IP sequencing reads for one representative known reference insertion (L1Hs-KR-chr11_115209613). For each sample, a total read coverage track and a raw reads track are shown. Each read coverage track is scaled to the maximum peak height of the sample (scale on the right, in reads per million mapped reads, RPM). In the raw reads track, up to 3 reads are shown for each position. The green arrow marks the L1Hs insertion. Plus and minus strand reads are red and blue, respectively. Low-level MDA-chimera reads (yellow asterisks) are seen in the local region of the true insertion only in MDA-amplified samples. (C) The number of peaks found above different confidence score thresholds corresponding to known reference insertions (KR), known non-reference insertions (KNR), and unknown peaks (UNK). Data shown is the mean for all bulk (n=31), 100-cell (n=15) and 1-cell (n=303) samples from all 3 individuals (includes 15, 5 and 3 technical replicates, respectively). Shading around each line shows ±SD. KR and KNR insertions used for peak annotation are in Table S5. (D) Representative gel images of 3’ junction PCR (3’PCR) of 20 different germline insertions (8 KR, 8 KNR, and 4 UNK). (E) 3’PCR quantification of AD and LD in 1-neuron samples (n=83), of 3 heterozygous and 3 homozygous L1Hs insertions. AD and LD are quantified for heterozygous and homozygous insertions, respectively. NL, normal amplification; LA, low amplification; AD, allelic-dropout; LD, locus-dropout. See also Figures S3, S4, S5, and S6.

Figure 4

Figure 4. Chromosome L1-IP profile of single neurons

Circos plot (Krzywinski et al., 2009) of chromosomes 1 and 2, from representative L1-IP samples from individual 1465: (A) bulk DNA, (B) cortex 100-neurons #1, (C) cortex 1-neuron #2, and (D) caudate 1-neuron #1. Peaks are shown for loci where at least one of the samples has a peak confidence score >0.5. Bulk DNA track shows the mean confidence score across all bulk DNA samples of individual 1465. KR, KNR, and UNK peaks are colored as indicated in the key. Below 100-neuron and 1-neuron sample tracks are annotations for peaks present with a score >0.5 in bulk DNA but absent in the sample (‘Dropout’), and peaks absent from bulk DNA but present in the sample with a score >0.5 and at least 20kb away from the nearest KR/KNR insertion in the individual to exclude MDA-chimera peaks (‘Somatic peak’). Figures for all chromosomes can be found in Supplemental Data 1.

Figure 5

Figure 5. Single-neuron fingerprinting with L1-IP

(A) Unbiased hierarchical clustering of all samples sequenced in this study (excluding technical replicates) by transposon profile. Each row represents a sample, and each column represents a specific L1Hs insertion. Data is shown for all KR and KNR insertions with an average score of at least 0.5 in at least one individual’s samples. Black and white squares indicate presence or absence, respectively, of the insertion using a confidence score threshold of 0.5. All samples cluster correctly by individual except for 3 low-quality 1-neuron samples that cluster in a separate branch (bottom branch). Additional row annotations are colored for individual (I), sample type (S), and tissue (T), illustrating correct clustering by individual. Column annotations show annotation for KR (black) and KNR (white) insertions, and mean confidence scores across all samples of each individual. Samples also cluster by individual when including all insertions including unknown peaks (data not shown). (B) L1-IP read coverage for a representative polymorphic known non-reference insertion (L1Hs-KNR-1158). (C) Representative gel images of 3’PCR of 11 polymorphic germline insertions with 1-neuron DNA. 3’PCR products are only detected in individuals predicted by L1-IP to have the insertion. All polymorphic insertions tested are listed in Table S3.

Figure 6

Figure 6. Quantification of somatic L1Hs insertions, and validation of a somatic insertion, in single neurons

(A) Mean number (±SD) of somatic insertion candidates per single neuron in each tissue in the study, corrected for sensitivity. The insertion rates per neuron are shown before and after 3’PCR and secondary validation. Horizontal dashed lines and adjacent numbers indicate the mean number of insertions across all single neurons from all tissues. Low-quality samples that did not achieve the necessary KNR detection rate with a confidence score >0.5 were excluded from the analysis in a quality control check (‘QC-fail’ in Table S2). The number of cells included in each analysis were n=50, 45, 45, 50, 50, and 44 for 1465-cortex, 1465-caudate, 4638-cortex, 4638-caudate, 4643-cortex, and 4643-caudate, respectively, after removing low-quality samples failing quality control. (B) Mean number (±SD) of unique somatic insertion candidates (i.e. present in only one single neuron sample of the individual) per single neuron in each tissue, corrected for sensitivity. (C) Gel images of 3’PCR validation of a somatic L1Hs insertion found by L1-IP in individual 1465 cortex 1-neuron #2 (L1-IP peak ID chr15_67625710_plus_0_0). (D) Location of the somatic L1Hs insertion (L1-IP peak ID chr15_67625710_plus_0_0) in antisense orientation in intron 4 of the gene IQCH, and the corresponding L1-IP peak in 1465-cortex 1-neuron #2. The insertion’s target site duplication coordinates are chr15: 67,625,702–67,625,714 (hg19). A 5’ transduction (orange) identified the source L1Hs on chr8: 73,787,792–73,793,823. (E) Representative gel images from a 3’PCR screen of 83 1-neuron samples from individual 1465 cortex (24 1-neuron samples shown) for the somatic insertion in Figures 6C and 6D. The two cortical 1-neuron samples (#2 and #77) found to have the insertion are shown. 1-neuron #77 was found to have the insertion only in the 3’PCR screen since it was not profiled by L1-IP. 3’PCR product sequencing and full-length cloning confirmed the insertion had identical 5’ and 3’ breakpoints and TSD in both neurons (#2 and #77). See also Figure S7.

Figure 7

Figure 7. Single-cell analysis of a somatic brain AKT3 mutation causing hemimegalencephaly

(A) An axial T2-weighted image from the MRI of the hemimegalencephaly patient, HMG-3, with a somatic AKT3 E17K mutation shows the enlarged right hemisphere with abnormally thick and malformed cerebral gray matter and abnormal signal of the white matter (white dashed line). On the right is an MRI image of a normal brain. (B) Single-cell FACS sorting of HMG-3 resected cortex. (C) Representative Sanger sequencing traces of a bulk unsorted nuclei sample and single-cell samples from NeuN+ and NeuN− populations. The calculated % mosaicism for single-cell samples (corrected for allelic dropout) is shown. Arrow and asterisks mark the site of the AKT3 c.49G→A (E17K) mutation. See Table S4 for percent mosaicism of all samples from HMG-3.

Similar articles

Cited by

References

    1. Baillie JK, Barnett MW, Upton KR, Gerhardt DJ, Richmond TA, De Sapio F, Brennan PM, Rizzu P, Smith S, Fell M, et al. Somatic retrotransposition alters the genetic landscape of the human brain. Nature. 2011;479:534–537. - PMC - PubMed
    1. Beck CR, Collier P, Macfarlane C, Malig M, Kidd JM, Eichler EE, Badge RM, Moran JV. LINE-1 retrotransposition activity in human genomes. Cell. 2010;141:1159–1170. - PMC - PubMed
    1. Blainey PC, Quake SR. Digital MDA for enumeration of total nucleic acid contamination. Nucleic Acids Res. 2011;39:e19. - PMC - PubMed
    1. Cantrell MA, Scott L, Brown CJ, Martinez AR, Wichman HA. Loss of LINE-1 activity in the megabats. Genetics. 2008;178:393–404. - PMC - PubMed
    1. Coufal NG, Garcia-Perez JL, Peng GE, Yeo GW, Mu Y, Lovci MT, Morell M, O'Shea KS, Moran JV, Gage FH. L1 retrotransposition in human neural progenitor cells. Nature. 2009;460:1127–1131. - PMC - PubMed

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources