A method to sequence and quantify DNA integration for monitoring outcome in gene therapy (original) (raw)

Journal Article

,

1Department of Microbiology, University of Pennsylvania School of Medicine, 3610 Hamilton Walk, Philadelphia, PA 19104-6076, USA, 2Department of Family/Preventive Medicine, University of California, San Diego School of Medicine, San Diego, CA 92093-0901, USA, 3Commissariat à l'énergie atomique et aux énergies alternatives, Institute of Emerging Diseases and Innovative Therapies (iMETI), 4INSERM U962 and University of Paris XI, 92265 Fontenay-aux-Roses, France, 5Genetics Division, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA, 6Department of Biotherapy, Hopital Necker-Enfants Malades, Assistance Publique – Hôpitaux de Paris (AP-HP), Université René Descartes, INSERM, Centre d’Investigation Clinique intégré en Biothérapies, Groupe Hospitalier Universitaire Ouest, AP-HP, Paris, France, 7Center for Cell Engineering, 8Molecular Pharmacology and Chemistry Program, Memorial Sloan-Kettering Cancer Center, New York, NY 10065, USA and 9Division of Genetics and Physiology, Department of Biology, 20014 University of Turku, Turku, Finland

Search for other works by this author on:

,

1Department of Microbiology, University of Pennsylvania School of Medicine, 3610 Hamilton Walk, Philadelphia, PA 19104-6076, USA, 2Department of Family/Preventive Medicine, University of California, San Diego School of Medicine, San Diego, CA 92093-0901, USA, 3Commissariat à l'énergie atomique et aux énergies alternatives, Institute of Emerging Diseases and Innovative Therapies (iMETI), 4INSERM U962 and University of Paris XI, 92265 Fontenay-aux-Roses, France, 5Genetics Division, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA, 6Department of Biotherapy, Hopital Necker-Enfants Malades, Assistance Publique – Hôpitaux de Paris (AP-HP), Université René Descartes, INSERM, Centre d’Investigation Clinique intégré en Biothérapies, Groupe Hospitalier Universitaire Ouest, AP-HP, Paris, France, 7Center for Cell Engineering, 8Molecular Pharmacology and Chemistry Program, Memorial Sloan-Kettering Cancer Center, New York, NY 10065, USA and 9Division of Genetics and Physiology, Department of Biology, 20014 University of Turku, Turku, Finland

Search for other works by this author on:

,

1Department of Microbiology, University of Pennsylvania School of Medicine, 3610 Hamilton Walk, Philadelphia, PA 19104-6076, USA, 2Department of Family/Preventive Medicine, University of California, San Diego School of Medicine, San Diego, CA 92093-0901, USA, 3Commissariat à l'énergie atomique et aux énergies alternatives, Institute of Emerging Diseases and Innovative Therapies (iMETI), 4INSERM U962 and University of Paris XI, 92265 Fontenay-aux-Roses, France, 5Genetics Division, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA, 6Department of Biotherapy, Hopital Necker-Enfants Malades, Assistance Publique – Hôpitaux de Paris (AP-HP), Université René Descartes, INSERM, Centre d’Investigation Clinique intégré en Biothérapies, Groupe Hospitalier Universitaire Ouest, AP-HP, Paris, France, 7Center for Cell Engineering, 8Molecular Pharmacology and Chemistry Program, Memorial Sloan-Kettering Cancer Center, New York, NY 10065, USA and 9Division of Genetics and Physiology, Department of Biology, 20014 University of Turku, Turku, Finland

Search for other works by this author on:

,

1Department of Microbiology, University of Pennsylvania School of Medicine, 3610 Hamilton Walk, Philadelphia, PA 19104-6076, USA, 2Department of Family/Preventive Medicine, University of California, San Diego School of Medicine, San Diego, CA 92093-0901, USA, 3Commissariat à l'énergie atomique et aux énergies alternatives, Institute of Emerging Diseases and Innovative Therapies (iMETI), 4INSERM U962 and University of Paris XI, 92265 Fontenay-aux-Roses, France, 5Genetics Division, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA, 6Department of Biotherapy, Hopital Necker-Enfants Malades, Assistance Publique – Hôpitaux de Paris (AP-HP), Université René Descartes, INSERM, Centre d’Investigation Clinique intégré en Biothérapies, Groupe Hospitalier Universitaire Ouest, AP-HP, Paris, France, 7Center for Cell Engineering, 8Molecular Pharmacology and Chemistry Program, Memorial Sloan-Kettering Cancer Center, New York, NY 10065, USA and 9Division of Genetics and Physiology, Department of Biology, 20014 University of Turku, Turku, Finland

Search for other works by this author on:

,

1Department of Microbiology, University of Pennsylvania School of Medicine, 3610 Hamilton Walk, Philadelphia, PA 19104-6076, USA, 2Department of Family/Preventive Medicine, University of California, San Diego School of Medicine, San Diego, CA 92093-0901, USA, 3Commissariat à l'énergie atomique et aux énergies alternatives, Institute of Emerging Diseases and Innovative Therapies (iMETI), 4INSERM U962 and University of Paris XI, 92265 Fontenay-aux-Roses, France, 5Genetics Division, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA, 6Department of Biotherapy, Hopital Necker-Enfants Malades, Assistance Publique – Hôpitaux de Paris (AP-HP), Université René Descartes, INSERM, Centre d’Investigation Clinique intégré en Biothérapies, Groupe Hospitalier Universitaire Ouest, AP-HP, Paris, France, 7Center for Cell Engineering, 8Molecular Pharmacology and Chemistry Program, Memorial Sloan-Kettering Cancer Center, New York, NY 10065, USA and 9Division of Genetics and Physiology, Department of Biology, 20014 University of Turku, Turku, Finland

Search for other works by this author on:

,

1Department of Microbiology, University of Pennsylvania School of Medicine, 3610 Hamilton Walk, Philadelphia, PA 19104-6076, USA, 2Department of Family/Preventive Medicine, University of California, San Diego School of Medicine, San Diego, CA 92093-0901, USA, 3Commissariat à l'énergie atomique et aux énergies alternatives, Institute of Emerging Diseases and Innovative Therapies (iMETI), 4INSERM U962 and University of Paris XI, 92265 Fontenay-aux-Roses, France, 5Genetics Division, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA, 6Department of Biotherapy, Hopital Necker-Enfants Malades, Assistance Publique – Hôpitaux de Paris (AP-HP), Université René Descartes, INSERM, Centre d’Investigation Clinique intégré en Biothérapies, Groupe Hospitalier Universitaire Ouest, AP-HP, Paris, France, 7Center for Cell Engineering, 8Molecular Pharmacology and Chemistry Program, Memorial Sloan-Kettering Cancer Center, New York, NY 10065, USA and 9Division of Genetics and Physiology, Department of Biology, 20014 University of Turku, Turku, Finland

Search for other works by this author on:

,

1Department of Microbiology, University of Pennsylvania School of Medicine, 3610 Hamilton Walk, Philadelphia, PA 19104-6076, USA, 2Department of Family/Preventive Medicine, University of California, San Diego School of Medicine, San Diego, CA 92093-0901, USA, 3Commissariat à l'énergie atomique et aux énergies alternatives, Institute of Emerging Diseases and Innovative Therapies (iMETI), 4INSERM U962 and University of Paris XI, 92265 Fontenay-aux-Roses, France, 5Genetics Division, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA, 6Department of Biotherapy, Hopital Necker-Enfants Malades, Assistance Publique – Hôpitaux de Paris (AP-HP), Université René Descartes, INSERM, Centre d’Investigation Clinique intégré en Biothérapies, Groupe Hospitalier Universitaire Ouest, AP-HP, Paris, France, 7Center for Cell Engineering, 8Molecular Pharmacology and Chemistry Program, Memorial Sloan-Kettering Cancer Center, New York, NY 10065, USA and 9Division of Genetics and Physiology, Department of Biology, 20014 University of Turku, Turku, Finland

Search for other works by this author on:

,

1Department of Microbiology, University of Pennsylvania School of Medicine, 3610 Hamilton Walk, Philadelphia, PA 19104-6076, USA, 2Department of Family/Preventive Medicine, University of California, San Diego School of Medicine, San Diego, CA 92093-0901, USA, 3Commissariat à l'énergie atomique et aux énergies alternatives, Institute of Emerging Diseases and Innovative Therapies (iMETI), 4INSERM U962 and University of Paris XI, 92265 Fontenay-aux-Roses, France, 5Genetics Division, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA, 6Department of Biotherapy, Hopital Necker-Enfants Malades, Assistance Publique – Hôpitaux de Paris (AP-HP), Université René Descartes, INSERM, Centre d’Investigation Clinique intégré en Biothérapies, Groupe Hospitalier Universitaire Ouest, AP-HP, Paris, France, 7Center for Cell Engineering, 8Molecular Pharmacology and Chemistry Program, Memorial Sloan-Kettering Cancer Center, New York, NY 10065, USA and 9Division of Genetics and Physiology, Department of Biology, 20014 University of Turku, Turku, Finland

Search for other works by this author on:

,

1Department of Microbiology, University of Pennsylvania School of Medicine, 3610 Hamilton Walk, Philadelphia, PA 19104-6076, USA, 2Department of Family/Preventive Medicine, University of California, San Diego School of Medicine, San Diego, CA 92093-0901, USA, 3Commissariat à l'énergie atomique et aux énergies alternatives, Institute of Emerging Diseases and Innovative Therapies (iMETI), 4INSERM U962 and University of Paris XI, 92265 Fontenay-aux-Roses, France, 5Genetics Division, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA, 6Department of Biotherapy, Hopital Necker-Enfants Malades, Assistance Publique – Hôpitaux de Paris (AP-HP), Université René Descartes, INSERM, Centre d’Investigation Clinique intégré en Biothérapies, Groupe Hospitalier Universitaire Ouest, AP-HP, Paris, France, 7Center for Cell Engineering, 8Molecular Pharmacology and Chemistry Program, Memorial Sloan-Kettering Cancer Center, New York, NY 10065, USA and 9Division of Genetics and Physiology, Department of Biology, 20014 University of Turku, Turku, Finland

Search for other works by this author on:

,

1Department of Microbiology, University of Pennsylvania School of Medicine, 3610 Hamilton Walk, Philadelphia, PA 19104-6076, USA, 2Department of Family/Preventive Medicine, University of California, San Diego School of Medicine, San Diego, CA 92093-0901, USA, 3Commissariat à l'énergie atomique et aux énergies alternatives, Institute of Emerging Diseases and Innovative Therapies (iMETI), 4INSERM U962 and University of Paris XI, 92265 Fontenay-aux-Roses, France, 5Genetics Division, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA, 6Department of Biotherapy, Hopital Necker-Enfants Malades, Assistance Publique – Hôpitaux de Paris (AP-HP), Université René Descartes, INSERM, Centre d’Investigation Clinique intégré en Biothérapies, Groupe Hospitalier Universitaire Ouest, AP-HP, Paris, France, 7Center for Cell Engineering, 8Molecular Pharmacology and Chemistry Program, Memorial Sloan-Kettering Cancer Center, New York, NY 10065, USA and 9Division of Genetics and Physiology, Department of Biology, 20014 University of Turku, Turku, Finland

Search for other works by this author on:

... Show more

Received:

29 October 2010

Revision received:

26 January 2011

Accepted:

24 February 2011

Cite

Troy Brady, Shoshannah L. Roth, Nirav Malani, Gary P. Wang, Charles C. Berry, Philippe Leboulch, Salima Hacein-Bey-Abina, Marina Cavazzana-Calvo, Eirini P. Papapetrou, Michel Sadelain, Harri Savilahti, Frederic D. Bushman, A method to sequence and quantify DNA integration for monitoring outcome in gene therapy, Nucleic Acids Research, Volume 39, Issue 11, 1 June 2011, Page e72, https://doi.org/10.1093/nar/gkr140
Close

Navbar Search Filter Mobile Enter search term Search

Abstract

Human genetic diseases have been successfully corrected by integration of functional copies of the defective genes into human cells, but in some cases integration of therapeutic vectors has activated proto-oncogenes and contributed to leukemia. For this reason, extensive efforts have focused on analyzing integration site populations from patient samples, but the most commonly used methods for recovering newly integrated DNA suffer from severe recovery biases. Here, we show that a new method based on phage Mu transposition in vitro allows convenient and consistent recovery of integration site sequences in a form that can be analyzed directly using DNA barcoding and pyrosequencing. The method also allows simple estimation of the relative abundance of gene-modified cells from human gene therapy subjects, which has previously been lacking but is crucial for detecting expansion of cell clones that may be a prelude to adverse events.

INTRODUCTION

Human gene therapy has been carried out successfully for several diseases (1–8), but adverse events have occurred in which subjects developed leukemia associated with insertion of therapeutic vectors near proto-oncogenes (3,6,9,10). For this reason, it is important to track the location and abundance of different integration sites in cells from gene therapy-treated subjects. Tracking integration sites is also of interest in the use of transposons as insertional mutations in model organisms and in basic studies of integrating genomic parasites (11,12).

In the previously used protocols for integration site recovery in gene therapy, gene-corrected cells were isolated from patients, genomic DNA was purified and samples were typically cleaved with restriction enzymes. The exposed DNA ends were then ligated to adaptor DNAs and samples amplified using PCR with one primer complementary to the adaptor and the other complementary to the vector DNA terminus. Sites of integration were identified by sequencing PCR products from a primer bound to the vector DNA, so that the sequence read extended into the flanking human DNA (13,14). It was noticed, however, that integration sites were not equally recovered by different enzymes, pointing to a recovery bias for this method. Although sites were efficiently recovered when found near restriction cleavage sites, they were difficult to detect when not positioned optimally (15,16). Moreover, implementing restriction enzyme-based methods is complex—restriction enzymes need to be identified, which do not cleave DNA within amplicons of interest, only enzymes lacking CpG dinucleotides in their recognition sites can be used due to biased distributions of CpG in mammalian DNA, and large amounts of genomic DNA are needed. Thus, there is intense interest in alternative approaches.

Improved methods are starting to be described, but all are in early stages (15–18). Concerns with available approaches include efficiency of recovery of rare integration sites (16), and the need for large amounts of genomic DNA for analysis (17,18), which is often not available in gene therapy applications.

Here we report a method for recovering sites of integrated DNA using the bacterial transposase MuA to introduce adaptors into genomic DNA to allow PCR amplification. This method is quick and simple, avoids the bias associated with restriction enzymes, recovers integration sites in a near random fashion, and provides a simple measure of cell clonal abundance.

MATERIALS AND METHODS

Cell lines and gene therapy patient samples

293T and SupT1 cells were infected in vitro using a VSV-G pseudotyped MLV or HIV vector produced by transfecting 293T cells. To generate HIV-based vectors, cells were transfected with the LTR–GFP cassette plasmid p156RRLsin-PPTCMVGFPWPRE (19), the packaging construct pCMVdeltaR9 (20) and the vesicular stomatitis virus G-producing plasmid pMD.G. VSV-G pseudotyped MLV particles were produced using pMD.G but in combination with the MLV vector segment (pMX-eGFP) and packaging construct pCGP (pCGP, kindly provided by Paul Bates). Human gene therapy samples consisted of PBMCs (β-thalassemia) or CD3+ cells (SCID-X1). Details of gene therapy samples are reported elsewhere (1,2,7,8). The production of induced pluripotent cells containing a defined number of integration sites is reported in (21).

Mu-mediated integration site recovery

A detailed protocol for preparing Mu reactions is available in the Supplementary Report 1 MuSOP. Briefly, reaction buffer, oligonucleotide donor, target DNA and water are mixed on ice followed by addition of purified Mu transposase. Reactions are then incubated at 30°C for 2–4 h after which 2 µl of the reaction is used as input for PCR. Nested PCR was carried out as described (22) except that extension temperatures during cycling were changed to 70°C for the first seven cycles followed by 67°C for the remaining 37 cycles for samples with HIV-derived vectors. For both HIV- and MLV-based reactions, the number of cycles was reduced to 25 for nested PCR. Single round PCR amplification was performed with primers normally used in the nested round of PCR, each primer encoding 454 adaptor sequences and the LTR primer encoding a DNA barcode (15).

Integration site sequencing and analysis

Integration sites were sequenced using 454 FLX platform technology. Sequences reads were trimmed to remove primer sequence and aligned to the human genome (hg18, version 36.1) using BLAT. Sequences were required to have a single best hit with ≥98% identity to the human genome, to align within 3 bp of the beginning of the sequence read and to contain a perfect match to the expected LTR sequence lying downstream of the barcoded LTR primer. Comparisons to genomic features were carried out as described previously (23,24) using a combination of logistic regression and Bayesian model averaging supplemented by the random Forest machine learning algorithm. Gene expression analyses were based on data from 293T cells (25) with expression measured using the Affymetrix HU133 plus 2.0 gene chip array. Measuring sequence conservation at the site of restriction enzyme or Mu cleavage was performed using weblogo [http://weblogo.berkeley.edu/ and (26)]. Statistical details for generating Lorentz curves and recovery probability plots can be found in Supplementary Report 2 Mu Recovery. Sequence data from this study was submitted to Genbank under accession numbers HR819863–HR863973.

RESULTS

We first quantified biases in the detection of integrated DNA using protocols relying on restriction enzyme cleavage of genomic DNA. Analysis of the recoverability of an integration site based on its proximity to the nearest restriction site showed that integrated DNA ∼49 bp from a cleavage site was recovered most frequently, and frequency of recovery decreased sharply at longer or shorter distances (Figure 1A). The case of SCID-X1 patient 10 provides an example of complications due to this recovery bias (3,7). In this patient, an integrated vector activated expression of the nearby BMI1 proto-oncogene, which was associated with massive expansion of leukemic cells. When DNA from blood cells of patient 10 was cleaved and analyzed using four different restriction enzymes, only two enzymes allowed efficient recovery of the BMI1 integration site (Figure 1B). In an extreme effort to circumvent these limitations, a study of SCID-X1 gene-corrected patients used up to six different restriction enzymes to analyze each individual sample, but even with the difficulty and expense of this large scale effort, recovery was still significantly biased (7).

The Mu-based integration site recovery method. (A) Severe recovery biases in previous methods using restriction enzyme cleavage. A large collection of integration sites generated from SCID-X1 gene therapy (Supplementary Table S1) were analyzed and plotted to show the relative recovery frequency for different distances between the restriction enzyme site used in genomic DNA cleavage and the vector integration site. The graph summarizes data obtained using six different restriction enzymes. The sharp peak documents the recovery bias (the location of the peak differed modestly for the different restriction enzymes studied; data not shown). (B) Biased recovery efficiency for four restriction enzymes in a sample from an adverse event. The integration site within BMI1 was implicated in an adverse event in SCID-X1 patient 10 (3). Each bar indicates the percent of all integration sites from the leukemic cell sample deriving from the BMI1 site for each of the three restriction enzymes or three 6-cutter cocktail (Avr I, Spe I and Nhe I) used for isolation. (C) The engineered Mu DNA donor used in these studies. 5′ and 3′ DNA ends are as marked. The ‘N’ indicates the position of an amino-modifier that blocks the DNA 3′-end to prevent adaptor-to-adaptor amplification. The dark blue indicates binding sites for MuA transposase, light blue a spacer region and green the adaptor sequence for PCR amplification. (D) The Mu-mediated integration site recovery method. MuA transposition is used to install the engineered Mu DNA donor (top), allowing PCR amplification (middle). The PCR primers contain DNA barcodes (black segments) and primers for use in 454/Roche pyrosequencing (orange). PCR products can be used directly for pyrosequencing without cloning in bacterial plasmids.

Figure 1.

The Mu-based integration site recovery method. (A) Severe recovery biases in previous methods using restriction enzyme cleavage. A large collection of integration sites generated from SCID-X1 gene therapy (Supplementary Table S1) were analyzed and plotted to show the relative recovery frequency for different distances between the restriction enzyme site used in genomic DNA cleavage and the vector integration site. The graph summarizes data obtained using six different restriction enzymes. The sharp peak documents the recovery bias (the location of the peak differed modestly for the different restriction enzymes studied; data not shown). (B) Biased recovery efficiency for four restriction enzymes in a sample from an adverse event. The integration site within BMI1 was implicated in an adverse event in SCID-X1 patient 10 (3). Each bar indicates the percent of all integration sites from the leukemic cell sample deriving from the BMI1 site for each of the three restriction enzymes or three 6-cutter cocktail (Avr I, Spe I and Nhe I) used for isolation. (C) The engineered Mu DNA donor used in these studies. 5′ and 3′ DNA ends are as marked. The ‘N’ indicates the position of an amino-modifier that blocks the DNA 3′-end to prevent adaptor-to-adaptor amplification. The dark blue indicates binding sites for MuA transposase, light blue a spacer region and green the adaptor sequence for PCR amplification. (D) The Mu-mediated integration site recovery method. MuA transposition is used to install the engineered Mu DNA donor (top), allowing PCR amplification (middle). The PCR primers contain DNA barcodes (black segments) and primers for use in 454/Roche pyrosequencing (orange). PCR products can be used directly for pyrosequencing without cloning in bacterial plasmids.

The improved method reported here (Figure 1C and D) substitutes MuA-directed transposition in vitro for restriction enzyme cleavage and adaptor ligation [for background and reviews of MuA function see (11,27–33)]. An engineered transposon end is used as donor (Figure 1C), which contains (i) binding sites for the MuA transposase; (ii) an adaptor region complementary to PCR primers; and (iii) an amine blocking group at one DNA 3′-end. In the presence of MuA transposase, the oligonucleotide donors become covalently joined to target DNA, allowing convenient installation of primer sites in human genomic DNA. PCR amplification is then carried out using primers complementary to the vector DNA end and the adaptor. Because the adaptor contains a 5′ overhang and an amino-modified 3′-end, amplification must begin within the vector DNA and extend through the adaptor, preventing adaptor-to-adaptor amplification. As little as 100 ng of genomic target DNA can be used.

Nested PCR is routinely carried out using the restriction enzyme method, and can be used with the Mu-mediated method. However, tests showed that for samples containing relatively large numbers of integrated vectors, only a single round of PCR was sufficient to yield high quality sequence populations using the Mu-mediated method, and these data sets actually showed improved diversity (described in Supplementary Report 1 ‘Mu Standard Operating Procedure’).

The PCR primers used for the final amplification step are composites, containing both the priming sequences and sequences required for 454/Roche pyrosequencing. Primer sequences also contain a DNA barcode between the 454 sequence and the priming region, allowing large numbers of amplicons to be pooled, sequences determined, then sequence reads parsed using the barcodes (Figure 1D) (34–36). Thus, hundreds of samples can be processed in pools. Pyrosequence reads are then trimmed, aligned to the human genome and distributions analyzed.

To test the Mu-mediated method, we determined 25 194 total integration site sequences, which yielded 3382 unique vector integration sites after condensing duplicates. We analyzed both HIV- and γ-retrovirus-based vectors. Samples studied included cells from patients in two gene therapy trials, which treated SCID-X1 (3) and β-thalassemia (8) and tissue culture cells infected in vitro (summarized in Supplementary Table S1).

We first compared recovery biases of the Mu and restriction enzyme-based methods. We determined the sequence preferences for Mu integration in human DNA in vitro for 5968 integration site sequence reads that included the Mu-end oligonucleotide DNA. Alignment of human sequences at Mu integration sites revealed a detectable consensus sequence closely resembling that reported previously for Mu transposition (37), but with much lower information content than cleavage sites for restriction enzymes (Figure 2A and B and Supplementary Report 2), indicating less bias in the cleaving/joining reactions.

Consensus sequences at points of adaptor addition. (A) Information content at junctions between the engineered Mu DNA and human DNA derived from vector integration site sequence reads. The x-axis shows the DNA sequence position, where the site of joining to Mu DNA is between positions 10 and 11 (arrow). Perfect sequence conservation has information content of 2 bits (y-axis). Note that some bases have little or no information content, so no letters are visible. (B) Information content at adaptor junctions from vector integration sites recovered after cleavage with the restriction enzyme Mse I, where the site of cleavage is between positions 10 and 11 (arrow).

Figure 2.

Consensus sequences at points of adaptor addition. (A) Information content at junctions between the engineered Mu DNA and human DNA derived from vector integration site sequence reads. The _x_-axis shows the DNA sequence position, where the site of joining to Mu DNA is between positions 10 and 11 (arrow). Perfect sequence conservation has information content of 2 bits (_y_-axis). Note that some bases have little or no information content, so no letters are visible. (B) Information content at adaptor junctions from vector integration sites recovered after cleavage with the restriction enzyme Mse I, where the site of cleavage is between positions 10 and 11 (arrow).

The performance of the restriction enzyme method was next compared with the Mu-mediated method by measuring recovery biases for each method, then annotating the human genome for calculated recovery rates based on the data. Thus, for any integration site in the human genome, the likelihood of recovery at each base pair could be calculated for each recovery method (Supplementary Report 2).

Figure 3A illustrates the relative recovery frequencies using the LMO2 promoter as an example, which was chosen because the gene has been involved in several adverse events in SCID-X1 gene therapy (3,38). Note that although Figure 3A summarizes results on a single region of the genome, the biases were measured genome-wide from integration site data for each method and are shown at LMO2 for purposes of illustration. Recovery was relatively consistent for the Mu-mediated method over all sites, whereas the restriction enzyme methods show sharp peaks and valleys, where valleys indicate locations where an integration event would be difficult or impossible to isolate and the peaks frequently recovered sites that would mask more rare sites. Figure 3B shows the data plotted as the cumulative recovery frequency, where perfectly unbiased recovery would be indicated by a curve that followed the diagonal from lower left to upper right. The Mu-mediated recovery method most closely approaches the diagonal and is significantly closer than even the method using six restriction enzymes (P < 0.001; see Supplementary Report 2 for statistical methods), documenting that the Mu-mediated method is the least biased.

Reduced recovery bias using the Mu-mediated integration site isolation method. (A) Relative recovery rates compared at the LMO2 promoter for (i) the Mu-mediated method; (ii) six tests with single restrictions enzymes or pools or (iii) a mixture of all six. The recovery rates were calculated from data on the placement of integration sites relative to restriction enzyme cleavage sites or Mu transposase sites used in their isolation, using the data in Supplementary Table S1 and statistical methods described in Supplementary Report 2. Calculated integration site recovery rates were then used to annotate each base over 10 kb at the LMO2 promoter (chr11, bases 33770412–33780411). ‘Six enzymes’ indicates pooled data for the six sets below. For a perfectly unbiased method, all such rates equal 1.0. (B) Statistical analysis of biases in recovery of integration sites. The x-axis plots each base of the LMO2 promoter analyzed above, treating each as a potential integration target. Sites were ranked by expected ease of isolation, with the easiest to isolate to the right. The y-axis shows the calculated proportion of sequences recovered given the measured recovery biases. Perfect unbiased recovery would follow a line from lower left to upper right. Statistical methods and P values for pair-wise comparisons are summarized in Supplementary Report 2.

Figure 3.

Reduced recovery bias using the Mu-mediated integration site isolation method. (A) Relative recovery rates compared at the LMO2 promoter for (i) the Mu-mediated method; (ii) six tests with single restrictions enzymes or pools or (iii) a mixture of all six. The recovery rates were calculated from data on the placement of integration sites relative to restriction enzyme cleavage sites or Mu transposase sites used in their isolation, using the data in Supplementary Table S1 and statistical methods described in Supplementary Report 2. Calculated integration site recovery rates were then used to annotate each base over 10 kb at the LMO2 promoter (chr11, bases 33770412–33780411). ‘Six enzymes’ indicates pooled data for the six sets below. For a perfectly unbiased method, all such rates equal 1.0. (B) Statistical analysis of biases in recovery of integration sites. The _x_-axis plots each base of the LMO2 promoter analyzed above, treating each as a potential integration target. Sites were ranked by expected ease of isolation, with the easiest to isolate to the right. The _y_-axis shows the calculated proportion of sequences recovered given the measured recovery biases. Perfect unbiased recovery would follow a line from lower left to upper right. Statistical methods and P values for pair-wise comparisons are summarized in Supplementary Report 2.

Another means of quantifying recovery biases involves comparing the increased sampling effort required to reach the results of a perfectly unbiased method. The slight biases introduced by the Mu method would require only a 10% increase in sampling effort to achieve the efficiency of a perfectly unbiased method (Supplementary Report 2). In contrast, for some restriction enzyme methods, the needed increase is too large to measure accurately (>50-fold). For the pool of all six restriction enzymes, a 45% increase in effort would be needed. Thus the Mu-mediated method yields less biased recovery with much less effort than any form of restriction enzyme-based method.

We next investigated the experimental effort required to recover all members of a fully defined integration site population using the Mu-based method. We prepared an induced pluripotent stem (iPS) cell line in which all cells contained the same six known lentiviral vector integration sites. We carried out 12 independent integration site recovery reactions using the Mu-mediated method, recovering an average of 1068 sequence reads per replicate. We found that six of the 12 replicate reactions recovered all six sites. The replicates with fewer than six yielded either four or five of the sites. To assess the sampling effort required for recovery of all six sites, we pooled different numbers of Mu reactions computationally and assessed recovery. For pools of three of the 12 Mu reactions, 98% contained all six sites. For pools of five Mu reactions, 100% of the pools contained all six sites. Thus, 3–5 independent Mu reactions are enough to completely sample an integration site population of this size at the sequencing depth used. For comparison, three restriction enzymes were tested for recovery of the six sites, and only one yielded all six.

The genome-wide distribution patterns of integration target sites for HIV and γ-retroviruses have been studied extensively (13,14,23,39,40), allowing us to assess whether the Mu-mediated method reported similar trends. For studies of integration frequency near genomic landmarks, restriction enzyme-based methods usually provided an adequate overview, because the restriction biases are only weakly related to those landmarks. Figure 4A and B compares the distributions of integration sites isolated using the two methods for HIV and γ-retrovirus-based vectors. Integration sites are compared from infections of tissue culture cells and for one SCID-X1 patient (3,7). Figure 4A summarizes the relationship of integration sites to genomic features, using a heat map format to indicate increased or decreased integration frequency compared with random distributions. For both HIV and γ-retroviral vectors, the genome-wide trends were closely similar for the Mu-mediated or restriction enzyme-mediated methods. Both HIV-based and γ-retrovirus-based vectors favor integration in regions of high gene density and associated genomic landmarks. Gamma-retroviruses favor integration near gene 5′-ends. HIV and γ-retroviruses show a complex pattern of favored and disfavored integration sites near regions of histone methylation, acetylation and bound proteins, as indicated by comparison to data from ChIP-seq experiments (41–44) (Figure 4B), again showing favored integration near marks of active transcription. These patterns matched closely for the Mu-mediated and restriction-enzyme-based methods. The genome-wide patterns for SCID-X1 gene therapy closely paralleled those seen for γ-retroviral vectors, as reported previously (3,6,7). Thus, conclusions on genome-wide distributions of integration sites from the Mu-mediated method parallel those from extensive previous studies.

Mu-mediated integration site recovery. (A) Integration frequency near genomic land marks. (B) Integration frequency near sites of histone methylation, acetylation or bound chromosomal proteins. Data sets compared are indicated by the column headings, features analyzed by the rows. Heat maps were constructed using the receiver operating characteristic (ROC) area method to compare the observed distributions to random distributions of integration sites (23,45). ROC areas of 0.5 indicated integration sites are present near the indicated genomic features as often as expected by chance. ROC areas >0.5 indicate positive association and areas <0.5 negative association. Associations are colour coded as indicated by the key at the bottom of each map. In (A), for the Gene Density, Expression Density and GC content measures, several different length genomic intervals were used for comparisons, which are indicated by numbers to the right of the black bar. In (B), ChIP-seq analysis was used to map the genome-wide distributions of sites of histone post-translational methylation or acetylation, or bound DNA binding proteins (41–44), and the results compared with integration site distributions. Statistical analysis shows that most associations where discernable colour can be seen in a heat-map tile achieve statistical significance. (C) Comparisons of the numbers and positions of Mu integration sites that allowed recovery of the vector integration sites at HMGA2, VPS13B and POLA2 generated during gene therapy for β-thalassemia.

Figure 4.

Mu-mediated integration site recovery. (A) Integration frequency near genomic land marks. (B) Integration frequency near sites of histone methylation, acetylation or bound chromosomal proteins. Data sets compared are indicated by the column headings, features analyzed by the rows. Heat maps were constructed using the receiver operating characteristic (ROC) area method to compare the observed distributions to random distributions of integration sites (23,45). ROC areas of 0.5 indicated integration sites are present near the indicated genomic features as often as expected by chance. ROC areas >0.5 indicate positive association and areas <0.5 negative association. Associations are colour coded as indicated by the key at the bottom of each map. In (A), for the Gene Density, Expression Density and GC content measures, several different length genomic intervals were used for comparisons, which are indicated by numbers to the right of the black bar. In (B), ChIP-seq analysis was used to map the genome-wide distributions of sites of histone post-translational methylation or acetylation, or bound DNA binding proteins (41–44), and the results compared with integration site distributions. Statistical analysis shows that most associations where discernable colour can be seen in a heat-map tile achieve statistical significance. (C) Comparisons of the numbers and positions of Mu integration sites that allowed recovery of the vector integration sites at HMGA2, VPS13B and POLA2 generated during gene therapy for β-thalassemia.

The Mu-mediated method also allows a new approach to quantifying the relative frequency of gene-corrected cells in patient samples. Gene corrected cell clones that are present in many copies will contribute a relatively larger proportion of their integration site DNA to the genomic DNA pool after purification, providing an increased number of target sites for Mu integration in vitro. As a result, relatively larger numbers of independent Mu integration events will lead to recovery of the same high abundance vector integration site. Quantifying the number of independent Mu integration sites per vector integration site therefore provides a simple measure of the relative abundance.

We tested this in a sample from a β-thalassemia gene therapy trial, in which a cell harbouring a single vector integration site in the HMGA2 gene expanded to comprise more than one-third of the gene-corrected peripheral blood mononuclear cells (PBMC) population, as documented by quantitative PCR assays (8). We recovered 10 independent Mu integration sites for the HMGA2 vector integration site, while all other vector integration sites were recovered with only a single Mu site (Figure 4C). This finding makes the important clinical point that the HMGA2 site is the only high abundance integration site in the expanded cell clone. Using the restriction enzyme cleavage method for integration site recovery, only two out of three of the enzymes used allowed isolation of the HMGA2 site (data not shown).

Lastly, we present a method for suppressing PCR contamination. In assessing collections of gene therapy samples, multiple tubes are commonly processed in parallel using nested PCR, providing an ideal setting for migration of PCR products between samples. In practice controlling contamination is challenging even for experienced experimentalists. We have devised a method that suppresses this, in which separate adaptors are used for each sample, so that contaminating PCR products are not amplifiable outside the correct PCR reaction. In reconstruction experiments, this has been effective at suppressing cross-over in our laboratory (data not shown). Use of multiple adaptors is described in the Standard Operating Procedure in the Supplementary Data for this article.

DISCUSSION

In summary, the Mu-mediated integration site recovery method allows simplified recovery of integration sites and estimation of relative abundance. Use of protocols based on single restriction enzymes results in failure to isolate integration sites that are not near restriction enzyme recognition sites (7,15,16), which can be crucial in monitoring adverse events during gene therapy. The challenges posed by biased isolation have been underestimated in some of the early literature in this field. One previous study attempted to circumvent recovery biases by using six restriction enzymes to study single samples, but even with this added complication and expense, recovery using this method is still more biased than with the Mu-based method (Figure 3B; P < 0.001). Thus in cases where it is critical to use integration site data to identify expanded cell clones, the Mu method is attractive and convenient.

Methods for deep sequencing are in a state of rapid transition. The Mu-mediated method described here can in principal be adapted to any of the next generation sequencing platforms. At this writing, the Solexa/Illumina method is least expensive per base, but potentially inconvenient because analysis of a handful of samples will often take up only a fraction of a run, requiring complicated coordination with others to fill out a run. The 454/Roche method is more expensive per base, but the availability of a benchtop instrument for smaller runs (the ‘Junior’ instrument http://www.gsjunior.com/) simplifies throughput. It is highly likely that additional sequencing methods will become available in the near future that may also be useable with the Mu-mediated method.

The demands placed on integration site recovery technology vary by disease state. The frequency of gene corrected cells varies from 100% (T cells in SCID-X1) to <1% [early adenosine deaminase (ADA) studies], so the demands on the technology differ for different diseases. For very low level clones, recovering integration sites and estimating their abundance is at present challenging for any technology.

Additional methods are starting to be proposed for integration site analysis, including methods based on DNA shearing (N. Gillet, N. Malani, N. Gormley, R. Carter, A. Melamed, D. Bentley, C. Berry, F. Bushman, G. Taylor and C. Bangham, submitted for publication) or limited extension from integrated vectors with a DNA polymerase followed by RNA ligation (‘nrPCR’) (16). Each of these methods is of interest but each may have inefficient steps. For the Mu-mediated method, it can be challenging to obtain enough Mu integration events to query the full human genome efficiently, though the method is suitable for analysis of large numbers of samples with small amounts of starting genomic DNA. Ongoing use has shown the method to be effective in practice. For DNA shearing, it can be challenging to obtain efficient ligation after repairing broken DNA ends, and to work with small amounts of DNA. For the nrPCR method, efficiency may be an issue (16).

Two reports in the peer-reviewed literature document the utility of the Mu-mediated method. In one case, oligoclonal reconstitution during lentiviral vector-mediated gene correction was documented using Mu-mediated recovery of integration sites during β-thalassemia gene correction in mice (46). In this case, the inferred rank order of integration site abundance from sequence read counts was similar to that inferred by quantifying the number of independent Mu-transposition events in vitro that led to site recovery. The congruence of these two measures supports the idea that quantification relative to clonal abundance was consistent. In the second study (21) the Mu-mediated method was used along with other methods and found to be comparably efficient. Long term, it will be useful to compare the effectiveness and convenience of present and future methods for quantifying integration site abundance on defined integration site populations.

FUNDING

Funding for open access charge: National Institute of Health (grants AI52845 and AI082020); New York State Stem Cell Science NYSTEM (N08T-060) University of Pennsylvania Center for AIDS Research and the Penn Genome Frontiers Institute with a grant with the Pennsylvania Department of Health. T.B. is a Special Fellow of the Leukemia & Lymphoma Society.

Conflict of interest statement. The Department of Health specifically disclaims responsibility for any analyses, interpretations, or conclusions.

ACKNOWLEDGEMENTS

The authors are grateful to members of the Bushman laboratory for help and suggestions.

REFERENCES

1

et al.

Gene therapy of human severe combined immunodeficiency (SCID)-X1 disease

,

Science

,

2000

, vol.

288

(pg.

669

-

672

)

2

et al.

Sustained correction of X-linked severe combined immunodeficiency by ex vivo gene therapy

,

N. Engl. J. Med.

,

2002

, vol.

346

(pg.

1185

-

1193

)

3

et al.

Insertional oncogenesis in 4 patients after retrovirus-mediated gene therapy of SCID-X1

,

J. Clin. Invest.

,

2008

, vol.

118

(pg.

3132

-

3142

)

4

et al.

Correction of ADA-SCID by stem cell gene therapy combined with nonmyeloablative conditioning

,

Science

,

2002

, vol.

296

(pg.

2410

-

2413

)

5

et al.

Correction of X-linked chronic granulomatous disease by gene therapy, augmented by insertional activation of MDS1-EVI1, PRDM16 or SETBP1

,

Nat. Med.

,

2006

, vol.

12

(pg.

401

-

409

)

6

et al.

Vector integration is nonrandom and clustered and influences the fate of lymphopoiesis in SCID-X1 gene therapy

,

J. Clin. Invest.

,

2007

, vol.

117

(pg.

2225

-

2232

)

7

Dynamics of gene-modified progenitor cells analyzed by tracking retroviral integration sites in a human SCID-X1 gene therapy trial

,

Blood

,

2010

, vol.

115

(pg.

4356

-

4366

)

8

et al.

Transfusion independence and HMGA2 activation after gene therapy of human beta-thalassaemia

,

Nature

,

2010

, vol.

467

(pg.

318

-

322

)

9

et al.

A serious adverse event after successful gene therapy for X-linked severe combined immunodeficiency

,

N. Engl. J. Med.

,

2003

, vol.

348

(pg.

255

-

256

)

10

et al.

LMO2-associated clonal T cell proliferation in two patients after gene therapy for SCID-X1

,

Science

,

2003

, vol.

302

(pg.

415

-

419

)

11

,

Mobile DNA II

,

2002

Washington, DC, USA

ASM Press

12

,

Lateral DNA Transfer: Mechanisms and Consequences

,

2001

Cold Spring Harbor, NY, USA

Cold Spring Harbor Laboratory Press

13

HIV-1 integration in the human genome favors active genes and local hotspots

,

Cell

,

2002

, vol.

110

(pg.

521

-

529

)

14

Transcription start regions in the human genome are favored targets for MLV integration

,

Science

,

2003

, vol.

300

(pg.

1749

-

1751

)

15

et al.

DNA bar coding and pyrosequencing to analyze adverse events in therapeutic gene transfer

,

Nucleic Acids Res.

,

2008

, vol.

36

pg.

e49

16

et al.

Comprehensive genomic access to vector integration in clinical gene therapy

,

Nat. Med.

,

2009

, vol.

15

(pg.

1431

-

1436

)

17

et al.

Simultaneous assay of every salmonella typhi gene using one million transposon mutants

,

Genome Res.

,

2009

, vol.

19

(pg.

2308

-

2316

)

18

Use of illumina sequencing to identify transposon insertions underlying mutant phenotypes in high-copy mutator lines of maize

,

Plant J.

,

2010

, vol.

63

(pg.

167

-

177

)

19

Gene transfer by lentiviral vectors is limited by nuclear translocation and rescued by HIV-1 pol sequences

,

Nat. Genet.

,

2000

, vol.

25

(pg.

217

-

222

)

20

In vivo gene delivery and stable transduction of nondividing cells by a lentiviral vector

,

Science

,

1996

, vol.

272

(pg.

263

-

267

)

21

et al.

Genomic safe harbors permit high beta-globin transgene expression in thalassemia induced pluripotent stem cells

,

Nat. Biotechnol.

,

2011

, vol.

29

(pg.

73

-

78

)

22

Methods for integration site distribution analyses in animal cell genomes

,

Methods

,

2009

, vol.

47

(pg.

261

-

268

)

23

Selection of target sites for mobile DNA integration in the human genome

,

PLoS Comput. Biol.

,

2006

, vol.

2

pg.

e157

24

Integration target site selection by a resurrected human endogenous retrovirus

,

Genes Dev.

,

2009

, vol.

23

(pg.

633

-

642

)

25

A role for LEDGF/p75 in targeting HIV DNA integration

,

Nat. Med.

,

2005

, vol.

11

(pg.

1287

-

1289

)

26

WebLogo: A sequence logo generator

,

Genome Res.

,

2004

, vol.

14

(pg.

1188

-

1190

)

27

Polynucleotidyl transfer reactions in transpositional DNA recombination

,

J. Biol. Chem.

,

1992

, vol.

287

(pg.

21273

-

21276

)

28

Studies on a “jumping gene machine”: Higher-order nucleoprotein complexes in mu DNA transposition

,

Biochem. Cell Biol.

,

1999

, vol.

77

(pg.

487

-

491

)

29

et al.

Bacteriophage mu integration in yeast and mammalian genomes

,

Nucleic Acids Res.

,

2008

, vol.

36

pg.

e148

30

The phage mu transpososome core: DNA requirements for assembly and function

,

EMBO J.

,

1995

, vol.

14

(pg.

4893

-

4903

)

31

An efficient and accurate integration of mini-mu transposons in vitro: A general methodology for functional genetic analysis and molecular biology applications

,

Nucleic Acids Res.

,

1999

, vol.

27

(pg.

2777

-

2784

)

32

Application of mu in vitro transposition for high-precision mapping of protein-protein interfaces on a yeast two-hybrid platform

,

Methods

,

2009

, vol.

49

(pg.

255

-

262

)

33

A large nucleoprotein assembly at the ends of the viral DNA mediates retroviral DNA integration

,

EMBO J.

,

1997

, vol.

16

(pg.

7511

-

7520

)

34

DNA bar coding and pyrosequencing to identify rare HIV drug resistance mutations

,

Nucleic Acids Res.

,

2007

, vol.

35

pg.

e91

35

The use of coded PCR primers enables high-throughput sequencing of multiple homolog amplification products by 454 parallel sequencing

,

PLoS ONE

,

2007

, vol.

2

pg.

e197

36

Error-correcting barcoded primers for pyrosequencing hundreds of samples in multiplex

,

Nat. Methods

,

2008

, vol.

5

(pg.

235

-

237

)

37

DNA transposition of bacteriophage mu. A quantitative analysis of target site selection in vitro

,

J. Biol. Chem.

,

2002

, vol.

277

(pg.

2843

-

2851

)

38

et al.

Insertional mutagenesis combined with acquired somatic mutations causes leukemogenesis following gene therapy of SCID-X1 patients

,

J. Clin. Invest.

,

2008

, vol.

118

(pg.

3143

-

3150

)

39

et al.

Multilineage hematopoietic reconstitution without clonal selection in ADA-SCID patients treated with stem cell gene therapy

,

J. Clin. Invest.

,

2007

, vol.

117

(pg.

2233

-

2240

)

40

et al.

Distinct genomic integration of MLV and SIV vectors in primate hematopoietic stem and progenitor cells

,

PLoS Biol.

,

2004

, vol.

2

pg.

e423

41

High-resolution profiling of histone methylations in the human genome

,

Cell

,

2007

, vol.

129

(pg.

823

-

837

)

42

et al.

Combinatorial patterns of histone acetylations and methylations in the human genome

,

Nat. Genet.

,

2008

, vol.

40

(pg.

897

-

903

)

43

Dynamic regulation of nucleosome positioning in the human genome

,

Cell

,

2008

, vol.

132

(pg.

887

-

898

)

44

Genome-wide mapping of HATs and HDACs reveals distinct functions in active and inactive genes

,

Cell

,

2009

, vol.

138

(pg.

1019

-

1031

)

45

HIV integration site distributions in resting and activated CD4+ T cells infected in culture

,

AIDS

,

2009

, vol.

23

(pg.

1461

-

1471

)

46

et al.

Distribution of lentiviral vector integration sites in mice following therapeutic gene transfer to treat beta-thalassemia

,

Blood

,

2011

in press

© The Author(s) 2011. Published by Oxford University Press.

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

Supplementary data

I agree to the terms and conditions. You must accept the terms and conditions.

Submit a comment

Name

Affiliations

Comment title

Comment

You have entered an invalid code

Thank you for submitting a comment on this article. Your comment will be reviewed and published at the journal's discretion. Please check for further notifications by email.

Citations

Views

Altmetric

Metrics

Total Views 1,743

1,193 Pageviews

550 PDF Downloads

Since 1/1/2017

Month: Total Views:
January 2017 4
February 2017 5
March 2017 1
April 2017 4
May 2017 6
June 2017 1
July 2017 3
August 2017 6
September 2017 21
October 2017 6
November 2017 3
December 2017 15
January 2018 9
February 2018 16
March 2018 15
April 2018 15
May 2018 14
June 2018 26
July 2018 21
August 2018 21
September 2018 19
October 2018 26
November 2018 24
December 2018 17
January 2019 21
February 2019 17
March 2019 28
April 2019 28
May 2019 28
June 2019 17
July 2019 21
August 2019 18
September 2019 14
October 2019 24
November 2019 23
December 2019 17
January 2020 20
February 2020 8
March 2020 20
April 2020 14
May 2020 12
June 2020 11
July 2020 29
August 2020 28
September 2020 14
October 2020 29
November 2020 22
December 2020 12
January 2021 12
February 2021 22
March 2021 17
April 2021 33
May 2021 40
June 2021 14
July 2021 10
August 2021 10
September 2021 15
October 2021 17
November 2021 17
December 2021 12
January 2022 22
February 2022 19
March 2022 27
April 2022 14
May 2022 19
June 2022 16
July 2022 34
August 2022 25
September 2022 40
October 2022 73
November 2022 24
December 2022 33
January 2023 19
February 2023 12
March 2023 15
April 2023 15
May 2023 15
June 2023 22
July 2023 18
August 2023 19
September 2023 12
October 2023 15
November 2023 8
December 2023 18
January 2024 19
February 2024 25
March 2024 28
April 2024 29
May 2024 28
June 2024 16
July 2024 28
August 2024 23
September 2024 13
October 2024 8

Citations

60 Web of Science

×

Email alerts

Citing articles via

More from Oxford Academic