Clinical significance of minimal residual disease in childhood acute lymphoblastic leukemia and its relationship to other prognostic factors: a Children's Oncology Group study (original) (raw)

Abstract

Minimal residual disease (MRD) is an important predictor of relapse in acute lymphoblastic leukemia (ALL), but its relationship to other prognostic variables has not been fully assessed. The Children's Oncology Group studied the prognostic impact of MRD measured by flow cytometry in the peripheral blood at day 8, and in end-induction (day 29) and end-consolidation marrows in 2143 children with precursor B-cell ALL (B-ALL). The presence of MRD in day-8 blood and day-29 marrow MRD was associated with shorter event-free survival (EFS) in all risk groups; even patients with 0.01% to 0.1% day-29 MRD had poor outcome compared with patients negative for MRD patients (59% ± 5% vs 88% ± 1% 5-year EFS). Presence of good prognostic markers TEL-AML1 or trisomies of chromosomes 4 and 10 still provided additional prognostic information, but not in National Cancer Insitute high-risk (NCI HR) patients who were MRD+. The few patients with detectable MRD at end of consolidation fared especially poorly, with only a 43% plus or minus 7% 5-year EFS. Day-29 marrow MRD was the most important prognostic variable in multi-variate analysis. The 12% of patients with all favorable risk factors, including NCI risk group, genetics, and absence of days 8 and 29 MRD, had a 97% plus or minus 1% 5-year EFS with nonintensive therapy. These studies are registered at www.clinicaltrials.gov as NCT00005585, NCT00005596, and NCT00005603.

Introduction

The presence of minimal residual disease (MRD) following therapy for acute lymphoblastic leukemia (ALL) has been shown to be an important prognostic marker in many studies.120 MRD is typically detected either by polymerase chain reaction (PCR) amplification of clonotypic immunoglobulin or T-cell receptor gene rearrangements2026 or by flow cytometry,2741 the latter based on the principle that leukemic cells express combinations of antigens that are different from those present on normal bone marrow cells. The former technique can be more sensitive, though to achieve adequate sensitivity it is necessary to synthesize optimized clone-specific reagents. As a consequence, it is difficult to obtain real-time data that could be used for early intervention.

Molecular detection of MRD has been well standardized.25,4244 Though less widely standardized,37,45 flow cytometry is faster, generally less expensive, and provides informative results in a higher percentage of patients than molecular methods. For these reasons, flow-based MRD assessment has the potential for rapidly identifying patients at increased risk of relapse, allowing for prompt changes in therapy, including earlier intensification.7 Both PCR and flow have successfully been used to help risk-stratify patients, and while there is generally concordance between the methods in direct comparisons,46,47 individual patients may not always be classified in the same way by each method.48

Although the prognostic significance of MRD in ALL is well established, and is used as a criterion for risk stratification in many current studies,49,50 most published studies have been relatively small. In childhood ALL in particular, the value of MRD must be weighed against other well-established prognostic variables, including age, white blood cell count, cytogenetic features of blasts, and conventional assessment of response to therapy.5057 Although MRD has been shown to retain prognostic significance after adjusting for some common risk factors,4,6,19 the relationship between MRD and other prognostic factors has been incompletely explored. It is not clear if MRD by itself is all that is needed to predict outcome, if other risk factors add additional information to that obtained by MRD, or whether there are complex interactions between MRD and other factors. For example, we previously showed a difference between the frequency of positive MRD results at end induction in patients with the 2 most common favorable genetic lesions: the TEL-AML1 translocation and simultaneous trisomies of chromosomes 4 and 10, which raised the question of whether MRD at end induction has the same significance in both groups.28

In 1999, the legacy Pediatric Oncology Group of the Children's Oncology Group began a prospective study of MRD in all patients enrolled on the classification/induction study P9900 (supplemental data available on the Blood website; see the Supplemental Materials link at the top of the online article). Patients enrolled on this study had MRD measured by flow cytometry at a single central reference laboratory in the peripheral blood (PB) at day 8, and in the bone marrow (BM) at the end of induction (day 29). Blasts from patients with precursor B-cell ALL (B-ALL) were analyzed by reverse transcriptase–PCR (RT-PCR) and fluorescence in situ hybridization (FISH) methods at a centralized reference laboratory to determine common cytogenetic abnormalities associated with prognosis. Based on results of these studies, and on other clinical and laboratory features, patients were assigned to one of 4 postinduction treatment protocols. For patients entering the low-, standard-, and high-risk protocols, MRD was again measured in the BM at the end of consolidation. This report describes the relationship of MRD to outcomes for these patients. (Patients entering the very-high-risk protocol are not reported on here.) Our results demonstrate that end-induction MRD is the single most powerful prognostic marker and that it retains validity in all clinical- and laboratory-defined risk groups. We also show that risk grouping is improved by taking into account MRD assessment performed earlier in therapy as well as other clinical features and genetic characteristics of the leukemia.

Methods

Patients

This study was approved by the institutional review board (IRB) of Johns Hopkins University and all participating institutions, and informed consent was obtained in accordance with the Declaration of Helsinki. All patients were enrolled on P9900 (the classification study), which required shipment of PB and BM samples to reference laboratories at the University of New Mexico and the Johns Hopkins Medical Institutions. Patients with precursor B-ALL, as confirmed by the Johns Hopkins laboratory, were eligible for postinduction treatment on therapeutic protocols based on risk-group assignment at the end of induction. From January 2000 through March 2005, 3686 children were enrolled on the study, among whom were 3303 patients with precursor B-ALL. Postinduction risk-stratification was based on NCI risk criteria, central nervous system (CNS) and testicular status, and on genetic features of their leukemia cells as previously described.28 Briefly, patients on P9904 included NCI standard-risk (SR) patients with a TEL-AML1 translocation or simultaneous trisomies of chromosomes 4 and 10, while patients on P9906 included NCI high-risk (HR) patients without favorable genetics who met age-, white blood cell count (WBC)–, and sex-specific criteria for especially high-risk disease originally described by Shuster,28,58 or any patient with rearrangement of the MLL gene, CNS3 status, or testicular involvement. Patients on P9905 included a mixture of NCI SR patients without favorable genetic lesions, NCI HR patients with favorable genetic changes, and other NCI HR patients not meeting the Shuster criteria. Patients younger than 1 year of age, those with Philadelphia chromosome–positive (Ph+) ALL or hypodiploid ALL, and those failing induction chemotherapy were not eligible for any of these studies, and their MRD results are not included here. All patients or their parents or guardians gave separate informed consent for the classification and treatment studies, including consent for measurement of MRD. MRD results were not made available to physicians caring for the patients, or used to adjust therapy. A total of 827 evaluable patients with precursor B-ALL were enrolled on P9904; 1049 were enrolled on P9905; and 267 were enrolled on P9906.

Samples were sent to the Johns Hopkins Reference Laboratory for MRD testing from PB at day 8 of induction, and from BM at the end of induction (day 29) and end of consolidation (weeks 22–30 depending upon the specific protocol). Of the 2143 patients enrolled on 9900 series treatment protocols, samples were submitted from 1946 patients at day 8, 2086 patients at day 29, and 1470 patients at end of consolidation. A day-8 BM was also performed and interpreted at local institutions to assess morphologic response to therapy.

Molecular and cytogenetic testing

Ficoll-Hypaque–purified BM samples were studied by RT-PCR for the presence of the common translocations E2A-PBX1, TEL-AML1, BCR-ABL, and MLL-AF4 as previously described.28 Other MLL gene rearrangements were detected by FISH studies using a break-apart probe strategy. Most patients had conventional cytogenetic studies by Children's Oncology Group (COG)–certified local laboratories with COG central review of karyotypes. In addition, aliquots were stained with propidium iodide and analyzed by flow cytometry to determine DNA index. All specimens with a DNA index greater than 1.0 had centromeric probe testing for trisomies 4 and 10 as previously described.28

Therapy

Drug doses and the schedule for drug administration are provided in Tables S1,S2. Induction therapy was determined by initial WBC count and age using NCI criteria and the presence or absence of CNS3 or testicular disease. Patients who were NCI SR by age and WBC count (age older than 1 year and less than 10 years; WBC < 50 000/μL) without CNS3 or overt testicular disease were assigned to a 3-drug, dexamethasone-based induction; all other patients were assigned to a prednisone-based 4-drug induction.

An induction death rate of 1.3% among the first 900 patients led to the closure and amendment of the 3-drug induction on November 27, 2002. Two changes were made prior to reopening. The intrathecal therapy on day 1 was changed from methotrexate (MTX) to cytarabine, and the 6 doses of native asparaginase at 10 000 IU/m2 each were replaced with a single dose of PEG asparaginase. Subsequently, there were 2 deaths (0.38%) from infection among 527 patients.

Postinduction therapy on protocols 9904 and 9905 included 6 courses of intravenous methotrexate with leucovorin rescue with randomized assignment to either 1 g/m2 as a 24-hour infusion or 2 g/m2 as a 4-hour infusion. Patients enrolled on 9904 with the TEL-AML1 translocation with or without trisomies of chromosomes 4 and 10 were also eligible for randomized assignment to receive or not receive a delayed intensification (DI). Patients enrolled on 9905 were eligible for both the MTX randomization and the DI randomization, except that patients with favorable genetics meeting the Shuster high-risk criteria and patients with E2A-PBX1 were assigned to DI and only eligible for the MTX randomization. Study 9906 was a single-regimen study using an “augmented BFM” regimen adapted from Nachman et al.59

Flow cytometry

MRD was detected by 4-color flow cytometry as previously described.28,41 In most cases, the 2-antibody combinations CD20-FITC/CD10-PE/CD45-PerCP/CD19-APC and CD9-FITC/CD34-PE/CD45-PerCP/CD19-PE were sufficient to identify leukemic cells. A minimum of 500 000 events were collected. For day-8 PB analyses, cell numbers frequently precluded acquiring such a number, although in all cases a minimum of 100 000 events were analyzed. When cell numbers were limiting, only the most informative single-antibody combination was used for analysis. Sensitivity of .01% was achieved in most cases. In 4% of cases, either cell numbers were limiting or the phenotype overlapped substantially with that of normal B cells so that sensitivity at the end of induction was limited to 0.1%; such cases were excluded from analyses that used an MRD threshold of .01%. MRD results were classified as indeterminate in 1.4% of cases, and no results were obtained because no sample was provided in an additional 2.7%, so that overall we obtained satisfactory results at the end of induction at a sensitivity of at least .01% in 92% of patients enrolled on therapeutic studies.

Statistical analyses

Event-survival estimates were obtained using the Kaplan-Meier method,60 and standard errors of the estimates were calculated by the method of Peto and Peto.61 Time to event was calculated as the time from study entry to first event (relapse, secondary malignancy, or death) or date of last contact. The log-rank test was used for comparison of survival curves between various groups. Multivariate analysis was conducted by using Cox proportional hazards regression.62 Categoric data were compared between groups by using the chi-square or Fisher exact test. All tests were conducted at a significance level of 5%. All analyses were carried out using SAS (SAS Institute, Cary, NC) and R (R Foundation for Statistical Computing, Vienna, Austria; http://www.R-project.org) software packages.

Results

End-induction MRD is a strong prognostic factor

Figure 1 shows the event-free survival (EFS) of 1971 patients enrolled on COG studies P9904, P9905, and P9906 for whom we have MRD results at the end of induction. In general, patients with higher levels of MRD do worse, with those having 0.1% to 1% MRD having a 49% plus or minus 6% 5-year EFS, and those with more than 1% MRD having only a 30% plus or minus 8% EFS (not shown). However, even those patients with MRD levels between 0.01% and 0.1% have a much worse outcome (5-year EFS, 59% ± 5%) than those with no MRD (EFS, 88% ± 1%), suggesting that .01% is an appropriate cutoff for identifying patients at increased risk of relapse. As shown in Table 1, the cutoff of .01% MRD was prognostic on each study separately, and in both NCI SR and HR patients in study P9905, which included both subsets of patients.

Figure 1.

Figure 1

EFS of all patients enrolled on 9900 series therapeutic studies with satisfactory end-induction MRD. The 5-year EFS values plus or minus SE are shown for patients with varying levels of MRD. The outcome of those with high levels of MRD is very poor, but even those with 0.01% to 0.1% MRD have only a 59% plus or minus 5% 5-year EFS.

Table 1.

Effect of end-induction MRD on outcome by therapeutic protocol and NCI risk group

Study/risk group 5-y EFS, % ± SE (no.)
MRD− MRD+ (> .01%)*
9904/SR 95 ± 1 (668) 64 ± 7 (105)
9905/SR 89 ± 2 (464) 59 ± 6 (119)
9905/HR 79 ± 4 (295) 33 ± 8 (74)
9906/HR 72 ± 5 (161) 34 ± 8 (85)

End-induction MRD appeared to predict both early (within 3 years) and late relapse. The relapse-free survival of patients with and without MRD, with all patients censored at 3 years, was used to assess the effect on early relapse, while a similar analysis limited to patients in remission at 3 years was used to assess late relapse (Figure 2). In this analysis (censoring all nonrelapse events), we found an early relapse rate of 6.8% among MRD− patients and 28% among MRD+ (> .01%) patients (P < .001). Similarly, the late relapse rate was 4.6% among MRD− compared with 24% in MRD+ patients (P < .001). However, because many patients have not been followed for 3 years, not all late relapses have occurred, so the magnitude of the effect of MRD on late relapse is uncertain.

Figure 2.

Figure 2

Relapse-free survival showing the effect of end-induction MRD on early and late relapse. (A) Early relapse. (B) Late relapse. MRD positivity is defined as greater than .01%. (A) All patients were censored at 3 years from diagnosis. In addition, all nonrelapse events occurring during the first 3 years were censored. (B) Only patients who were in remission at 3 years from diagnosis are included in the analysis and again, all nonrelapse events are censored.

Interaction of favorable genetic features and end-induction MRD

Figure 3 shows separately the outcome of NCI SR patients with favorable cytogenetic features (TEL-AML1 translocation or double trisomies [DT] of chromosomes 4 and 10). MRD is strongly prognostic in both subgroups. Although these patients fare very well overall with relatively nonintensive therapy, the proportion of MRD+ patients is much lower in the TEL-AML1+ group than in the DT group (9% vs 19%; P = .001). The frequency of MRD positivity in the DT patients was similar to that seen in all standard-risk patients, and although these patients fared better than other standard-risk patients, presence of MRD was still associated with a worse outcome.

Figure 3.

Figure 3

EFS of NCI SR patients with favorable genetic features. (A) TEL-AML1. (B) Double trisomies. The very few patients with both lesions are included in panel A as a function of end-induction MRD. Outcome of MRD+ patients in both groups is much worse than those who are MRD−. The 5-year EFS is indicated on each curve as appropriate.

Although MRD was an important prognostic marker in patients with and without favorable cytogenetic features, the prognostic significance of cytogenetics depended upon MRD status. Favorable cytogenetic features were associated with better prognosis among patients who were MRD−, with a 5-year EFS of 92% plus or minus 1% for those with favorable genetic features compared with 83% plus or minus 2% for those without (P < .001). Although TEL-AML1/DT patients who were MRD+ also had a statistically significant better 5-year EFS than those without favorable genetics (57% ± 6% vs 45% ± 4%; P < .02), their outcome was not that good. The difference in the effect of favorable genetics among MRD+ and MRD− patients was more readily appreciated when NCI SR and HR patients were looked at separately, and was particularly striking among NCI HR patients. HR MRD− TEL-AML1 or DT patients had an 83% plus or minus 5% 5-y EFS compared with a 72% plus or minus 4% EFS for MRD− patients without favorable genetics (P = .004). For HR patients who were MRD+, however, the 5-year EFS was 34% plus or minus 12% for patients with favorable genetics and 33% plus or minus 6% for those without (P = .96). For NCI SR patients, the corresponding figures are 94% plus or minus 1% versus 89% plus or minus 2% for MRD− patients (P = .002), and 64% plus or minus 7% versus 58% plus or minus 6% for MRD+ patients (P = .18).

Prognostic significance of day-8 PB MRD

The 88% 5-year EFS for patients who were negative in the BM at the end of induction, while better than that for those who were positive, does not define a group that needs no further treatment intensification. Moreover, 51% (178 of 348) of all treatment failures occurred in this group of patients. To try to identify patients with better outcome and/or fewer events, we examined MRD at an earlier time point, specifically in PB at day 8. Overall, 1323 (68.9%) of 1920 of patients with satisfactory day-8 PB samples were MRD+ at a level of greater than .01%. Figure 4 shows that the presence of day-8 PB MRD was associated with adverse prognosis, and that increasing levels were associated with a progressively poorer outcome. The 5-year EFS of day-8 PB MRD− patients was 90% plus or minus 2% and only 16% of treatment failures were seen in the 31% of patients who were day-8 PB MRD−. The 5-year EFS for MRD+ patients was 86% plus or minus 3% for those between .01% and 0.1% MRD; 79% plus or minus 3% for those between 0.1% and 1%; 67% plus or minus 4% for those between 1% and 10%; and 54% plus or minus 7% for those greater than 10%.

Figure 4.

Figure 4

EFS of all patients enrolled in therapeutic studies as a function of level of day-8 PB MRD. There is a stepwise decrement in 5-year EFS at each 10-fold increase in MRD level.

As shown in Figure 5, patients who had high levels of day-8 PB MRD had a relatively poor outcome even if they cleared their BM of MRD by day 29. Day-29 MRD− patients with more than 1% day-8 MRD had a 5-year EFS of 79% plus or minus 4% compared with 90% plus or minus 1% for those with day-8 PB MRD of 1% or less. This effect of slow early clearance was most striking among NCI HR patients, where those with more than 1% day-8 MRD had only a 63% plus or minus 8% 5-year EFS even if they became MRD− by day 29 (vs 81% ± 3%; P = .006)

Figure 5.

Figure 5

Prognostic significance of day-8 blood MRD in patients who are free of MRD in bone marrow by day 29. Patients with high levels (defined as > 1%) MRD at day 8 fare worse (5-year EFS of 79% ± 4%) than those with lower levels (90% ± 1%), even if they become MRD− by day 29. This difference was especially important in NCI HR patients (see “Results”).

Multivariate analysis of prognostic factors affecting outcome

Multivariate Cox regression analysis (with main effects only) was used to determine the relative importance of different prognostic factors that were available at diagnosis, or within the first month of therapy. As shown in Table 2, end-induction MRD was the most powerful prognostic factor, followed by NCI risk group, presence of favorable trisomies, and day-8 PB MRD. After adjusting for these variables, TEL-AML1 translocation did not achieve statistical significance (P > .1). Of note, morphologic assessment of early response, as measured by achieving an M1 BM on day 8, was also not significant.

Table 2.

Cox multivariate analysis

Variable Hazard ratio P
Day-29 MRD > .01% 4.31 < .001
NCI risk group 2.25 < .001
Trisomies 4 and 10 .570 < .001
Day-8 MRD (PB) > .01% 1.51 .018
TEL-AML1 .778 .151
Day-8 M1 marrow 1.034 .789

Identification of a subgroup of patients with excellent outcome on minimal therapy

Because several prognostic factors were significant in a multivariate model, we looked for ways to combine these factors to identify patients with an exceptionally good outcome. We found that we could identify patients with a 97% plus or minus 1% 5-year EFS achieved with limited therapy. These patients, encompassing approximately 12% of all patients enrolled in these trials, were a subset of those enrolled on the P9904 low-risk trial—defined by meeting NCI SR criteria, without CNS3 or testicular disease, having either DT or _TEL-AML1_—and an absence of MRD in both the day-8 PB and day-29 BM samples. As shown in Figure 6, no other combination of features identified as favorable a prognostic group. For example, NCI SR patients negative for MRD at both day 8 and day 29, but lacking favorable genetic features, had a 92% plus or minus 3% 5-year EFS (P = .020 vs the best group), whereas the NCI SR patients with favorable genetics and absence of day-29 MRD, but with positive day-8 PB MRD, had a 93% plus or minus 2% 5-year EFS (P = .024).

Figure 6.

Figure 6

EFS among variably defined groups of good-prognosis patients. NCI SR patients with favorable genetic features who were MRD− at both day 8 and day 29 were the best group, with a 97% plus or minus 1% 5-year EFS. They had statistically better outcomes than either patients without the genetic features who had the same MRD characteristics (92% ± 3%; P = .020) or end-induction MRD−, favorable genetic patients who were day-8 MRD+ (93% ± 2%; P = .024).

Prognostic significance of end-consolidation MRD

MRD was measured after consolidation therapy, varying from 22 to 30 weeks after the start of induction therapy depending upon the treatment protocol. (In the case of 9906, the measurement was at the end of interim maintenance therapy at week 22, but for convenience these will all be referred to as end-consolidation MRD.) Although relatively few patients (4.8%) were MRD+ greater than .01% at this time point, presence of MRD was associated with a very poor outcome, with a 5-year EFS of 43% plus or minus 7% (Figure 7). End-consolidation MRD was prognostic across studies, with those who were positive having a 5-year EFS of 56% plus or minus 14% on 9904 (n = 14); 46% plus or minus 11% on 9905 (n = 29); and 25% plus or minus 15% on 9906 (n = 15). End-consolidation MRD was also prognostic in multivariate analysis, with a hazard ratio of 2.25 (P < .001), entering the model in a stepwise regression after day-29 MRD and NCI risk group (not shown).

Figure 7.

Figure 7

EFS of all patients as a function of level of end-consolidation MRD. Patients who were MRD+ (> .01%) had a significantly inferior outcome, with a 5-year EFS of 43% plus or minus 7%. This effect was seen on each of the 3 therapeutic studies (see text). Prognostic significance of end-consolidation MRD in all patients enrolled on therapeutic studies.

Discussion

The presence of MRD has been shown to be a very strong predictor of relapse in both children and adults with ALL.120 This has been true across studies using different analytic methods and treatment strategies. In almost all studies, MRD has been shown to be prognostic at essentially every time point studied, though the most useful measurements appear to be relatively early in therapy, during or after induction and early in consolidation.2,4,69,15,16,19 However, many additional variables are associated with outcome, especially in childhood ALL.50,57 Although some studies have been able to demonstrate that MRD is prognostically important even after adjusting for other risk factors,4,6,19 or in patients of a defined risk group,1,20,63,64 most studies have been too small to be able to investigate fully the relationship between MRD and other prognostic factors in outcome prediction.

We evaluated the prognostic significance of MRD determined via flow cytometry in nearly 2000 children with ALL entered on the COG classification study P9900, in which blast cell cytogenetics were determined uniformly for purposes of risk assignment. This sample size allowed us to determine the prognostic significance of MRD in the context of therapies of varied intensity, and also to explore the interaction of MRD with other known risk factors. These protocols had randomizations to different treatments, but data are presented here without respect to the specific therapeutic regimens. However, there was no difference in the proportion of MRD+ and MRD− patients assigned to each of the treatment arms, so it is unlikely that any of our conclusions regarding MRD and outcome are due to differences in postinduction therapy.

Importantly, the methodology we used is robust, as informative day-29 MRD data with a sensitivity of at least 0.01% were generated for 92% of patients overall, and 94% of those for whom specimens were submitted to the central reference laboratory. Results were available within 24 hours of sample receipt. In the current generation of COG ALL trials, we use the end-induction MRD value to alter the intensity of postinduction therapy, and data are available for real-time clinical interventions in more than 95% of more than 5000 patients enrolled to date.

MRD was a strong predictor of outcome in all analyses. In general, higher levels of MRD were associated with increasingly poor outcome, but even those patients with as little as 0.01% to 0.1% had worse EFS, suggesting that .01% is the most appropriate cutoff to identify higher-risk patients for potential intervention. Whereas patients at increasingly poor risk by conventional characteristics had a progressively worse outcome regardless of whether they were MRD+ or not (Table 1), those who had poor clinical and laboratory risk factors in addition to MRD had the worst outcome. In particular, NCI HR patients who were day-29 MRD+ had a 5-year EFS of less than 35% regardless of whether they were treated on P9905 or P9906.

To our knowledge, this is the first study to demonstrate the provocative result that end-induction MRD appears to predict late as well as early relapse. Although the mechanism of early and late relapse appears to be different,65 in both cases we detect resistant tumor cells early in therapy. There were proportionately approximately 4 times as many early relapses and 5 times as many late relapses in the MRD+ group than in the MRD− group. Although the magnitude of the latter effect may not be accurately estimated (because not all late relapses have yet occurred), it is essentially impossible that enough additional relapses could occur to affect this conclusion.

Day-29 MRD was the most important prognostic factor in a Cox stepwise linear regression analysis. MRD was prognostic across all risk groups, including those with favorable genetic features. The higher rate of MRD in patients with favorable trisomies compared with those with TEL-AML1 translocations had led us previously to suggest that the prognostic significance of early MRD might not be seen in the former because of intrinsically slower clearance of the disease.28 However, this is not true; MRD was still an important risk factor in these patients. MRD did not, however, fully account for outcome. When adjusted for MRD, blast genetic features were still a significant prognostic factor, though the relationships were complex. In patients who were MRD−, favorable genetic features were associated with better prognosis, both among NCI SR and NCI HR patients. However, particularly in HR patients, favorable cytogenetic features were not sufficient to overcome the adverse prognosis of MRD.

In our series, about half of events occurred among patients who were MRD− at the end of induction. This contrasts with the results of many studies that use molecular methods, which find very few failures among patients who are MRD− at end induction.1,11,15,19,20 This may in part reflect the greater sensitivity of molecular MRD testing. Other studies that have used flow methods have seen a relatively higher proportion of events in patients who were MRD− at the end of induction.6,9 It is also possible that we missed some MRD+ patients, because our method only detects viable cells and small numbers of residual tumor cells may not survive shipping. However, it should also be noted that the intensity of therapy given to many of our patients was lower than in most other series, and the prognostic importance of MRD has been shown to depend on therapeutic regimen.66 It is thus possible that with different therapy, our MRD− cohort would have a better outcome.

While detection of MRD at the end of induction helps to identify patients at high risk of relapse, it did not do as well at identifying patients at low risk. To address this, we turned to an earlier time, measuring MRD in the PB at day 8. While PB MRD has been shown not to be equivalent to BM MRD in precursor B-ALL,67,68 the prognostic significance of PB MRD itself has not been established. We found that, as with day-29 BM, the presence of MRD in PB at day 8 is associated with adverse outcome, with an increasingly bad outcome seen with progressively higher levels. Most important, only 16% of events were seen among patients who were day-8 MRD−, indicating that this is an even better prognostic group than those who were MRD− in the BM at day 29. However, in order to identify the best group of patients, it was necessary to combine MRD response with other patient characteristics. The patients with the best outcome were NCI SR patients who were MRD− at both day 8 and day 29, and whose blasts had favorable cytogenetic features. Only 7 patients of 261 have had treatment failures (only 5 of which were relapses); this accounts for only 2% of the total failures in the series. This outstanding outcome was attained with relatively limited antimetabolite-based chemotherapy. All SR DT patients and half of the SR TEL-AML1+ patients received this therapy, which included no DI phase, no anthracyclines, and no alkylating agents. The remaining one-fourth of patients (half the TEL-AML1+ group) received a single DI phase. Thus, MRD can identify a cohort of 10% to 15% of patients with precursor B-ALL for whom the least toxic therapy may be most appropriate. These patients will not benefit from more intensive therapy, and can avoid the risks associated with treatment intensification.

The value of looking at clearance of leukemic blasts earlier in therapy has been recognized by others as well. Coustan-Smith et al7 demonstrated a 6% cumulative relapse rate among 51 patients with less than .01% blasts by flow cytometry at day 19 of therapy. In another study using molecular methods of detection, there were no failures seen in 14 patients who had fewer than 10−4 blasts by day 15.16 Day-15 MRD has also been suggested as a means to identify patients at especially high risk,8 and indeed our data also suggest that by using flow cytometry with a high cutoff for positivity, it is possible to identify a poor-risk group of patients as early as day 8. The 5-year EFS of patients with more than 10% leukemic cells in the PB at day 8 was only 54%, and those with more than 1% MRD who cleared their BMs by day 29 had only a 79% 5-year EFS (and only 63% among NCI HR patients).

The patients with the poorest outcome were those who had persistent MRD at the end of consolidation therapy. This time point varied from 22 to 30 weeks depending upon the specific protocol. Not surprisingly, the highest-risk patients treated on 9906 had the worst outcome, with a 25% 5-year EFS. However, it is more surprising that end-consolidation MRD+ patients did as well as they did; in spite of the fact that these patients only got maintenance chemotherapy from then on, the overall 5-year EFS was more than 40%. Most studies that looked at a second postinduction time point have looked earlier in therapy than we have, so direct comparison with those results is difficult. In general, patients with persistent MRD long into therapy have a very poor outcome6,19,69; in most series, however, relatively small numbers of patients continue to be positive this late. In addition, the ability to rescue such patients has been called into question,69 so the practical value of identifying such patients is limited.

It is interesting to contemplate how our results might be used to refine risk grouping for treatment purposes. One of the advantages of flow cytometry over sensitive molecular techniques for detecting MRD is that results are potentially available very early in therapy, and changes can be made shortly after, or conceivably even during induction. Early intensification of therapy has been shown to help patients with slow response to therapy assessed by BM morphology.70 In current COG studies, patients with more than 0.1% end-induction MRD are assigned to augmented therapy during the immediate postinduction period; this threshold was chosen before the outcome results described here were available and was deliberately designed to be conservative, based on published data that most closely reflected the methods and therapy currently used by COG.9 Although these patients do in fact have poor outcome, results of the current study suggest that more intensive intervention may be desirable for all patients with MRD greater than .01% at the end of induction. Whether this intervention should be the same regardless of other risk factors is not yet clear, especially because the results presented here were obtained with many patients receiving less-intensive therapy than is used at the current time. As noted, absence of day-8 PB MRD, combined with other favorable factors, can be used to identify a group of patients in whom to consider more limited or even de-escalation of therapy. Higher levels of day-8 MRD could also be used to identify additional patients for intensification, possibly even during induction. Most significantly, however, by combining day-8 PB and day-29 BM MRD, there is no longer any prognostic significance to morphologic assessment of BM at day 8, indicating this conventional measure of early response55 may not be necessary in the future.

In summary, we have demonstrated that MRD after induction therapy is the most important prognostic factor for outcome in children with ALL. Measurement of MRD in the PB at day 8 provides additional useful information, especially to help identify patients at low risk of relapse, and with these 2 measurements of response, day-8 BMs are no longer necessary to assess response. However, additional clinical and laboratory features, especially the presence of favorable cytogenetic abnormalities among patients who are MRD−, do provide additional prognostic information; by combining all favorable good risk features, it is possible to identify a group of patients that accounts for about 12% of all children with precursor B-ALL who are almost certain to be cured with limited therapy that does not include any anthracyclines or alkylating agents.

Supplementary Material

[Supplemental Tables]

Acknowledgments

The authors are grateful to Karen Bowles and Shirley Fuller for excellent technical assistance with the MRD studies, and to the physicians and clinical research associates at the COG institutions participating in this study for cooperation with the sample submission requirements.

This work was supported by grants R01 CA86011 and U10 CA98543 from the National Cancer Institute.

Footnotes

The online version of this article contains a data supplement.

Presented in part at the 2006 American Society of Hematology Meeting, Orlando, FL.

The publication costs of this article were defrayed in part by page charge payment. Therefore, and solely to indicate this fact, this article is hereby marked “advertisement” in accordance with 18 USC section 1734.

Authorship

Contribution: M.J.B. and B.M.C. designed research, collected, analyzed, and interpreted data, and drafted the manuscript; M.D. analyzed and interpreted data and performed statistical analysis; S.P.H. analyzed and interpreted data and drafted the manuscript; W.P.B., P.L.M., and N.W. collected, analyzed, and interpreted data; A.J.C., D.V., and C.L.W. collected data; W.L.C. analyzed and interpreted data; S.L. performed statistical analysis; D.J.P. designed research and collected, analyzed, and interpreted data.

Conflict-of-interest disclosure: The authors declare no competing financial interests.

Correspondence: Michael J. Borowitz, Johns Hopkins Medical Institutions, Department of Pathology, Weinberg 2335, 401 N Broadway, Baltimore, MD 21231; e-mail: mborowit@jhmi.edu.

References

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

[Supplemental Tables]