1,671
Views
7
CrossRef citations to date
0
Altmetric
Original Articles: Clinical

[18F] Positron emission tomography response after rituximab-containing induction therapy in follicular lymphoma is an independent predictor of survival after adjustment for FLIPI in academic and community-based practice

, , , , , , & show all
Pages 809-815 | Received 01 Feb 2016, Accepted 07 Jul 2016, Published online: 12 Aug 2016

Abstract

Positron emission tomography (PET) after induction therapy in follicular lymphoma (FL) is predictive of survival in clinical trials. We describe use of PET and computed tomography (CT) after rituximab-based induction therapy in FL patients followed by the National LymphoCare Study and explore the association between imaging response assessment and survival. Among 1289 patients, imaging consisted of: PET ± CT (35%), CT alone (42%), other/no imaging (24%). Median follow-up was 7.6 years. In unadjusted analyses, positive PET ± CT and CT were prognostic of inferior OS (HR 1.78; 95% CI: 1.16–2.72 and HR 1.61, 95% CI: 1.13–2.29, respectively) and PFS (HR 1.63, 95% CI: 1.21–2.20 and HR 1.45, 95% CI: 1.12–1.89, respectively). Adjusting for FL International Prognostic Index, PET remained predictive of OS (HR 1.54, 95% CI: 1.01–2.36) and PFS (HR 1.54, 95% CI: 1.14–2.07). Residual disease via PET in FL is prognostic of survival in clinical practice.

Introduction

Follicular lymphoma (FL) is a common, indolent lymphoma with a wide range of clinical behaviors.[Citation1,Citation2] General characteristics of FL include multiple relapses and risk of transformation to aggressive high-grade lymphoma.[Citation2] FL patients often undergo imaging studies during watchful waiting, treatment, and after treatment conclusion.

The high sensitivity of [18F] fluorodeoxyglucose (FDG) positron emission tomography (PET) in FL is well recognized,[Citation3,Citation4] and residual FDG-avidity at the end of front-line chemoimmunotherapy for FL may portend poorer progression-free survival (PFS),[Citation3–8] and most recently, overall survival (OS) [Citation9,Citation10] in patients on clinical trials. Utilizing data from patients on clinical trials, retrospective studies comparing PET to computed tomography (CT) scans have also shown that PET-assessed response may be better in predicting PFS and OS than conventional imaging response by CT scans alone.[Citation6,Citation9] Recent Lugano Classification formally included PET in assessments of FDG-avid non-Hodgkin lymphoma (NHL) such as FL.[Citation11] The Lugano Classification proposed using PET response for end-of-treatment assessment, reserving CT-based response in low or variable FDG-avid lymphomas.[Citation12] By providing standardization for PET imaging response, the revised Lugano Classification aimed to provide recommendations that could lead to improved therapies for NHL.

The prospective, observational National LymphoCare Study (NLCS) database, in which more than 2700 newly diagnosed FL patients from over 200 practice sites in the USA were monitored for presentation, prognosis, treatment, and clinical outcomes from 2004 to 2007, is a distinctive opportunity to evaluate the use of imaging in FL in clinical practice. This study first describes imaging patterns and clinical use of PET and CT after first-line rituximab (R)-based induction therapy, and then examines and compares the prognostic impact of end-of-induction (EOI) imaging choice and response assessments on PFS and OS in clinical practice.

Patients and methods

Study design, patients, and assessment imaging

This retrospective cohort study was determined by an analysis of prospectively collected data from the NLCS. Full details of the NLCS study design have been published elsewhere.[Citation13] The NLCS is an observational study that includes 2740 newly diagnosed FL patients enrolled between March 2004 and March 2007 at participating sites. There was no central pathology review; the local pathology report defined FL diagnosis after investigator education on World Health Organization definitions of FL. Given that this was an observational study, there was no central imaging review; stage and response were determined and reported by the treating physician(s). Since the data were collected before the Lugano Classification (Deauville criteria), PET response was reported as complete response (CR), partial response (PR), stable disease (SD), and progressive disease (PD). CT response was reported as CR, CR unconfirmed (CRu), PR, SD, and PD. Treatment and outcomes (including response, time to progression, and survival) were collected quarterly. Follow-up data were actively solicited from providers at the time of clinical follow-up. Enrolled patients were followed for 10 years from enrollment or until death, withdrawal of consent, or loss to follow-up. Staging procedures (including bone marrow biopsy, diagnostic CT, and/or PET with or without CT scans), were recorded in the database. For this study, we included patients in the NLCS who completed R-based induction therapy and who met all of the following criteria: evaluable; Stage II–IV; [Citation14] FL without large cell histology at diagnosis; received initial therapy (as assigned by the physician) with R-monotherapy or R-chemotherapy, including R-CHOP (rituximab plus cyclophosphamide, doxorubicin, vincristine, and prednisone), R-CVP (rituximab plus cyclophosphamide, vincristine, and prednisone), rituximab plus fludarabine, and R-other chemotherapy; completed planned (not terminated early, as reported by clinical sites) therapy; reported treatment end date (last dose date). Patients were checked for EOI assessments, specified in this study as imaging performed between two cycles before and 12 weeks after completion of first-line treatment. Assessments included PET, CT, other response evaluations (physical exam, lab exam, bone marrow biopsy, and/or imaging other than PET or CT), and no evaluation. All imaging was done at the physician’s discretion. PET and CT scans were done per protocol at individual institutions. Each patient was classified into one of five groups determined by the method used to make assessments at completion of the first-line treatment: PET alone, CT alone, PET with CT, other image evaluation, and no image evaluation. Given imaging was done at the provider’s discretion, and multiple images could be ordered and reported, patients were classified as PET with CT if the PET and CT scans were done on the same day, or within 28 days of each other; responses were classified as the best response via PET during the assessment period. Similarly, for PET (or CT) alone, responses were classified as the best response via PET (or CT) during the assessment period. To examine EOI PET as a predictor of outcome, patients who received a PET with CT or PET alone were combined for analysis and labeled as ‘PET ± CT’. For the purposes of this article, a ‘negative PET (or CT)’ result indicates the assessed response was CR/CRu (if CT) and a ‘positive PET (or CT)’ result indicates the assessed response was PR, SD, and PD.

NLCS data management and analysis are guided by an advisory board comprising academic investigators, some of whom co-authored this article (C.R.F., J.W.F., and A.D.Z.), and a patient advocate. The advisory board participated in all study phases, including initial protocol design, prospective determination of data for collection, and consideration of participating sites. The advisory board collaborated with the primary author (I.W-S.) and sponsor regarding data interpretation and publication. This article was written de novo by the primary author (I.W-S.) and members of the advisory board following approval of a protocol with prespecified endpoints, hypotheses, and plans for analysis.

Statistics

Baseline patient demographics and disease features, center, type, geographic region, therapy regimen, and duration were summarized by imaging group. Between imaging groups, the overall and pairwise comparisons were examined using the Pearson χ2 test. Logistic regression was then used to identify factors related to imaging choice, with covariates included through a backward selection (p > 0.10).

Descriptive statistics of OS and PFS (calculated from the date of EOI response) were estimated using Kaplan–Meier methods. Cox regression models including positive–negative response status, assessment method, and their two-way interaction were used to calculate hazard ratios (HRs) and 95% confidence intervals (CIs) with and without propensity score (PS) matching; additional models also adjusted for the Follicular Lymphoma International Prognostic Index (FLIPI) risk groups. To adjust for potential baseline characteristic imbalances, patients were scored and then matched based on their propensity of being assigned to an imaging modality. Cox proportional hazards models with PS matching were used to compare the impact of PET ± CT response with CT response on OS/PFS. All available variables potentially related to outcome or imaging selection (i.e. age group, sex, race, histology grade, stage, lactate dehydrogenase, hemoglobin, nodal sites, extra-nodal sites, Eastern Cooperative Oncology Group performance status, B symptoms, bone marrow involvement, geographic region, center type, and induction treatment) were included in the calculation of the PS.

Results

Characteristics of patients and EOI assessment choice

A total of 1289 Stage II–IV subjects completed induction R-monotherapy or R-chemotherapy. Among those who completed induction therapy, 447 (35%) had PET ± CT, 537 (42%) had CT alone, 10 (1%) had imaging other than PET or CT, and 295 (23%) had no EOI imaging (Supplementary Figure S1).

Factors associated with EOI assessment choice

EOI assessment by either PET ± CT was performed on average 183 days after diagnosis, and 25 days after the end of therapy. CT alone was obtained on average 169 days after diagnosis, and 26 days after the end of therapy. Imaging via PET or CT scans was more common in patients: aged less than 70 years; with at least five nodal sites; at academic sites; who had received R-chemotherapy; or who had therapy for at least 85 days; it was less common in the West (p < 0.01, Supplementary Table S1). Nearly half of the patients who did not receive imaging via PET or CT were treated for less than 85 days. In a generalized logits model evaluating the association of baseline factors and the choice of assessment method (PET alone, CT alone, PET and CT, or no imaging), region, center type, duration, choice of induction therapy, and histologic grade were associated with whether imaging was performed and which type of imaging was obtained (p < 0.10).

The baseline characteristics of evaluable patients who received imaging via PET ± CT and CT alone and the percentages of patients receiving PET ± CT compared with CT alone are presented in . From the logistic regression analysis of those who received PET or CT imaging, Grade 3 histology and R-CHOP induction were associated with greater likelihood of receiving PET ± CT vs. CT alone, while the Midwest region was less likely to receive PET (p < 0.05). In view of the marked differences among patients who received a PET ± CT or CT alone, PS matching was used to remove potential bias in the comparison of imaging methodologies. A total of 361 pairs were matched using PS for comparative evaluation of OS and PFS. provides the demographics and clinical factors for the PS-matched population.

Table 1. Demographics and characteristics of evaluable and propensity score-matched patients.

EOI response

Of 447 patients who received PET ± CT scans performed at EOI, 292 (65%) were reported as negative and 155 as positive. Of 537 responses evaluated at EOI by CT scans alone, 211 (39%) were reported as negative and 326 as positive. Responses for the PS-matched population were similar to the non-matched population (Supplementary Table S2).

Effect of imaging results on survival

With a median follow-up of 7.6 years, the 5-year OS and PFS for PET-negative patients were 88% and 65%, respectively. PET-positive patients had a 5-year OS of 78% and PFS of 51%. The OS and PFS at 5 years for CT-negative patients were 87% and 64%, respectively. CT-positive patients had a 5-year OS of 78% and a PFS of 54% (. Five-year OS and PFS within the PS-matched population were comparable to those in the non-matched population (). Similar to the 5-year survival, PET-negativity also predicted 2-year PFS, 83% vs. 68% in the PET-positive patients (p value 0.002). Among all evaluable patients, positive EOI PET ± CT and positive EOI CT were both associated with inferior OS (HR 1.78, 95% CI: 1.16–2.72 and HR 1.61, 95% CI: 1.13–2.29, respectively) and PFS (HR 1.63, 95% CI: 1.21–2.20 and HR 1.45, 95% CI: 1.12–1.89, respectively) in unadjusted analysis (). After adjustment for FLIPI scores, PET response remained associated with inferior OS (HR 1.54, 95% CI: 1.01–2.36) and PFS (HR 1.54, 95% CI: 1.14–2.07). However, after adjustment for FLIPI, CT responses were only associated with inferior PFS (HR 1.34, 95% CI: 1.03–1.75), but not OS (HR 1.38, 95% CI: 0.96–1.97; ).

Table 2. Five-year survival for the whole cohort and propensity score-matched cohort.

Table 3. Hazard ratios for survival via PET and CT response.

In the PS-matched subset, there was not a statistically significant improvement with PET vs. CT in the association with OS or PFS (imaging assessment by response interaction p values 0.77 and 0.65, respectively; ). Kaplan–Meier curves comparing the two imaging assessments are presented in Supplementary Figure S2.

Discussion

The NLCS registered more than 2700 patients with FL, and then prospectively followed the patients for more than a decade. This is the largest observational cohort in FL, and it allows for a unique chance to explore practice in a non-clinical trial setting, and explore whether practice in the “real world” leads to similar outcomes to those in clinical trials. This large, multicenter observational study supports PET response as a strong predictor of OS and PFS after R-induction therapy in FL. Importantly, PET response provided additional prognostic information beyond the FLIPI score, the primary clinical tool employed in prognostication of FL.

Our results confirm those from previous studies demonstrating the ability of PET to predict survival after R-induction therapy for FL. In the one prospective study assessing the prognostic value of PET performed at the end of treatment, 121 patients with FL treated with first-line R-CHOP and two cycles of R, 2-year OS rate was 100% for PET-negative vs. 88% for PET-positive patients (p = 0.0128), and 2-year PFS rate was 87% for PET-negative vs. 51% for PET-positive patients (p < 0.001).[Citation9] Our larger study demonstrated even longer-term impact of post-induction PET-response status on survival: a negative PET portended a better 5-year OS (87% vs. 78%) and PFS (65% vs. 51%) compared with a positive PET. In the study by Trotman et al. which had a median follow-up of 54.6 months, the HR for OS for patients with a positive PET scan vs. a negative scan was 6.7 (2.4–18.5); this predictive power was independent of FLIPI and FLIPI-2, and stronger than conventional CT.[Citation10] With a follow-up of 7.6 years in NLCS, our study showed that similarly, in clinical practice, PET response is associated with OS after accounting for FLIPI.

The PRIMA and GOELAMS trials both underwent secondary analyses where PET response remained highly predictive of outcomes in patients after assessment by 1999 International Standardized Response Criteria; [Citation5,Citation6] in contrast, CT response in PET-negative and PET-positive patients was not prognostically significant. In our study using PS-matching, though PET response had numerically greater association with outcome than did CT response, we were not able to show that PET response compared to CT response portended a superior survival. Because the NLCS was an observational study of FL treatment, practices, and outcomes, in which imaging were ordered per provider preference, most patients did not receive a concurrent PET and diagnostic CT scan; therefore, we were unable to directly compare PET and CT responses in individuals. With the intention of reducing known bias in this study, including choice of imaging, we applied PS-matching to compare those who received a PET vs. CT. As a result, the findings from these two former studies differed from our finding due in part to this methodologic difference.

While the observational nature of our study has limitations, including bias associated with treatment decisions and imaging choices, no reported EOI imaging from almost a quarter of patients treated, and conclusions that cannot be accounted for in the analysis, our study reflects a much larger scope of academic and community clinical practices countrywide. This makes the findings of this study more broadly applicable to general oncology care. In the GOELAMS/GELA and LYSA/FIL analyses, PET scans were centrally reviewed and the results reported using the Deauville 5-point scale.[Citation9,Citation10] The lack of central review, absence of the Deauville criteria to assist providers in response assessments at the time of this study, and the number of PET scans done without a CT correlate could have accounted for the relatively increased PET-positive patients in this study, dampening this study’s predictive power of PET. However, despite an absence of central review and the advantages of defined criteria for PET response at the time of the study, this study confirmed the benefit of PET imaging for PFS and OS based on local interpretation of the scan and outside of a clinical trial, reflecting general clinical practice in the ‘real world’. In addition, the follow-up in this study was also relatively long compared with other studies, a meaningful aspect of this study given FL is typically an indolent disease.

This is the only known study to examine FL imaging patterns after R-induction therapy. Our data demonstrated that, despite current recommendations, PET is commonly utilized with non-clinical factors, including duration, region, center type, therapy, and driving assessment choice. Whereas most previous studies included patients primarily treated with R-CHOP,[Citation5,Citation6,Citation9] NLCS incorporated all R-based induction treatment regimens commonly used in clinical practice. We established that PET’s prognostic value is independent of FLIPI scores, which is most relevant given the FLIPI score is the primary prognostic tool used in upfront FL. Finally, disease progression within 2 years after diagnosis in chemoimmunotherapy-treated patients is predictive of poorer OS.[Citation15] Our data showed that EOI-PET response was predictive of 2-year PFS (from time of EOI response assessment); therefore, PET response may be an acceptable surrogate for OS in FL. Although 2-year PFS as a predictor of OS has never been validated in R-monotherapy-treated patients, 12-month event-free survival was a predictor of OS in patients treated with R-monotherapy or R-chemotherapy.[Citation16] Our conclusion mirrors the LYSA/FIL results, which demonstrated that PET-CT was predictive of OS in prospective studies.[Citation10] Consequently our results not only confirmed the revised Lugano Classification’s recommendation for PET imaging to assess response in FDG-avid lymphomas such as FL, but also indicate that in clinical practice, PET response assessment should be the preferred mode of response assessment and performed in patients after induction treatment for FL.

In conclusion, this study showed that use of PET after completion of R-containing therapy for FL was prevalent but varied by center and region. Our data confirmed that PET provides important prognostic information after treatment initiation by highlighting the inferior OS in patients remaining PET positive after therapy, independent of FLIPI scores, supporting and confirming use of PET imaging at the end of R-induction therapy outside of a clinical trial, even in an indolent lymphoma-like FL. Further study of EOI PET imaging in FL could facilitate major changes in FL treatment strategies.

Potential conflict of interest

Disclosure forms provided by the authors are available with the full text of this article at http://dx.doi.org/10.1080/10428194.2016.1213824.

Supplemental material

ICMJE Forms for Disclosure of Potential Conflicts of Interest

Download Zip (8.9 MB)

Supplementary material

Download MS Word (1.1 MB)

References

  • Morton LM, Wang SS, Devesa SS, et al. Lymphoma incidence patterns by WHO subtype in the United States, 1992–2001. Blood. 2006;107:265–276.
  • Al-Tourah AJ, Gill KK, Chhanabhai M, et al. Population-based analysis of incidence and outcome of transformed non-Hodgkin’s lymphoma. J Clin Oncol. 2008;26:5165–5169.
  • Janikova A, Bolcak K, Pavlik T, et al. Value of [18F]fluorodeoxyglucose positron emission tomography in the management of follicular lymphoma: the end of a dilemma? Clin Lymphoma Myeloma Leuk. 2008;8:287–293.
  • Bishu S, Quigley JM, Bishu SR, et al. Predictive value and diagnostic accuracy of F-18-fluoro-deoxy-glucose positron emission tomography treated grade 1 and 2 follicular lymphoma. Leuk Lymphoma. 2007;48:1548–1555.
  • Le Dortz L, De Guibert S, Bayat S, et al. Diagnostic and prognostic impact of 18F-FDG PET/CT in follicular lymphoma. Eur J Nucl Med Mol Imaging. 2010;37:2307–2314.
  • Trotman J, Fournier M, Lamy T, et al. Positron emission tomography-computed tomography (PET-CT) after induction therapy is highly predictive of patient outcome in follicular lymphoma: analysis of PET-CT in a subset of PRIMA trial participants. J Clin Oncol. 2011;29:3194–3200.
  • Lopci E, Zanoni L, Chiti A, et al. FDG PET/CT predictive role in follicular lymphoma. Eur J Nucl Med Mol Imaging. 2012;39:864–871.
  • Zinzani PL, Musuraca G, Alinari L, et al. Predictive role of positron emission tomography in the outcome of patients with follicular lymphoma. Clin Lymphoma Myeloma Leuk. 2007;7:291–295.
  • Dupuis J, Berriolo-Riedinger A, Julian A, et al. Impact of [(18)F]fluorodeoxyglucose positron emission tomography response evaluation in patients with high-tumor burden follicular lymphoma treated with immunochemotherapy: a prospective study from the Groupe d'Etudes des Lymphomes de l’Adulte and GOELAMS. J Clin Oncol. 2012;30:4317–4322.
  • Trotman JLS, Luminari S, Boussetta S, et al. Prognostic value of PET-CT after first-line therapy in patients with follicular lymphoma: a pooled analysis of central scan review in three multicentre studies. Lancet Haematol. 2014;1:e17–e27.
  • Cheson BD, Fisher RI, Barrington SF, et al. Recommendations for initial evaluation, staging, and response assessment of Hodgkin and non-Hodgkin lymphoma: the Lugano classification. J Clin Oncol. 2014;32:3059–3068.
  • Barrington SF, Mikhaeel NG, Kostakoglu L, et al. Role of imaging in the staging and response assessment of lymphoma: consensus of the International Conference on Malignant Lymphomas Imaging Working Group. J Clin Oncol. 2014;32:3048–3058.
  • Friedberg JW, Taylor MD, Cerhan JR, et al. Follicular lymphoma in the United States: first report of the National LymphoCare Study. J Clin Oncol. 2009;27:1202–1208.
  • Friedberg JW, Byrtek M, Link BK, et al. Effectiveness of first-line management strategies for stage I follicular lymphoma: analysis of the National LymphoCare Study. J Clin Oncol. 2012;30:3368–3375.
  • Casulo C, Byrtek M, Dawson KL, et al. Early relapse of follicular lymphoma after rituximab plus cyclophosphamide, doxorubicin, vincristine, and prednisone defines patients at high risk for death: an analysis from the National LymphoCare Study. J Clin Oncol. 2015;33:2516–2522.
  • Maurer MJ, Ghesquieres H, Ansell SM, et al. 1664 event-free survival at 12 months (EFS12) from diagnosis is a robust endpoint for disease-related survival in patients with follicular lymphoma in the immunochemotherapy era. 56th ASH Annual Meeting; 2014: Session 622, Poster I.