In its base case, the company modelled differences in PFS and OS between treatments based on extrapolated data from IMROZ Kaplan–Meier curves and the ITCs. The company jointly fitted distributions to MAIC-adjusted IMROZ data (Isa‑Bor‑Len‑Dex) and comparator (Dar‑Len‑Dex, Mel‑Bor‑Pred and Len‑Dex) trial OS and PFS Kaplan–Meier data. For Bor‑Cyclo-Dex, the company also jointly fitted distributions to IMROZ ITT data for Isa‑Bor‑Len‑Dex. For Bor‑Cyclo‑Dex, it used PFS data and OS adjusted with inverse probability weighting. The company then estimated time-varying hazard ratios by comparing intervention and comparator survival estimates at different time points. These hazard ratios were applied to IMROZ ITT population OS and PFS distributions to generate survival estimates for people having Dar‑Len‑Dex, Mel‑Bor‑Pred, Len‑Dex and Bor‑Cyclo‑Dex. The company thought that the most appropriate distributions to generate long-term survival estimates were the Gompertz distribution for OS and the Gamma distribution for PFS. The EAG explained its view that the company's approach was overly complicated. It thought that the fitted distributions used to estimate time-varying hazard ratios could have been used directly in the company's model. It further explained that, at all time points, survival estimates based on IMROZ MAIC-adjusted OS and PFS data were lower than the survival estimates based on IMROZ ITT data. So, generating survival estimates based on IMROZ Isa‑Bor‑Len‑Dex ITT Kaplan–Meier data generated optimistic OS estimates for Isa‑Bor‑Len‑Dex. The EAG also noted that the company had fitted distributions to the full 68 months of available IMROZ data but that, after 60 months, the only events remaining were censoring events. The company agreed with the EAG that most events past 60 months were censoring events. But it explained that limiting analysis to 60 months did not take into account all the available evidence. This was particularly true in the case of the MAIA trial (Dar‑Len‑Dex compared with Len‑Dex in people with newly diagnosed multiple myeloma in whom an ASCT is unsuitable). Survival data from this trial was available for up to 100 months of follow up.
The company explained that conventional extrapolation techniques apply less emphasis to the 'tail' of data when there are fewer people at risk. Censoring people after 60 months for Isa‑Bor‑Len‑Dex only marginally changed the survival estimates for OS and PFS. This suggested that the tail did not introduce statistically significant uncertainty. The EAG disagreed with the company's preference for including data beyond 60 months, and noted that including this data contributed to overly optimistic OS estimates for Isa‑Bor‑Len‑Dex. So, at the clarification stage, it asked the company to provide analysis in which distributions were fitted only to the first 60 months of data. The company provided 2 new analyses in response to the EAG's request. Scenario A was limited to 60 months of data from IMROZ and MAIA, and was the analysis preferred by the EAG. Scenario B used the full 68‑month follow up of IMROZ and also included additional follow up from MAIA up to 100 months. But the company did not do as the EAG had requested. This was to fit separate distributions to the first 60 months of MAIC-adjusted IMROZ Isa‑Bor‑Len‑Dex data and comparator trial data and use these distributions directly in the economic model. Instead, the company maintained its original approach, but using the 60‑month data. The clinical experts explained that the high proportion of censoring events after 60 months in IMROZ added uncertainty to the survival analysis. They added that it is often preferable to use as much clinical trial data as is available. But they thought it was reasonable to exclude IMROZ data after 60 months from the analysis.
After the EAG's request at the clarification stage for new analyses up to 60 months, the company revised its selection of parametric curves for Isa‑Bor‑Len‑Dex. It preferred using Weibull for PFS and generalised gamma for OS. For PFS, the EAG explained that it thought that the Gompertz distribution was a better choice than Weibull. This was because it was similarly ranked and generated estimates that were more closely aligned to clinical expert opinion. For OS, the EAG noted that the Gompertz distribution was a better fit based on Akaike information criterion and Bayesian information criterion statistics. It also generated OS estimates that were closer to clinician landmark estimates. At the first meeting, the clinical experts noted that, because of the age of people at diagnosis, it was very difficult to validate estimates of PFS and OS. This was particularly difficult out to a 20‑year timepoint in which the model extrapolations were all overoptimistic. But, despite this caveat, the clinical experts thought that the clinical expert opinion provided by the company was reasonable. They also preferred the EAG's chosen distributions because of their closer alignment with these clinician landmark estimates. The EAG explained that the choice of extrapolation had a relatively small impact on cost effectiveness. This was particularly so when compared with the choice of whether to use the 60‑month or 68‑month analysis.
At the first meeting, the committee agreed with the EAG that the company's approach was overly complicated, and that the calculation of time-varying hazard ratios was an unnecessary step. The committee noted that the company's and EAG's approach to modelling OS and PFS was highly uncertain because it relied on the results from the unanchored MAIC. It recalled that it would have preferred to see ITC results from an NMA that maintained randomisation (see section 3.4). So, it was unable to conclude on the most appropriate OS and PFS parametric distribution, and whether 60 or 68 months of data should be used. The committee concluded that, because it had not seen an appropriate analysis to model PFS and OS, it could not state its preference. If appropriate, and data allows, the committee's preferred method to model OS and PFS for Isa‑Bor‑Len‑Dex and comparators would be to apply the hazard ratio generated from an NMA (which maintained randomisation) to an appropriate reference curve such as Dar‑Len‑Dex OS and PFS curves from MAIA or Dar‑Len‑Dex SACT data.
At consultation, the company applied the hazard ratio generated from the MAIC to the Dar‑Len‑Dex curves from MAIA. Also, it used the same extrapolations for Dar‑Len‑Dex that were used in TA917. For OS, the generalised gamma reference curve informed the company's base case. For PFS, the gamma reference curve was used. A 68‑month IMROZ data cut was maintained in its analysis. The company said that the updated survival estimates aligned with the clinical experts that they consulted. The EAG highlighted that the decision to anchor the hazard ratios to MAIA or IMROZ should be based on which trial population is most similar to the NHS. The committee heard that the average age of people in MAIA was slightly older compared with IMROZ. At the second committee meeting, the clinical experts confirmed that the updated survival and progression-free estimates predicted in the model were more aligned with what they expected for Dar‑Len‑Dex. The committee concluded that the company's approach was appropriate for extrapolating OS and PFS and that the hazard ratios should be applied to the Dar‑Len‑Dex curves from MAIA because the average age in MAIA is more reflective of people having treatment in the NHS.