East Training EUGM Day 2 Pantelis Vlachos - - PDF document

east training eugm day 2
SMART_READER_LITE
LIVE PREVIEW

East Training EUGM Day 2 Pantelis Vlachos - - PDF document

Shaping the Future of Drug Development East Training EUGM Day 2 Pantelis Vlachos Pantelis.Vlachos@cytel.com Agenda Cytel Software map East Overview/Introduction Phase I: Dose Escalation with cohort expansion (ESCALATE + PROGRAM)


slide-1
SLIDE 1

10/28/19 1

Shaping the Future

  • f Drug Development

East Training EUGM Day 2

Pantelis Vlachos Pantelis.Vlachos@cytel.com

Agenda

  • Cytel Software map
  • East Overview/Introduction
  • Phase I: Dose Escalation with cohort expansion (ESCALATE + PROGRAM)
  • Phase II: Model-based dose finding (MCPMod)
  • Phase II/III: Multi-Arm Multi-Stage designs (MAMS)
  • Phase III: Group Sequential designs for TTE (SURVIVAL)
  • Phase III: Unblinded Sample-Size Re-estimation (SURVADAPT)
  • Phase III: Adaptive Population Enrichment (ENRICH)
  • Q/A

EUGM, Nov 2019 2

slide-2
SLIDE 2

10/28/19 2

Software Solutions

DATA ANALYSIS CLINICAL TRIAL DESIGN CLINICAL OPERATIONS EUGM, Nov 2019 3

Cytel Software Solutions

COMMERCIA L PROPRIETARY TOOLS EUGM, Nov 2019 4

slide-3
SLIDE 3

10/28/19 3

Shaping the Future

  • f Drug Development

East overview

1994 2000 2010

EUGM, Nov 2019 6

slide-4
SLIDE 4

10/28/19 4

Broad coverage of designs for biostatisticians One integrated tool for all types of designs: fixed sample, group sequential or adaptive Superior User Interface

  • Multiple windows with graphs and tables
  • Organized storage of designs in workbooks

Rapid creation, viewing and filtering of multiple scenarios for design parameters Commitment to continuous improvement and expansion of features

What Is Special About The New East?

EUGM, Nov 2019 7

East 6.5 (Three New Modules)

EUGM, Nov 2019 8

slide-5
SLIDE 5

10/28/19 5

East BASE

Complete package for fixed-sample clinical trial designs

Functions:

  • Wide variety of fixed sample options (Normal, Binomial, Survival endpoints)
  • Multiple scenarios in one go
  • Complex patterns of accruals, dropouts and response lag
  • Run R functions
  • Bayesian probability of success (Normal / Binomial/Survival)

New in 6.5:

  • Super Superiority designs

SUCCESSFUL OUTCOME:

Test variety of superiority, non-inferiority, equivalence, and confidence-interval based designs

EUGM, Nov 2019 9

Intro To East

  • Introduction to the new East on the Architect platform
  • A Normal-Endpoint Example to Illustrate New Features of East

Schizophrenia Trial: illustrates more flexible boundaries; delayed response; incorporating Bayesian priors into design

  • Q & A

10 EUGM, Nov 2019

slide-6
SLIDE 6

10/28/19 6

Two arm trial: asenapine vs. olanzapine (control) Primary endpoint is negative symptoms assessment (NSA) on 24-point scale observed at week 26 Design for d = 2 and s = 7.5 with two sided a=0.05 and 80% power But d could be as small as 1.6 so examine a range of sample size requirements Enrollment ramps up to 8 patients/week Up to 8% drop-outs anticipated

Example 1: Schizophrenia Trial Input Window

12 EUGM, Nov 2019

slide-7
SLIDE 7

10/28/19 7

Inputs With Output Preview

13 EUGM, Nov 2019

Use Of Filter To Eliminate Designs

14 EUGM, Nov 2019

slide-8
SLIDE 8

10/28/19 8

Convert Single Look Design into GSD

First save the selected design(s) to the library for further editing Then change number of looks and boundary information

15 EUGM, Nov 2019

View the Boundaries & Compute

With futility boundary, sample size has increased to 625 patients

16 EUGM, Nov 2019

slide-9
SLIDE 9

10/28/19 9

Bayesian Probability of Success (Assurance)

Assign a prior distribution to d to capture the uncertainty associated with it

The “Assurarce” of success is only 0.685 due to uncertainty about d In this case the “Assurance is 0.798 due to less uncertainty about d

17 EUGM, Nov 2019

Compare Design Summaries

18 EUGM, Nov 2019

slide-10
SLIDE 10

10/28/19 10

Incorporate Delayed Response & Dropouts

19 EUGM, Nov 2019

Compare Designs With & Without Lag

  • Notice that, due to

the 26 week response lag, there is very little saving in expected sample size.

  • Maximum study

duration is 110 weeks

20 EUGM, Nov 2019

slide-11
SLIDE 11

10/28/19 11

Reason for lack of sample size saving in GSD when there is a long delay in the primary response

  • The is a large gap (> 208 patients) between sample size and completers
  • By the time 669 completers for IA 3 the total enrollment is over. Hence

no saving of sample size if trial stops at look 3

  • But, there is a saving of time; trial stops 20 weeks earlier if stopped at look

3

21 EUGM, Nov 2019

Examine Final Design in Detail

Icons for viewing design details, graphs, tables and summaries

22 EUGM, Nov 2019

slide-12
SLIDE 12

10/28/19 12

Shaping the Future

  • f Drug Development

Phase I: Dose Escalation East ESCALATE

ADAPTIVE PHASE 1 DOSE-ESCALATION STUDIES

Description:

  • Simulate dose-toxicity profiles and designs
  • Dose recommendations for next cohort of patients
  • Methods include: 3+3, Continual Reassessment Method, Bayesian

Logistic Regression Model, and modified Toxicity Probability Interval

  • Dual-agent dose combination designs (BLRM, PIPE)

Benefits:

  • Better characterization of the MTD
  • Clearly communicate critical information to clinicians
  • Improve screening and selection of active agents to carry forward

New in 6.5: • mTPI-2

Pre-Requisites: East BASE

SUCCESSFUL OUTCOME: IDENTIFICATION OF THE MAXIMUM TOLERATED DOSE (MTD)

EUGM, Nov 2019 24

slide-13
SLIDE 13

10/28/19 13

Phase I Dose-Escalation Trials

  • Assess dose-toxicity relationship
  • First-in-human (FIH) studies – single agent
  • Determine maximum tolerated dose (MTD) or recommended phase II dose (RP2D)
  • Observe Dose limiting toxicities (DLTs)
  • Combination dose finding studies (Phase Ib)
  • Same primary objective as FIH studies
  • Combination of two (or more) drugs
  • Addition of a new drug to a registered treatment to increase efficacy

25 EUGM, Nov 2019

East ESCALATE (Summary)

Bayesian? Model dose-toxicity? (number of parameters) Probability Intervals? Single Agent Designs 3+3 No No No CRM Yes Yes (1) No BLRM Yes Yes (2) Yes mTPI Yes No Yes Double Agent Designs comb2BLRM Yes Yes (5) Yes PIPE Yes No No

26 EUGM, Nov 2019

slide-14
SLIDE 14

10/28/19 14

Single-Agent Dose Escalation

27 EUGM, Nov 2019

Phase I dose-escalation (general frame)

Only consider trials with fixed doses. A sequence of K doses, d1,d2,…,dK, as candidates. Dose i has a toxicity probability of pi (unknown). Monotonicity : pi < pi+1 Goal: to find the MTD , defined as the highest dose with toxicity rate lower (or close to) a fixed rate, pT, e.g., pT = 0.30.

28 EUGM, Nov 2019

slide-15
SLIDE 15

10/28/19 15

Ethical Requirement

“The major difficulty in phase I trial design and conduct is the ethical requirement that the number of patients in the trial who experiencetoxicity must be limited.”

29 EUGM, Nov 2019

Other Challenges and Design Requirements

Alessandro Matano, Novartis, http://www.smi-online.co.uk/pharmaceuticals/archive/4-2013/conference/adaptive-designs

30 EUGM, Nov 2019

slide-16
SLIDE 16

10/28/19 16

Rule-based vs Model-based

Rule-based Model-based

Applicability Easy More complex due to statistical component (e.g. evidence-based prior derivation) Flexibility Not very flexible

  • Fixed cohort size
  • Fixed doses

Flexible: allows for

  • Different cohort sizes
  • Intermediate doses

Extensibility Rather difficult Easily extendable

  • 2 or more treatment arms
  • combinations

Inference for true DLT rates Observed DLT rates

  • nly

Full inference, uncertainty assessed for true DLT rates Statistical requirements None “reasonable” model, “good” statistics Decisions Algorithm decides Clinically-driven recommendations

(Based on Matano. Bayesian Adaptive Designs for Oncology Phase 1 Trials, 2013)

31 EUGM, Nov 2019

The “standard” design

32 EUGM, Nov 2019

slide-17
SLIDE 17

10/28/19 17

3+3 (Prevalence)

Over 98% of published Phase 1 trials (1991-2006) use variations of 3+3

(Rotgako et al., 2007)

33 EUGM, Nov 2019

94% In Trials Until 2014

The 3+3 design

Rule Based Design Start by allocating lowest dose level to first cohort Adaptively Escalate/De-escalate based on observed DLTs Maximum tolerated dose (MTD) is considered highest dose at which 1 or 0 out of six patients experiences DLT. Doses need to be pre-specified Different versions: 3+3L, 3+3Lmod and 3+3H

34 EUGM, Nov 2019

slide-18
SLIDE 18

10/28/19 18

The 3+3 Design (Schematic)

Yuan Ji, KOL Lecture Oct. 2013

35 EUGM, Nov 2019

Advantages

Simplicity Of Design, Execution, Inference Meet Ethical Needs Of Exploring Low Doses First Provide Simple, Operational Definition Of The Target Dose Considerable Clinical Experience And Comfort With Their Use They Can Be Easily Studied Quantitatively And Possibly Improved

36 EUGM, Nov 2019

slide-19
SLIDE 19

10/28/19 19

Issues

Generally, not very accurate depiction of true dose-response (or dose-toxicity) curve Require dose levels specified in advance Usually start far from target dose Don’t fully use information from previously treated patients Don’t use information on ordinal response (e.g. graded toxicity) Estimate of MTD is seriously biased or invalid

37 EUGM, Nov 2019

(more) limitations of 3+3

Ignores dosage history other than previous cohort

  • 0/3, 0/3, 0/3, 0/3, 0/3, 2/6 provides more information than 0/3, 2/6

Same action under qualitatively different situations

  • 0/3 and 1/6 lead to same action (escalate to the next provisional dose)
  • 2/3, 3/3, 2/6, 3/6, and 4/6 lead to same action (de-escalate)

Ignores uncertainty:

  • If true DLT rate is p=0.5, 11% of the time we will see 0 or 1 DLT in 6 patients
  • If true DLT rate is p=0.166, 26% of the time we will see at least 2 DLT in 6 patients

Cannot re-escalate Fixed cohort sizes (either 3 or 6) Pre-defined dose levels to be potentially tested Low probability of selecting true MTD (e.g. Thall and Lee. 2003) High variability in MTD estimates (Goodman et al. 1995)

Alessandro Matano, Novartis, http://www.smi-online.co.uk/pharmaceuticals/archive/4-2013/conference/adaptive-designs

38 EUGM, Nov 2019

slide-20
SLIDE 20

10/28/19 20

Target Toxicity?

Common misconception target toxicity is fixed (eg., 17%, or 33%). He et al. (2006) showed via simulation that the expected toxicity level at the MTD for the 3+3 is between 19-22%.

39 EUGM, Nov 2019

Accelerated Titration

Simon et al. (1997): "...cohorts of one new patient per dose level. When the first instance of first course DLT is observed,...expand the cohort for current dose level.

40 EUGM, Nov 2019

slide-21
SLIDE 21

10/28/19 21

Accelerated Titration

Similar to traditional design with small cohorts at low doses Attempts to use information in ordinal toxicity responses at lower doses May reduce the number of patients needed to reach MTD

41 EUGM, Nov 2019

Regulatory Guidelines

FDA Guidance (Clinical Considerations for Therapeutic Cancer Vaccines) EMEA / CHMP Guideline on Clinical Trials in Small Populations

Alternative Approach Needed To Meet Design Requirements

42 EUGM, Nov 2019

slide-22
SLIDE 22

10/28/19 22

Bayesian Framework

(Based on Matano. Bayesian Adaptive Designs for Oncology Phase 1 Trials, 2013)

43 EUGM, Nov 2019

The Continual Reassessment Method (CRM)

Bayesian model-based method (O'Quigley et al. 1990) Uses all available information from doses to guide dose assignment Inputs to specify:

  • Target toxicity pT (usually at 33%)
  • A single-parameter (θ) dose-toxicity curve
  • prior distribution for θ
  • prior mean probabilities at each dose (“skeleton”)

Next recommended : posterior toxicity probability closest to target

44 EUGM, Nov 2019

slide-23
SLIDE 23

10/28/19 23

CRM: The Process

1. Patient cohorts treated at each dose level 2. Toxicity outcome observed 3. Using Bayes theorem, prior distribution and observed outcomes are used to calculate the posterior mean of the probability of toxicity at each dose level, ̂ 𝑞# 4. Next cohort of patients assigned to dose level that has its ̂ 𝑞# closest to target toxicity 5. Repeat 1-4 until termination criteria met

45 EUGM, Nov 2019

Modified CRM

46 EUGM, Nov 2019

slide-24
SLIDE 24

10/28/19 24

Modified CRM

Differences:

  • Can start at lowest dose
  • Allow multiple patients per cohort
  • Restrict escalation to one dose (do not allow skipping when escalating)

“The unmodified CRM...produces only modest increases in accuracy over the modified CRM, but at the price of greater toxicity, and, most important, clinical acceptability.”

47 EUGM, Nov 2019

CRM Simulation Parameters

48 EUGM, Nov 2019

slide-25
SLIDE 25

10/28/19 25

Uncertainty in toxicity rate

CRM relies on point estimate, ignores uncertainty. eg, same posterior mean, but Pr(p > 0.6) = 0.168 vs 0.002

49 EUGM, Nov 2019

Bayesian Logistic Regression Model (BLRM)

50 EUGM, Nov 2019

slide-26
SLIDE 26

10/28/19 26

Two parameter logistic: is the “reference dose” (arbitrary scaling dose) α>0 is the odds of DLT at β>0 is the increase in log-odds of DLT for unit increase in log-dose

51 EUGM, Nov 2019

Bayesian Logistic Regression Model (BLRM) Bayes Risk

Choose dose that minimizes posterior expected loss.

52 EUGM, Nov 2019

slide-27
SLIDE 27

10/28/19 27

Escalation With Overdose Control (EWOC)

  • Choose dose that maximizes targeted toxicity probability, given not overdosing.

53 EUGM, Nov 2019

Prior Specification (Direct vs Indirect)

Enter directly bivariate normal for log(α) and log(β):

54 EUGM, Nov 2019

Indirectly:

slide-28
SLIDE 28

10/28/19 28

Prior Specification (Indirect)

Assuming logits of toxicity are linear, calculate prior probabilities of toxicity (predicted median) at each dose level Assign a “minimally informative unimodal” Beta distribution at each dose level (Neuenschwander et al., 2008 Appendix A) Generate n sets of logits from Beta distributions, to obtain n estimates of log(α) and log(β) using least squares Use sample means, variance, correlation for bivariate normal

55 EUGM, Nov 2019

Modified Toxicity Probability Interval (Mtpi)

mTPI is Bayesian like CRM and BLRM, but rule-based like 3+3 Challenges for model-based methods: complexity (esp for non-statisticians); sensitivity to priors

56 EUGM, Nov 2019

slide-29
SLIDE 29

10/28/19 29

Modified Toxicity Probability Interval (mTPI)

“...almost all phase I oncology trials conducted at Merck in past 2 years have been based on the mTPI design”

57 EUGM, Nov 2019

Trial Monitoring Table

Yuan Ji, KOL Lecture Oct. 2013

58 EUGM, Nov 2019

slide-30
SLIDE 30

10/28/19 30

modified Toxicity Probability Interval (mTPI)

Probability of toxicity at each dose modeled by independent Beta distributions Set of decision intervals specified (like in BLRM) Dosing decisions determined by 'normalized' posterior probability in each interval at the current dose di :

  • Escalate to di+1 if di is 'underdosing'
  • Stay at di if 'proper dosing'
  • De-escalate to di-1 if di is 'overdosing'

59 EUGM, Nov 2019

mTPI Priors

“[W]e believe that for phase I trials with small sample sizes...the dependence

introduced by prior models will have a strong influence on the operating characteristics...The independent prior models performs quite well compared to existing approaches.”

60 EUGM, Nov 2019

slide-31
SLIDE 31

10/28/19 31

Equivalence Intervals

The Equivalence Interval (EI) is defined as [pT-ε1; pT +ε2] pT-ε1 is the lowest toxicity probability that the physician would be comfortable using to treat future patients without dose escalation pT +ε2 is the highest toxicity probability that the physician would be comfortable using to treat future patients without dose de-escalation

61 EUGM, Nov 2019

Unit Probability Mass

UPM (interval) = Post Pr(interval) / length(interval)

62 EUGM, Nov 2019

slide-32
SLIDE 32

10/28/19 32

mTPI Dose Exclusion / Stopping Rule

63 EUGM, Nov 2019

Summary

64 EUGM, Nov 2019

Bayesian? Model dose-toxicity? (number of parameters) Probability Intervals? Single Agent Designs 3+3 No No No CRM Yes Yes (1) No BLRM Yes Yes (2) Yes mTPI Yes No Yes Double Agent Designs comb2BLRM Yes Yes (5) Yes PIPE Yes No No

slide-33
SLIDE 33

10/28/19 33

Case Study (Shaw et al 2014)

65 EUGM, Nov 2019 66 EUGM, Nov 2019

slide-34
SLIDE 34

10/28/19 34

67 EUGM, Nov 2019 68 EUGM, Nov 2019

slide-35
SLIDE 35

10/28/19 35

69 EUGM, Nov 2019

NB: The published protocol uses the following covariate model, which is not currently implemented in East ESCALATE.

Covariate Model

70 EUGM, Nov 2019

slide-36
SLIDE 36

10/28/19 36

71 EUGM, Nov 2019 72 EUGM, Nov 2019

slide-37
SLIDE 37

10/28/19 37

73 EUGM, Nov 2019 74 EUGM, Nov 2019

slide-38
SLIDE 38

10/28/19 38

75 EUGM, Nov 2019

Option: Copy from Excel

76 EUGM, Nov 2019

slide-39
SLIDE 39

10/28/19 39

77 EUGM, Nov 2019 78 EUGM, Nov 2019

slide-40
SLIDE 40

10/28/19 40

79 EUGM, Nov 2019

1. Use prior calculator (indirect methods) 2. Enter informative prior parameters for bivariate normal. Plot & refine. 3. Enter weakly-informative prior parameters for bivariate normal

Prior Specification Options

80 EUGM, Nov 2019

slide-41
SLIDE 41

10/28/19 41

Prior calculator

81 EUGM, Nov 2019 82 EUGM, Nov 2019

slide-42
SLIDE 42

10/28/19 42

Weakly-informative priors

83 EUGM, Nov 2019

Stopping Rules

84 EUGM, Nov 2019

slide-43
SLIDE 43

10/28/19 43

Posterior Sampling Methods

85 EUGM, Nov 2019

Simulation Controls

86 EUGM, Nov 2019

slide-44
SLIDE 44

10/28/19 44

In Output Preview, right-click Sim1, select “Save in Workbook” to save to Library. In Library, right-click Sim1, select “Details”. In Library, select Plots icon

To View Simulation Details:

87 EUGM, Nov 2019

Interim Monitoring Exercise

88 EUGM, Nov 2019

slide-45
SLIDE 45

10/28/19 45

89 EUGM, Nov 2019

Use default Stopping Rules (none), and Response Generation. Simulate 1 trial, then save to Library.

90 EUGM, Nov 2019

slide-46
SLIDE 46

10/28/19 46

Open Interim Monitoring Dashboard.

91 EUGM, Nov 2019

Click “Enter Interim Data”.

Enter Cohort Data As Below.

92 EUGM, Nov 2019

slide-47
SLIDE 47

10/28/19 47

Match Results?

93 EUGM, Nov 2019 94 EUGM, Nov 2019

slide-48
SLIDE 48

10/28/19 48

95 EUGM, Nov 2019 96 EUGM, Nov 2019

slide-49
SLIDE 49

10/28/19 49

97 EUGM, Nov 2019 98 EUGM, Nov 2019

slide-50
SLIDE 50

10/28/19 50

References

3+3

B.E. Storer. Design and analysis of phase I clinical trials. Biometrics, 45:925-937, 1989.

mTPI

  • Y. Ji, P. Liu, Y. Li, and N. Bekele. A modified toxicity probability interval method for dose finding trials. Clinical trials,

7:653-656, 2010.

CRM

  • J. O’Quigley, M. Pepe, and L. Fisher. Continual reassessment method: A practical design for phase I clinical trials in
  • cancer. Biometrics, 46:33-48, 1990.

S.N. Goodman, M.L. Zahurak, and S Piantadosi. Some practical improvements in the continual reassessment method for phase I studies. Statistics in Medicine, 14:1149-1161, 1995

BLRM

  • B. Neuenschwander, M. Branson, and T. Gsponer. Clinical aspects of the Bayesian approach to phase I cancer trials.

Statistics in Medicine, 27:2420-2439, 2008.

  • L. W. Huson and N. Kinnersley. Bayesian fitting of a logistic dose– response curve with numerically derived priors.

Pharmaceutical Statistics , 8: 279–286, 2009

Combination

  • B. Neuenschwander, et al. A Bayesian Industry Approach to Phase I Combination Trials in Oncology. Statistical

Methods in Drug Combination Studies, 95-135, 2015 A.P. Mander and M.J. Sweeting. A product of independent beta probabilities dose escalation design for dual-agent phase I trials. Statistics in Medicine, 34:1261-1276, 2015

99 EUGM, Nov 2019

Shaping the Future

  • f Drug Development

Phase I: Dose Escalation With Cohort Expansion

slide-51
SLIDE 51

10/28/19 51

Define true underlying scenario(s) for endpoint(s), study design(s), decision rule(s) Generate many repetitions Summarize results Use to choose and justify trial design

  • Demonstrates design performance for a span of potential true scenarios

Widely used in Drug Development

Clinical Trial Simulation

EUGM, Nov 2019 101

Define a sequence of clinical trial simulations and decision rules and design options for moving from one trial to the next Aim to optimize the sequence of trials for a particular set of drug program objectives

Drug Program Simulation

EUGM, Nov 2019 102

slide-52
SLIDE 52

10/28/19 52

Program Design

Design & Simulate Sequence of Trials Two types planned for 6.5:

  • Dose Escalation followed by a cohort expansion study
  • Stage1: Dose escalation design (3+3, mTPI, CRM, BLRM)
  • Stage2: Single-arm cohort expansion
  • Frequentist or Bayesian GNG rules
  • Phase 2 oncology trial followed by Group Sequential
  • Stage1: Single-arm binomial, Simon’s two-stage, or 2-arm survival
  • Stage2: A group sequential design

EUGM, Nov 2019 103

Example: Dose Escalation with Cohort Expansion

Stage 1 Parameters:

  • Design: mTPI
  • Sample Size: 30
  • Cohort Size: 3
  • Target Probability of Toxicity: 30%
  • No. of Doses: 7

EUGM, Nov 2019 104

slide-53
SLIDE 53

10/28/19 53

Stage 2 Parameters

  • Design: Single Proportion GNG rule
  • Cohort expansion phase Sample Size: 20\
  • Clinically meaningful level (k1): 10%
  • desired level of clinical activity (k2): 20%
  • Prior: Beta (1, 1)
  • The true response rate is 0.1 for the three lowest doses below the true MTD, and 0.2 for all

doses at the true MTD and higher.

EUGM, Nov 2019 105

Design in East

Click Program: Program Simulation on the Design tab, and then click Dose Escalation with Cohort Expansion.

EUGM, Nov 2019 106

slide-54
SLIDE 54

10/28/19 54

Design in East

Right-click the Start node and add an mTPI design with all default inputs.

EUGM, Nov 2019 107

Design in East

Right-click the mTPI node and add a Single Proportion GNG Rule.

EUGM, Nov 2019 108

slide-55
SLIDE 55

10/28/19 55

Design in East

Right-click the mTPI node and add a Single Proportion GNG Rule.

EUGM, Nov 2019 109

Design in East

Test Parameters Tab

EUGM, Nov 2019 110

slide-56
SLIDE 56

10/28/19 56

Design in East

Response Generation Tab

EUGM, Nov 2019 111

Design in East

On the Simulation Controls tab, enter Number of simulations as 10000 trials and click Simulate in the bottom right corner.

EUGM, Nov 2019 112

slide-57
SLIDE 57

10/28/19 57

Analysis

  • Dose D4 (25 mg) is the true MTD and Dose D5 (40 mg) had a true DLT rate at the upper

limit of the Proper Dosing interval (0.35)

  • These two doses are selected as MTD the most often - 36% and 18%, respectively

EUGM, Nov 2019 113

Analysis

When doses D4 and D5 are selected as MTD, a Go decision is made most

  • ften, around 70% of the time

EUGM, Nov 2019 114

slide-58
SLIDE 58

10/28/19 58

Shaping the Future

  • f Drug Development

Phase II: MCPMod

Agenda

  • Introduction to MCPMod
  • The traditional approach
  • Why use MCPMod?
  • Case study
  • Questions

116 EUGM, Nov 2019

slide-59
SLIDE 59

10/28/19 59

Dose-Response Studies

Establish Proof-of-Concept (PoC)

  • Change in dose desirable change in endpoint of interest

Dose finding step

  • Select one (or more) “good” dose levels for confirmatory Phase III once

PoC has been established

117 EUGM, Nov 2019

Traditional Approach

Proof-of-Concept: Conducted using (multiple) active arms and placebo Selection of Target Dose: 1. statistically significant at the proof-of-concept stage 2. smallest of statistically significant doses but also clinically relevant Dose-Response Modeling: 1. use data from PoC and earlier trials 2. find a statistical model capturing the effects of target dose on dose-response

118 EUGM, Nov 2019

slide-60
SLIDE 60

10/28/19 60

Traditional Approach

Straight-forward approach However:

  • focuses on narrow dose range where sponsors can have faith that they will establish a

clear dose-signal

  • dose-response model should itself play a greater role in choosing the right dose
  • focuses on modeling at the very end of the process

119 EUGM, Nov 2019

MCP vs Mod

MCP:

  • Dose is a qualitative factor
  • Inference about target dose restricted to the discrete set of doses used in the trial

Mod: Dose Response (parametric functional relationship)

  • Dose is quantitative
  • Modeling approach validity depends on pre-specification of appropriate dose-response model

120 EUGM, Nov 2019

slide-61
SLIDE 61

10/28/19 61

Multiple Comparison Procedures – Modelling

MCP-step

  • Establish a dose-response signal (the dose-response curve is not flat) using multiple

comparison procedures Mod-step

  • Estimate the dose-response curve and target doses of interest (ED50, ED90, MED, etc)

using modelling techniques Modelling is pre-specified at the design stage Model uncertainty addressed using

  • A candidate set of models
  • A procedure on how to perform model selection

121 EUGM, Nov 2019

MCP + Mod = MCPMod

Design Stage

  • Pre-specification of candidate

dose-response models Analysis Stage (MCP-step)

  • Statistical test for dose-

response signal. Model selection based on significant dose response models Analysis Stage (Mod-step)

  • Dose response and target

dose estimation based on dose-response modeling

122

Trial Design Stage Trial Analysis Stage

EUGM, Nov 2019

slide-62
SLIDE 62

10/28/19 62

MCP + Mod = MCPMod

123

Trial Design Stage Trial Analysis Stage

EUGM, Nov 2019

Design Stage

  • Pre-specification of candidate

dose-response models Analysis Stage (MCP-step)

  • Statistical test for dose-

response signal. Model selection based on significant dose response models Analysis Stage (Mod-step)

  • Dose response and target

dose estimation based on dose-response modeling

List of dose-response models

124 EUGM, Nov 2019

slide-63
SLIDE 63

10/28/19 63

MCP-Mod : Scope

Development Phase

  • Ph II dose-response studies to support dose selection for Phase III

Response can be continuous, binary, count, time-to-event Number of doses, dose-range

  • Minimum: 2 active doses (for the MCP-step), 3 active doses (Mod step)
  • Recommendations (rules of thumb): 4-7 active doses, >10-fold dose range

Control

  • MCP-step makes most sense when there is a placebo control in the trial

125

Bjorn Bornkamp, EFSPI Meeting, Nov 2015

EUGM, Nov 2019

Regulatory Opinion

CHMP: First opinion issued in 2010, since then 12 qualification opinions (biomarkers, technologies/devices, simulation models)

  • MCP-Mod first statistical methodology qualified

FDA: Issued its Fit-for-Purpose (FFP) designation for guiding dose selection for Phase III testing.

  • https://www.fda.gov/downloads/Drugs/DevelopmentApprovalProcess/UCM508700.pdf

126 EUGM, Nov 2019

slide-64
SLIDE 64

10/28/19 64

Example: Continuous Data

BIOM study

  • randomized double-blind parallel group trial with patients being allocated to either

placebo or one of four active doses coded as 0.05, 0.20, 0.60, and 1

  • Response variable is baseline adjusted abdominal pain score
  • larger values correspond to better treatment effect

127 EUGM, Nov 2019

Example: The Design Part

  • One-sided type I error of 0.05
  • Calculate Power for a total sample size of 100

128 EUGM, Nov 2019

slide-65
SLIDE 65

10/28/19 65

Example: The Design Part

  • Candidate Models:

129 EUGM, Nov 2019

Example: Patient Allocation

Optimized: Equal:

130 EUGM, Nov 2019

slide-66
SLIDE 66

10/28/19 66

Example: Output Summary

131 EUGM, Nov 2019

Example: Output Details

132 EUGM, Nov 2019

slide-67
SLIDE 67

10/28/19 67

Example: Power Plot for Mean Power fct

133 EUGM, Nov 2019

Example: Sample Size for 90% Mean Power

134 EUGM, Nov 2019

slide-68
SLIDE 68

10/28/19 68

Example: The Analysis Part

The Dataset Consists Of Following Variables:

  • subjectID - This corresponds to the subject ID in the trial.
  • dose - The dose values administered to these subjects.
  • resp - The baseline adjusted abdominal pain score for subjects

135 EUGM, Nov 2019

Example: The Analysis Part

The Set Of Candidate Models Includes: 1. emax1: Emax(0.2) 2. linlog1: Lin in Log Dose 3. linear1: Linear 4. expn1: Exponential(1.13) 5. quad1: Quadratic(-0.7322) 6. beta1: Beta(2,4)

136 EUGM, Nov 2019

slide-69
SLIDE 69

10/28/19 69

Example: The Analysis Part

Some More About The Options Used:

  • Significant models will be selected using the pValue method (adjusted p-values).
  • The adequate model will be selected based on its AIC (default choice).
  • Estimate: Target dose is specified as the dose level that achieves a target effect of

Delta over placebo

137 EUGM, Nov 2019

Example: The Analysis Part

138 EUGM, Nov 2019

slide-70
SLIDE 70

10/28/19 70

Example: Analysis/Candidate Models

139 EUGM, Nov 2019

Example: Analysis/Candidate models

140 EUGM, Nov 2019

slide-71
SLIDE 71

10/28/19 71

Analysis: Output

141 EUGM, Nov 2019

Analysis: Output

142

MCP Part:

  • Multiple Contrast test: Observe that the adjusted p-values for all models except

Beta are smaller than 0.05 Mod part:

  • Fit the significant models to the data and estimate the target dose.
  • Model selection is based on the AIC criterion

EUGM, Nov 2019

slide-72
SLIDE 72

10/28/19 72

Analysis: Output

143 EUGM, Nov 2019

Analysis: Output

144 EUGM, Nov 2019

slide-73
SLIDE 73

10/28/19 73

Analysis: Output

145

Modelling part:

  • Based on the model selection criteria of minimum AIC, LinLog is the adequate model.
  • The corresponding target dose is 0.6.
  • The target doses on continuous scale differ for the different models though. Emax

calculates it approximately as 0.289, LinLog as 0.299 and Quadratic as 0.387.

EUGM, Nov 2019

References

146

  • Bornkamp, B. et al (2007) Innovative Approaches for Designing and Analyzing Adaptive

Dose-Ranging Trials, Journal of Biopharmaceutical Statistics, 17, 965-995

  • Bretz, F., Pinheiro, J.C., and Branson, M. (2005) Combining multiple comparisons and

modeling techniques in dose-response studies. Biometrics, 61, 738–748

  • EMA (2014) Qualification opinion of MCP-Mod as an efficient statistical methodology

for model-based design and analysis of Phase II dose finding studies under model uncertainty, http://goo.gl/imT7IT

  • Pinheiro, J.C., Bornkamp, B., Glimm, E., and Bretz, F. (2014) Model-based dose finding

under model uncertainty using general parametric models. Statistics in Medicine, 33, 1646–1661

EUGM, Nov 2019

slide-74
SLIDE 74

10/28/19 74

Shaping the Future

  • f Drug Development

Phase II/III: Multi-Arm Multi-Stage Designs

East MAMS

MULTI-STAGE MULTI ARM DESIGNS

Functions:

  • 2-stage “Treatment Selection” design using p-value combination approach by Posch et. al.

(Statistics in Medicine, 2005)

  • MAMS design – Extension of GSD to more than 2 arms

Benefits:

  • Identification of promising therapies and inference on selected treatments performed in

two or more stages

  • Multiple treatments to be compared with a control
  • Allows several primary research questions to be answered in a single trial with increased

efficiency compared to separate trials

New in 6.5:

  • Binomial multi-stage; Survival p-value combination

SUCCESSFUL OUTCOME: Compare Operating Characteristics Of Multi-arm Group Sequential Designs

EUGM, Nov 2019 148

slide-75
SLIDE 75

10/28/19 75

Overview

  • Two-stage Multi-Arm Designs using p-value combination
  • Multi-stage Multi-Arm Designs (MaMs)

149 EUGM, Nov 2019

Two-Stage Design: Motivation

  • Identification of promising therapies and inference on selected treatments

performed in two stages

  • Multiple treatments to be compared with a control
  • After interim analysis in first stage, trial may be terminated or continued with second stage
  • Set of treatments may be reduced due to
  • lack of efficacy
  • presence of safety problems with some of the treatments

Highly flexible procedure with possible re-estimation of the sample size for the second stage

150 EUGM, Nov 2019

slide-76
SLIDE 76

10/28/19 76

Example: Acute Coronary Syndrome

  • Placebo controlled, double blind, randomized trial to evaluate the efficacy, pharmacokinetics,

safety and tolerability of a New Chemical Entity (NCE) given as multiple weekly infusions in subjects with a recent acute coronary syndrome

  • Four dose regimens to be investigated
  • Treatment effect is assessed through change in PAV (percent atheroma volume) from

baseline to Day 36

  • 2-stage adaptive design
  • Stage 1: 250 subjects randomized equally to one of four treatment arms and placebo
  • Stage 2: continue with additional 225 subjects enrolling into 1/2/3 arms or stop due to

toxicity

  • One-sided level 0.025 test

151 EUGM, Nov 2019

Design in East

Setting est parameters

152 EUGM, Nov 2019

slide-77
SLIDE 77

10/28/19 77

Design in East

Specifying Stopping Boundaries

153 EUGM, Nov 2019

Design in East

Specifying response generation

154 EUGM, Nov 2019

slide-78
SLIDE 78

10/28/19 78

Design in East

Treatment selection for stage II

155 EUGM, Nov 2019

Keeping Best Two Treatments In Stage II

156 EUGM, Nov 2019

slide-79
SLIDE 79

10/28/19 79

Keeping All Four Treatments In Stage II

157 EUGM, Nov 2019

Keeping Only Treatments Within Ε = 0.05 In Stage II

158 EUGM, Nov 2019

slide-80
SLIDE 80

10/28/19 80

Maintaining Type I Error

159 EUGM, Nov 2019

Design Under Different Alternative Hypotheses

160 EUGM, Nov 2019

slide-81
SLIDE 81

10/28/19 81

Sample Size Re-Estimation At Interim

161 EUGM, Nov 2019

Multi-Stage Design: Motivation

  • Identification of promising therapies and inference on selected treatments performed in two
  • r more stages
  • Multiple treatments to be compared with a control
  • Allows several primary research questions to be answered in a single trial with increased

efficiency compared to separate trials

  • Various modifications lead to distinct MaMs designs
  • group-sequential MaMs designs
  • drop-the-loser(s) designs

162 EUGM, Nov 2019

slide-82
SLIDE 82

10/28/19 82

MaMs in EAST

  • Generalization of group sequential design to more than two arms
  • An alternative to the combination function approach of Posch et. al. (SiM, 2005)
  • Current implementation:
  • trial stops if any arm crosses efficacy boundary
  • trial stops if all arms cross futility boundary
  • drop the losers at each interim look
  • Under development: dose selection and adaptive SSR

163 EUGM, Nov 2019

K-look GSD

  • Only 1 comparison to a control, made K times

K-look MAMS:

  • D comparisons to common control, made K times
  • Generalization of Dunnett’s test

Mathematical Framework

[ ]

i 1 K j1 jD j i1 iD i i 1 j 1

P max{W ...W } e and max{W ...W } e )

  • =

=

é ù ( < ³ = a ë û

å !

[ ]

i 1 K j j i i i 1 j 1

P W e and W e )

  • =

=

é ù ( < ³ = a ë û

å !

slide-83
SLIDE 83

10/28/19 83

Boundary Computations

165

Two Arm Trial:

  • Wj, j=1,2,3, are scalars. Trial stops if

W1≥e1 or W2≥e2 or W3≥e3

  • We want P0(W1≥e1 or W2≥e2 or W3≥e3)=α
  • Computations are simplified because Wj and (Wj-Wj-1) are independent

EUGM, Nov 2019

Boundary Computations

166

Multi-Arm Trial:

  • Wj=(Wj1,Wj2,...WjD) are vectors. Trial stops if

max(W11,W12,...W1D)≥e1 or max(W21,W22,...W2D)≥e2 or max(W31,W32,...W3D)≥e3

  • Want P0{max(W11,W12,...W1D)≥e1 or max(W21,W22,...W2D)≥e2 or

max(W31,W32,...W3D)≥e3}= α

  • Computations are complex because the components (Wj1,Wj2,...WjD) are

correlated whereas Wj and (Wj-Wj-1) are independent

EUGM, Nov 2019

slide-84
SLIDE 84

10/28/19 84

Once daily bronchodilators for COPD (Am. J. Respiratory & Critical Care, vol 182, 2010) Compare three doses (150 mg, 300 mg, 500 mg) of Indacaterol to Placebo Endpoint: Week 12 change from baseline in 24 hour trough FEV1 Expect differences from placebo of between 0.14 and 0.18 liters with standard deviation s=0.5 Design for 90% power at one-sided a=0.025

Inhance Trial: Chronic Obstructive Pulmonary Disease

Requires 165 patients/arm for d=0.18, s=0.5 Expected sample size under H1 is 132/arm

Two-Arm, Three-look GSD

slide-85
SLIDE 85

10/28/19 85

Four-Arm, Three-look MAMS

Requires 130 patients/arm for d=0.18, s=0. Expected sample size under H1 is 105/arm

EUGM, Nov 2019 169

Compare the 2 Arm & 4-Arm Boundaries

2-arm Boundaries On Z-scale 4-arm boundaries on Z-scale

EUGM, Nov 2019 170

slide-86
SLIDE 86

10/28/19 86

Table of Boundary Comparisons

Look Info Fraction Two Arm Four Arm 1 0.333 3.704 3.976 2 0.667 2.514 2.856 3 1.0 1.992 2.391

Plot of Boundary Comparisons

Higher Hurdle With 4-arm Trial

171 EUGM, Nov 2019

The boundaries for the 4-arm trial are higher than for the 2 arm trial This compensates for the greater probability of boundary crossing under H0 But the sample size/arm is lower for 4-arm trial. (More chances to exit under H1) What would happen if the value of d was not the same for each treatment

Boundary and Sample Size Comparison

slide-87
SLIDE 87

10/28/19 87

4-arm Design With Different D Values

  • Same boundaries, but requires commitment of 169/arm
  • Expected sample size under H1 is 136/arm
  • Here 4-arm design requires more patients/arm than 2-arm design
  • The higher efficacy boundary hurdle is not offset by extra opportunities to

cross the efficacy boundary because only one dose has a strong effect

EUGM, Nov 2019 173

Introduce A Futility Boundary For 676 Patient Trial With D=(0.18, .14, .14)

174 EUGM, Nov 2019

slide-88
SLIDE 88

10/28/19 88

Impact of futility boundary; 2% power drop

175

  • Power dropped to 88% due to introduction of a futility boundary
  • The efficacy boundary is unchanged since futility boundary is non-binding
  • Trial stops for futility only if ALL the arms cross the futility boundary
  • But individual arms that cross the futility boundary will be dropped

EUGM, Nov 2019

Simulate Trial For Additional Insight

EUGM, Nov 2019 176

slide-89
SLIDE 89

10/28/19 89

More Simulation Details

177 EUGM, Nov 2019

Marginal & Detailed Outcome Tables

178 EUGM, Nov 2019

slide-90
SLIDE 90

10/28/19 90

What If Two Treatments Were Ineffective

179 EUGM, Nov 2019

More Simulation Details

180 EUGM, Nov 2019

slide-91
SLIDE 91

10/28/19 91

Marginal & Detailed Outcome Tables

181 EUGM, Nov 2019

Simulation under the global null

182 EUGM, Nov 2019

slide-92
SLIDE 92

10/28/19 92

Shaping the Future

  • f Drug Development

Phase III: Group Sequential Designs -- Survival

East SEQUENTIAL

Functions:

  • Extensive selection families of stopping rules for efficacy and futility
  • Display boundaries on multiple scales.
  • Optimize trial design for savings in sample size, study duration, & cost
  • Conditional and Predictive Power calculations for interim decisions

New in 6.5:

  • Equivalence Group Sequential Designs

Pre-Requisites: East BASE

SUCCESSFUL OUTCOME: Compare operating characteristics of group sequential designs

EUGM, Nov 2019 184

slide-93
SLIDE 93

10/28/19 93

East SURVIVAL

Test Survival Endpoints In Superiority & Non-inferiority Studies

Functions:

  • Variable & fixed subject follow-up
  • Piecewise hazard rates, accruals, & dropouts
  • Charts for predicting events/sample size, accrual & study duration
  • Simulate non-proportional hazards

New in 6.5:

  • Go-No-Go Based on Surrogate Endpoints

Pre-Requisites: East SEQUENTIAL

SUCCESSFUL OUTCOME:

Compute Events, Sample Size, Study Duration, For Complex Survival Designs

EUGM, Nov 2019 185

Survival Studies

For studies with survival or time-to-event endpoints, the asymptotic distribution theory and the derivation of stopping boundaries remains the same There are however some special considerations

  • The information is directly proportional to the number of events
  • Thus the number of events, not the number of patients, determines the power of the study
  • If study duration is not fixed, there is a trade-off between sample size and study duration.

186 EUGM, Nov 2019

slide-94
SLIDE 94

10/28/19 94

Survival Studies

If we recruit more patients to the study, we obtain the required number of events sooner and the total study duration is reduced

187 EUGM, Nov 2019

Example: The JUPITER Study

  • Justification for the Use of Statins in Prevention: an Intervention Trial Evaluating

Rosuvastatin (JUPITER) examined the question of whether treatment with 20 mg

  • f rosuvastatin daily, as compared with placebo, would reduce the rate of first

major cardiovascular events (Ridker et al., 2008)

188 EUGM, Nov 2019

slide-95
SLIDE 95

10/28/19 95

Example: The JUPITER study (cont.)

  • JUPITER was a randomized, double-blind, placebo-controlled, multicenter trial

conducted between 2003 and 2008 by AstraZeneca at 1315 sites in 26 countries

  • Composite primary endpoint: occurrence of a first major cardiovascular event,

defined as nonfatal myocardial infarction, nonfatal stroke, hospitalization for unstable angina, an arterial revascularization procedure, or confirmed death from cardiovascular causes

189 EUGM, Nov 2019

Example: The JUPITER study (cont.)

  • Designed for statistical power of 90% to detect a 25% reduction in the

rate of the primary end point, with a two-sided significance level of 0.05

  • We are interested in a 25% reduction in the rate of the primary endpoint,

i.e. a hazard ratio λt= λc = 0.75; the effect size is thus δ = -ln(λt /λc ) = 0.2877

  • Baseline hazard rate in placebo arm of 0.0077
  • Study to complete in 7.5 years, with 4 years accrual and 3.5 years of

follow-up

  • Two interim analyses are planned with LD(OF) spending function defined

boundaries at 37.5% and 75% of the information

  • How do we design such a trial?

190 EUGM, Nov 2019

slide-96
SLIDE 96

10/28/19 96

JUPITER Study Design in East

  • We can use the Logrank Test Given Accrual Duration and Study Duration design in

East to obtain a 3-look GSD for the JUPITER trial

  • East tells us that the required number of events is Dmax = 517 as previously

determined, and that we will require 14,229 subjects accrued over 4 years

191 EUGM, Nov 2019

Planning for DMC Meetings

  • The Events vs. Time Chart is useful in planning for interim analyses & DMC meetings;

eg., after the second look, 389 events should occur roughly 6.1 years into the study assuming the treatment effect is δ1

192 EUGM, Nov 2019

slide-97
SLIDE 97

10/28/19 97

JUPITER Design Dropouts

  • Dropouts: 5% per year in both the treatment and placebo arms
  • Like events: Dropouts assumed exponential, with corresponding hazard

rates

  • Like Cumulative % survival: Cumulative % dropout calculated for patient

time (from study entry), not calendar time (from study start)

193 EUGM, Nov 2019

JUPITER Design Accruals

  • Non constant accrual (piece-wise linear):

15% by end of year 1; 35% by end of year 2; 65% by end of year 3

194 EUGM, Nov 2019

slide-98
SLIDE 98

10/28/19 98

Revisited JUPITER Design in East

Note that since we have not changed the effect size δ1 that we are powering the trial for, Dmax = 517 has not changed. However, the dropouts and slow starting accrual means we now need 17,344 patients if we want to finish the study within 7.5 years

195 EUGM, Nov 2019

Surrogate Endpoints in Survival Studies

EUGM, Nov 2019 196

slide-99
SLIDE 99

10/28/19 99

Background

  • Most time to event endpoint requires longer follow-up.
  • Any early stopping (futility) decision using primary endpoint only be performed at late time

point trial.

  • Secondary endpoint (PFS, DFS, ORR) can be used to make Go-No Go decision about the trial

without inflating type I error.

EUGM, Nov 2019 197

Example : Malignant Mesothelioma

  • Available treatments are macroscopic complete resection or systemic chemotherapy
  • Median survival time is 1.5 years in average.
  • Recently a newer type of treatment being tested on mesothelioma is gene therapy.
  • Consider a study to evaluate a new gene therapy treatment compared against placebo in

patients with MPM after completing surgery + chemotherapy

EUGM, Nov 2019 198

slide-100
SLIDE 100

10/28/19 100

2 Stage GSD Based on OS

  • Primary Endpoint: Overall survival (OS)
  • 2-stage GSD at 0.025 alpha (1-sided); LD(OF) spending function efficacy; Median OS of 16

months in the placebo group vs a 24 months on the new treatment.

  • 30 months Accrual Duration and final analysis will be taken at month 48.
  • Detecting 90% power required 257 events and sample size 192 on each arm (total 382).

EUGM, Nov 2019 199

Interim Analysis

  • IA at 28 months(129 OS events).
  • 90% of total population will be enrolled by 28 months and therefore stopping for

futility at IA only saves 8% resources.\

  • Stopping for futility earlier will force us to stop without seeing 50% of planned

number of events.

  • Should consider secondary endpoint like PFS, ORR.
  • Each patient will have regular PFS assessment in every 3 months for at least 3 years
  • Median PFS is 8 months in the placebo and 11 months
  • Correlation between OS and PFS is 0.75

EUGM, Nov 2019 200

slide-101
SLIDE 101

10/28/19 101

Event vs Time Chart

EUGM, Nov 2019 201

Simulation Results

EUGM, Nov 2019 202

slide-102
SLIDE 102

10/28/19 102

Shaping the Future

  • f Drug Development

Phase III: Sample Size Re-Estimation East ADAPT / SURVADAPT

Incorporate Unblinded Sample Size Re-estimation Rules

Functions:

  • Adaptive rules for increasing sample size, or other possibilities
  • Methods include CHW, CDL, Müller-Schäfer
  • Specific adaptive tools for survival (eg., adapt sample size and events)
  • Müller-Schäfer Method for Interim Monitoring
  • Unique to East:
  • Promising Zone Design based on unblinded interim data
  • Adjusted unbiased point estimates, confidence Intervals, and p-values

New in 6.5:

  • SSR for Non-Inferiority designs

SUCCESSFUL OUTCOME: Improve statistical power when results are ‘promising’

EUGM, Nov 2019 204

slide-103
SLIDE 103

10/28/19 103

Adaptive SSR

EUGM, Nov 2019 205

We don't know what δ and σ to power the study for

  • Prior experience limited to small pilot studies
  • Improved standard of care dilutes treatment effect
  • Powering for smallest clinically important effect expensive
  • Better safety profile at interim might justify smaller δ
  • Opportunity to combine internal and external data

If only σ is unknown, blinded SSR is recommended by FDA

Motivation for Mid-Course Sample Size Correction in Pivotal Trials

206

slide-104
SLIDE 104

10/28/19 104

Large effects are uncommon, but designing for very small clinically meaningful effects requires huge up- front investments that management will not approve. A strategy of staged investment is more practical

  • Unreliability of Pilot Studies: Most large treatment effects emerge from small studies, and when

additional trials are performed, the effect sizes typically become much smaller. Well-validated large eff ects are uncommon and pertain to nonfatal outcomes. Pereira et. al., JAMA. 2012; 308(16): 1676-1684

  • Milestone-Driven Investment: Sunesis Pharmaceuticals to Implement One-Time Sample Size

Increase to Phase 3 VALOR Trial in AML. DSMB Recommends Increase Following Single, Pre- Planned Interim Efficacy and Safety Analysis of VALOR; DSMB Recommendation Triggers $25.0 Million Investment in Sunesis from Royalty Pharma. Press Release, September 11, 2012. Sunesis Pharma, South San Francisco

Why Not Design For The Smallest Clinically Meaningful Treatment Effect?

207

Two arm, multicenter trial with second line therapy for metastatic non-small cell lung cancer Primary endpoint is overall survival (OS) Median for control arm is 8 months Require 90% power to detect HR = 0.7 (median = 11.4 months on experimental arm) One-sided level 0.025 test with one interim look for early efficacy or futility stopping Design 24 month enrollment and 12 months additional follow-up

Example: Metastatic Lung Cancer

208

slide-105
SLIDE 105

10/28/19 105

Group Sequential Design

  • Uncertainty about

HR=0.7;

  • HR = 0.77 is still

clinically meaningful but requires 760 patients and 618 events.

  • Up-front commitment is

impossible

209

Design optimistically (HR=0.7; 332 events; 416 subjects) One interim analysis after 50% information

  • Stop if overwhelming evidence of efficacy ( $

𝐼𝑆 ≤ 0.63)

  • Stop if overwhelming evidence of futility ( $

𝐼𝑆 > 1.02)

  • Increase number of events and sample size at the interim if interim results fall in a

promising zone Can define promising zone in terms of conditional power, or HR, or Z-statistic Special CP calculator available in East

Adaptive Strategy

210

slide-106
SLIDE 106

10/28/19 106

Partition the interim outcome into three zones based on the estimated conditional power. For example:

  • Unfavorable: 𝐷𝑄 < 35% ; no change in design
  • Promising: 35% ≤ 𝐷𝑄 < 90% ; increase resources
  • Favorable: 𝐷𝑄 ≥ 90%; no change in design

Use simulation to experiment with promising zones Use simulation to experiment with sample size re-estimation rules Use Cui, Hung, Wang (CHW), Chen, DeMets & Lan (CDL) or Mueller and Shaeffer (MS) methods to control type-1 error

The Promising Zone Design

211

Conditional Power Calculator

35% CP corresponds to $ 𝐼𝑆 = 0.83 at the interim analysis 90% CP corresponds to $ 𝐼𝑆 = 0.73 at the interim analysis

212

slide-107
SLIDE 107

10/28/19 107

Schema of Adaptive Design

213

Primary driver of power is number of events FDA guidance recommends increase only, not decrease Increase events by amount needed to achieve some target conditional power, subject to a cap Compute sample size increase necessary to achieve the desired increase in events without undue prolongation of the trial Complex relationship exists between increase in events, increase in sample size and study

  • duration. Best evaluated by simulation

Adaptation Principles

214

slide-108
SLIDE 108

10/28/19 108

Adaptive Simulation Worksheet

215

Operating Characteristics

216

slide-109
SLIDE 109

10/28/19 109

It is believed that true HR is between 0.7 and 0.77 Option 1: Power the trial for HR=0.77 with aggressive early stopping boundaries

  • Large up-front commitment is often an obstacle
  • Aggressive stopping boundaries require spending more alpha at the interim
  • Stopping a trial prematurely with aggressive boundaries is unlikely to alter medical practice
  • Overruns can be problematic

Option 2: Power the trial for HR=0.7 and increase resources in promising zone

  • Requires a lower up-front commitment
  • Additional commitment only called forth if it is needed
  • Compromise design: Better than non-adaptive trial powered at HR=0.7 but not as

powerful (unconditionally) as the non-adaptive design powered at HR=0.77.

Concluding Observations

217

Shaping the Future

  • f Drug Development

Phase III: Adaptive Population Enrichment

slide-110
SLIDE 110

10/28/19 110

Extending SURVAdapt

EUGM, Nov 2019 219

Overview

  • Background
  • Method and Assumptions
  • Software
  • Case study in AS
  • Q&A

EUGM, Nov 2019 220

slide-111
SLIDE 111

10/28/19 111

Background

Population Enrichment (PE): prospective use of any patient characteristic to obtain a study population in which detection is more likely than in unselected population Types of PE:

  • Prognostic: identify high risk patients based on biomarkers
  • Predictive: identify patients more likely to respond

Issues:

  • Cost of clinical trials increasing
  • Discovery of blockbuster drugs on the decline; move away from “one size-fits-all idea”

EUGM, Nov 2019 221

Background (cont)

Importance:

  • Help identify high responsive group, detect treatment effect with smaller sample size
  • Failed molecules from one study, may succeed in a different group

Examples:

  • BMS immunotherapy Opdivo failed in lung cancer study whereas Merck competitor

Keytruda succeeded: In later case study population was enriched by including only subjects with high level of PD-L1

  • ISPY2 identifies different combination therapies across different biomarker subgroups

EUGM, Nov 2019 222

slide-112
SLIDE 112

10/28/19 112

Method & Assumptions

Based on Predictive Enrichment Study population: divided in two groups based on a predefined biomarker Study will materialize into two independent cohorts

  • First cohort recruits from full population
  • Second cohort recruitment depends on an interim analysis based on the first cohort data only

At interim:

  • Continue with full population
  • Continue with sub-population
  • Stop the trial for futility

Subpopulation prevalence will be user-specified

EUGM, Nov 2019 223

Angiosarcoma is an ultraorphan disease Poorly addressed by current treatments

  • Pazopanib a VEGF inhibitor shows modest benefit
  • TRC105 can compliment Pazopanib by inhibiting endoglin, a different angiogenic target

Adaptive trial considered optimal due to:

  • Small population (1800 cases/year)
  • Limited prior data
  • Subgroup interaction. Greater benefit possible with TRC105 for cutaneous vs visceral tumors

Case study: The TAPPAS trial

EUGM, Nov 2019 224

slide-113
SLIDE 113

10/28/19 113

The TAPPAS Trial: Objective

Demonstrate superior PFS of the combination of TRC105 + pazopanib vs single agent pazopanib

EUGM, Nov 2019 225

Advanced cutaneous and non-cutaneous AS not amenable to curative intent surgery Measurable disease by RECIST 1.1 No prior treatment with a VEGF inhibitor 0,1 or 2 prior lines of therapy ECOG ≤ 1

Eligibility

EUGM, Nov 2019 226

slide-114
SLIDE 114

10/28/19 114

p1: p-value for data from cohort 1 p2: p-value for data from cohort 2

2-Stage Design with SSR & Enrichment

ALL COMERS Interim Analysis

Favorable: Continue As Planned Promising: Increase Sample Size Unfavorable Continue As Planned Enrich With Cutaneous Subgroup TRC105 +Pazopanib

Pazopanib

Stop For Futility

EUGM, Nov 2019 227

Let 𝐼9

:and 𝐼9 ; denote the null hypotheses for the full population and the

cutaneous group respectively Let 𝐼9

:; = 𝐼9 :⋂𝐼9 ; denote the global null hypothesis

Closed testing principle states that type I error is strongly controlled as long as

  • Each of the hypotheses in the closed family is tested at local level-α
  • 𝐼9

: significant only if both tests for 𝐼9 : and 𝐼9 :;are significant at local level-α

  • 𝐼9

; significant only if both tests for 𝐼9 ; and 𝐼9 :;are significant at local level-α

Preserving type I error

EUGM, Nov 2019 228

slide-115
SLIDE 115

10/28/19 115

In case of no enrichment, declare significance on Full population if

𝑥>Φ@> 𝑞>

:; + 𝑥BΦ@> 𝑞B :;

> 𝑨D 𝑥>Φ@> 𝑞>

: + 𝑥BΦ@> 𝑞B :

> 𝑨D

In case of enrichment, declare significance on cutaneous subgroup if

𝑥>Φ@> 𝑞>

:; + 𝑥BΦ@> 𝑞B ;

> 𝑨D 𝑥>Φ@> 𝑞>

; + 𝑥BΦ@> 𝑞B ;

> 𝑨D

Closed Testing With Combination Of P-values

EUGM, Nov 2019 229

Independence of p1 and p2 essential for valid level-a test Use of auxiliary data in the censored observations of cohort 1 that become completers in cohort 2 is forbidden

  • ORR data, lab values, toxicities of patients censored for PFS in cohort 1 can provide insights

about their eventual PFS result in cohort 2. Type I error could be inflated Use of auxiliary data to decide on enrichment destroys independence of p1 and p2 (Jenkins, Stone and Jennison, Pharmaceutical Statistics, 2011)

Special Considerations for Event-Driven Trials

EUGM, Nov 2019 230

slide-116
SLIDE 116

10/28/19 116

Do not choose the interim analysis time point to split data into cohort 1 and cohort 2 Instead, pre-specify allocation of events and sample size to each cohort before taking the interim analysis

  • 70 patients and 60 events for cohort 1
  • 54 patients and 35 events for cohort 2
  • Interim analysis after 40 events have arrived

That will permit full inspection of all cohort 1 data at interim analysis, including censored data

How to permit use of auxiliary data

EUGM, Nov 2019 231 EUGM, Nov 2019 232

slide-117
SLIDE 117

10/28/19 117

Unfavorable Enrichment Cohort 1 Promising Favorable

EUGM, Nov 2019 233

Control arm (Pazopanib): median time to progression = 4 months HR expected to be 0.55 Accruals: 8 pts/month Power: 83%

The TAPPAS trial: non-adaptive case

EUGM, Nov 2019 234

slide-118
SLIDE 118

10/28/19 118

The TAPPAS trial: Design parameters

EUGM, Nov 2019 235

The TAPPAS Trial: Adaptation Parameters Tab

(No Adaptation)

No sample size increase in full population Disable Enrichment and Futility stopping

EUGM, Nov 2019 236

slide-119
SLIDE 119

10/28/19 119

The TAPPAS Trial: Adaptation Parameters Tab

(No Adaptation)

EUGM, Nov 2019 237

The TAPPAS Trial: Simulation Results (No Adaptation)

Design has 74% power (compared to 83% for fixed design) An adaptive population enrichment should only be considered if there is a possibility that there is a strong treatment effect in the subpopulation, but NOT its complement

EUGM, Nov 2019 238

slide-120
SLIDE 120

10/28/19 120

The TAPPAS trial: Design Parameters

When 40 events have arrived, cohort 1 will have met its sample size quota (62 pts) while the total sample size from cohort 1 plus cohort 2 will be 98 patients. Data from half of these extra 36 pts assigned to cohort 2 will NOT be used if there is enrichment Should increase n and events for cohort 1

EUGM, Nov 2019 239

The TAPPAS Trial: Design Parameters

16 pts have been assigned to cohort 2 at the time of the interim analysis If interim results fall in the enrichment zone, data from at most 8 cohort 2 patients will be useable

EUGM, Nov 2019 240

slide-121
SLIDE 121

10/28/19 121

The TAPPAS trial: Adaptation Parameters

EUGM, Nov 2019 241

The TAPPAS trial: Simulation results (with adaptation)

Design has 64% overall power Power conditional on falling in Enrichment zone is 75% Power conditional on falling in promising zone is 81% Power conditional on falling in favourable zone is 83%

EUGM, Nov 2019 242

slide-122
SLIDE 122

10/28/19 122

Simulation Outputs

HR(C,NC) Zone P(Zone) Power E(Dur) E(N) 0.55,0.55 Unfavorable 15 45 21 124 Enrich 5 81 35 163 Promising 38 93 30 198 Favorable 42 92 22 124 Total 100 80(97) 25(25) 149(200) 0.55,0.70 Unfavorable 20 32 21 124 Enrich 11 78 34 160 Promising 41 83 30 198 Favorable 28 87 21 124 Total 100 70(86) 25(25) 151(200) 0.55,0.85 Unfavorable 25 23 20 124 Enrich 19 76 33 158 Promising 37 79 30 198 Favorable 19 83 20 124 Total 100 62(66) 25(25) 150(200) 0.55,1.00 Unfavorable 28 18 20 124 Enrich 27 75 32 156 Promising 33 76 29 198 Favorable 12 79 20 124 Total 100 57(43) 25(25) 148(200)

EUGM, Nov 2019 243

Compared to fix design of n=200, adaptive design provides

  • Greater power
  • Smaller sample size
  • Shorter duration

Adaptive design maintains 80% power in favorable, promising and enrichment zones for HR=0.55 (cutaneous) even with larger HRs in the non-cutaneous subgroup

Results

EUGM, Nov 2019 244

slide-123
SLIDE 123

10/28/19 123

Shaping the Future

  • f Drug Development

Q&A Session

Shaping the Future

  • f Drug Development

Backup Slides

slide-124
SLIDE 124

10/28/19 124

EUGM, Nov 2019 247