Kwangmin Kim - FDA Adaptive Designs

출처

이 글은 사전지식 기반 (교재 미확인 — agent 사전학습 기반). 핵심 인용 — FDA (2019), Lan & DeMets (1983), Lehmacher & Wassmer (1999), FDA Project Optimus (2021).

이 글은 J-ADAPT 시리즈의 마지막 글. FDA Adaptive Designs Guidance (2019) 의 정통 — pre-specification + type I + integrity — 을 다룬다.

1 진입 직관 — “유연성과 통제의 균형”

Adaptive trial 의 위험: post-hoc data dredging — 데이터 본 후 protocol 변경. spurious 효과 발생.

FDA 의 입장: adaptive design 허용 하지만 strict pre-specification 필수. 유연성 ≠ 자유.

3 대 원칙:

Pre-specification: 모든 adaptation rule 은 protocol 에 명시

Type I error control: 다중 비교 보정

Trial integrity: blinding 유지, interim 결과 leak 방지

2 FDA Adaptive Designs Guidance (2019)

2.1 의의

FDA 의 공식 guideline. 이전의 ad-hoc, 논쟁 적 사용 → 명확한 path 제공.

2.2 인정되는 4 카테고리

Group sequential design (GSD)
Sample size re-estimation (SSR)
Response-adaptive randomization (RAR)
Adaptive enrichment (AE)

2.3 각 카테고리의 FDA 권고

GSD: 표준화 — Pocock, O’Brien-Fleming boundary. Lan-DeMets alpha spending 권장.

SSR: Blinded SSR 선호 (interim 결과 보지 않고 variance 만으로 결정). Unblinded SSR 은 Bauer-Köhne combination test 필요.

RAR: 통계 효율 손실 주의 (Korn-Freidlin). 큰 효과 size + 심각 outcome 에서 정당화.

AE: Subgroup pre-specified. Population shrinkage 의 통계 보정.

3 Pre-Specification — 절대 필수

3.1 명시 항목

Protocol 에 명시되어야 할 항목:

Adaptation 시점 (interim analysis n_1, n_2, …)

Adaptation rule (예: \(P(\text{efficacy}) > 0.95\) 시 stop)

Boundary (alpha spending, futility)

Subgroup 정의 (AE)

Sample size cap (SSR)

3.2 Post-Hoc 변경 금지

Trial 진행 중 에 protocol 변경 시 trial integrity 손상. FDA 는 pre-specified 외 변경 거부.

3.3 예외: Pandemic, 안전 이슈

DSMB 의 권고 로 안전 이슈 대응 변경 가능. Documented + transparent.

4 Type I Error Control — Alpha Spending

4.1 동기

Multiple interim analysis 시 각 interim 에서 type I 누적. Family-wise error rate (FWER) 보정 필요.

4.2 Lan-DeMets (1983) Alpha Spending

Pre-specified alpha spending function \(\alpha^*(t)\) — fraction of information used at time \(t\).

매 interim 에서 cumulative alpha spent 계산:

\[ \alpha_k = \alpha^*(t_k) - \alpha^*(t_{k-1}) \]

각 interim 의 boundary — 해당 alpha 의 critical value.

4.3 표준 spending function

O’Brien-Fleming (OBF): \(\alpha^*(t) = 2(1 - \Phi(z_{\alpha/2} / \sqrt{t}))\) — early conservative, later liberal
Pocock: \(\alpha^*(t) = \alpha \cdot \log(1 + (e - 1) t)\) — equal alpha at each interim
Power: \(\alpha^*(t) = \alpha \cdot t^c\) — flexible

4.4 FDA 권장

OBF 가 표준 — early stopping 보수적, late stopping 표준 critical value 가까움.

5 Trial Integrity

5.1 1: Blinding 유지

Interim 결과를 blinded SSR 에서 우선. Unblinded interim 은 DSMB only — investigator, sponsor 차단.

5.2 2: Interim 결과 Leak 방지

Firewalls: DSMB 와 investigator 분리. Statistical Analysis Plan (SAP) 의 strict 준수.

5.3 3: DSMB 의 역할

독립 DSMB 가 unblinded interim 검토. Adaptation 권고 또는 protocol 준수 확인.

5.4 4: Communication

Sponsor → IRB → site 의 adaptation 결과 전달. Patient 등록 영향.

6 Estimator Bias 처리

6.1 동기

Adaptive 후 naive estimator biased — interim 의 효과 큰 arm 이 over-represented.

6.2 1: Conditional Maximum Likelihood (Whitehead 1997)

Interim stopping 의 conditional likelihood 로 보정.

6.3 2: Median Unbiased Estimator (Emerson 1988)

Unbiased point estimator 의 alternative — median unbiased.

6.4 3: Repeated Confidence Interval (Jennison-Turnbull 2000)

Sequential CI — interim stopping 후의 valid CI.

6.5 4: Inverse Probability Weighting (RAR)

RAR 의 adjusted estimator — IPW 로 sample selection bias 보정.

7 FDA Critical Path Initiative

7.1 동기

2004 년 FDA 의 drug development modernization initiative. Adaptive 가 핵심 도구.

7.2 영향

Phase I-II 의 biomarker-driven adaptive 표준화. I-SPY 2 등 platform trial 권장.

8 FDA Project Optimus (2021)

8.1 동기

항암제 dose finding 의 modernization. 전통 MTD (Maximum Tolerated Dose) 가 항상 optimal 아님.

8.2 Adaptive Approach

Multiple dose 동시 평가 + biomarker-driven. 각 dose 의 efficacy + safety 의 종합 optimal.

8.3 의의

FDA 의 adaptive 의 적극 도입 사례. Phase I 의 confirm Phase II target 결합.

9 Lehmacher-Wassmer (1999) — Combination Test

9.1 의의

Adaptive design 의 type I 통제 표준 도구.

9.2 메커니즘

각 stage 의 p-value 를 pre-specified function 으로 결합. Stage 별 p-value 의 weight 가 변경 안 됨 (pre-specified).

9.3 Combination

Inverse normal: \(z = \sum w_k z_k\) — weighted sum of stage z-scores.

Fisher: \(\chi^2 = -2 \sum \log p_k\).

9.4 응용

Bauer-Köhne 의 adaptive sample size 등의 inference 에 사용.

10 시뮬레이션 — Alpha Spending (OBF)

import numpy as np
from scipy.stats import norm

np.random.seed(42)

# Trial setup
n_total = 1000
n_interim = 4   # 4 interim + final
information_fractions = np.array([0.25, 0.50, 0.75, 1.00])
alpha = 0.05

# OBF spending function
def obf_spending(t, alpha=0.05):
    """Lan-DeMets OBF spending function."""
    if t <= 0:
        return 0
    if t >= 1:
        return alpha
    z_a2 = norm.ppf(1 - alpha / 2)
    return 2 * (1 - norm.cdf(z_a2 / np.sqrt(t)))

# Compute boundaries
boundaries = []
prev_alpha = 0
for t in information_fractions:
    cumul_alpha = obf_spending(t, alpha)
    incremental = cumul_alpha - prev_alpha
    # Approximate boundary z (simple — full computation requires recursion)
    # For OBF: z_k = z_critical / sqrt(t)
    z_critical = norm.ppf(1 - alpha / 2)   # 1.96 for 0.05
    z_boundary = z_critical / np.sqrt(t)   # OBF approximation
    boundaries.append(z_boundary)
    prev_alpha = cumul_alpha

print(f"[Alpha Spending (OBF) 시뮬레이션]\n")
print(f"Total alpha: {alpha}")
print(f"Information fractions: {information_fractions}\n")

print(f"Cumulative alpha + boundaries:")
for k, t in enumerate(information_fractions):
    cumul = obf_spending(t, alpha)
    z = boundaries[k]
    print(f"  Interim {k+1} (t={t:.2f}): cumul α = {cumul:.4f}, boundary |Z| > {z:.3f}")

print(f"\n[해석]")
print(f"  OBF: early interim 은 *strict* (큰 효과 발견 시만 stop), late 은 표준 1.96")
print(f"  → 조기 중단 conservative, 통계 power 유지")

# Simulate trial under H1
print(f"\n[H1 (true effect = 0.5 SD) 시뮬레이션]")
true_diff = 0.5
n_per_arm = n_total // 2

control = np.random.normal(0, 1, n_per_arm)
treat = np.random.normal(true_diff, 1, n_per_arm)

# Patient enrollment is sequential — interim n's
interim_n_per_arm = [int(t * n_per_arm) for t in information_fractions]

stopped = False
for k, n_k in enumerate(interim_n_per_arm):
    diff = treat[:n_k].mean() - control[:n_k].mean()
    se = np.sqrt(2 / n_k)
    z = diff / se
    print(f"  Interim {k+1} (n_per_arm = {n_k}): diff = {diff:.3f}, Z = {z:.2f}, boundary = ±{boundaries[k]:.3f}")
    if abs(z) > boundaries[k]:
        print(f"    → STOP — efficacy 발견 (early stopping)")
        stopped = True
        break

if not stopped:
    print(f"\nFinal analysis 통과 — 통계 유의 또는 H0 reject")

# Type I error simulation (under H0)
print(f"\n[Type I error 시뮬레이션 (H0: true diff = 0)]")
n_sims = 1000
n_rejected = 0
for sim in range(n_sims):
    control_sim = np.random.normal(0, 1, n_per_arm)
    treat_sim = np.random.normal(0, 1, n_per_arm)

    rejected = False
    for k, n_k in enumerate(interim_n_per_arm):
        diff = treat_sim[:n_k].mean() - control_sim[:n_k].mean()
        se = np.sqrt(2 / n_k)
        z = diff / se
        if abs(z) > boundaries[k]:
            rejected = True
            break
    if rejected:
        n_rejected += 1

empirical_alpha = n_rejected / n_sims
print(f"  Nominal α: {alpha}")
print(f"  Empirical type I error: {empirical_alpha:.3f}")
print(f"  → OBF boundary 가 type I 잘 통제 (≈ 0.05)")

11 결론

FDA Adaptive Designs Guidance (2019) 가 정통. Pre-specification + Type I control (alpha spending) + Trial integrity (blinding, DSMB) 의 3 대 원칙. Estimator bias 처리 표준 도구. Project Optimus 등 modern application 가속. Adaptive 의 유연성 + 통제 의 균형.

핵심 메시지:

FDA 2019 guidance: 4 카테고리 인정
Pre-specification: protocol 명시, post-hoc 금지
Lan-DeMets (1983): alpha spending — OBF, Pocock, Power
Trial integrity: blinding, DSMB, firewalls
Estimator bias: conditional MLE, median unbiased, IPW
FDA Critical Path / Project Optimus: modernization
Lehmacher-Wassmer 1999: combination test

J-ADAPT 시리즈 종합:

J-ADAPT-0: Overview (4 카테고리 + COVID-19)
J-ADAPT-1: RAR + Play-the-Winner
J-ADAPT-2: Bayesian + Platform Trials
J-ADAPT-3: FDA Adaptive Designs — 이 글

Phase J (Advanced Applications) 종합 (28 편):

HTE — HER Ch.4·5 + ML HTE (12 편)
Causal Inference — DiD (4 편) + RDD (4 편)
AB Testing — Switchback/Geo/Spillover (4 편)
Clinical Trial — Adaptive (4 편)

12 관련 주제

선행 지식

J-ADAPT 시리즈 이전 글들
(Phase F) Sequential testing in A/B
(Phase C) RCT 무작위·protocol

다음 단계

Phase J 완결. 후속 — Phase F (Trustworthy A/B) 의 sequential 응용 또는 응용 프로젝트로.

13 참고문헌

FDA (2019). Adaptive Designs for Clinical Trials of Drugs and Biologics: Guidance for Industry. Center for Drug Evaluation and Research.
Lan, K. K. G. & DeMets, D. L. (1983). Discrete sequential boundaries for clinical trials. Biometrika 70, 659-663.
Lehmacher, W. & Wassmer, G. (1999). Adaptive sample size calculations in group sequential trials. Biometrics 55, 1286-1290.
Bauer, P. & Köhne, K. (1994). Evaluation of experiments with adaptive interim analyses. Biometrics 50, 1029-1041.
Whitehead, J. (1997). The Design and Analysis of Sequential Clinical Trials (2nd ed.). Wiley.
Emerson, S. S. (1988). Parameter estimation following group sequential hypothesis testing. Biometrika 75, 731-739.
Jennison, C. & Turnbull, B. W. (2000). Group Sequential Methods with Applications to Clinical Trials. Chapman & Hall/CRC.
FDA (2023). Optimizing the Dosage of Human Prescription Drugs and Biological Products for the Treatment of Oncologic Diseases (Project Optimus). Draft Guidance.