|
1
|
|
|
2
|
- Ross Prentice, PhD
- Surrogate Endpoint Definition and Application
- Stuart Baker, ScD
- Recent Approaches to Surrogate Endpoint Validation
- David Ransohoff, MD
- New Complexity: The “Omics” Revolution
- Daniel Hayes, MD
- Methods of Biomarker Validation
|
|
3
|
- Surrogate outcome definition
- Conceptual framework for associations of treatment, surrogate, and true
endpoint
- Proposed meta-analysis approach of borrowing information in prior
studies of similar treatments in similar populations
|
|
4
|
- Process of validating markers or endpoints
- Hypothesis testing framework
- Estimation framework
- Recommended meta-analysis estimation approach to validate surrogate
endpoint
- Real examples?
- Has this method been validated empirically?
- Other approaches? Bayesian method?
|
|
5
|
- Promises and disappointments of cancer markers
- Rules of evidence not well developed
- Current overly optimistic interpretation of “omics” data
- Bias as threat to validity
|
|
6
|
- Many proposed tumor markers
- Most inadequately validated
|
|
7
|
- Current concepts
- How best to evaluate existing evidence?
- How to design better future studies?
|
|
8
|
- Evidence is seldom single sourced (basic science, animal, human
observations, human experiments)
- Observational studies vs RCTs
- Surrogates vs clinical outcomes
- Mega-trials vs (meta-analyses) small trials
- Large RCTs vs large RCTs
- Methodological quality of the studies
- Publication bias
|
|
9
|
- BMJ 1998; Oxman et al.
- NEJM 2000; Concato et al.
- NEJM 2000; Benson et al.
- JAMA 2001; Ioannidis et al.
|
|
10
|
- A total of 45 topics were considered.
- They were identified from comprehensive searches of MEDLINE, The
Cochrane Library, previous relevant publications and personal archives
– c. 3,000 meta-analyses were
screened.
- The 45 topics included 408 primary studies with available binary data
(240 RCTs and 168 NROS)
- NROS included 71 prospective studies, 40 retrospective cohort studies,
25 case-control studies, 29 studies with historical controls, and 3
studies with unclear designs
|
|
11
|
|
|
12
|
|
|
13
|
|
|
14
|
- Statistically significant heterogeneity between randomized trials was
seen in 9 of 39 topics with at least 2 RCTs included
- Statistically significant heterogeneity between the non-randomized
studies was seen in 13 of 32 topics with at least 2 NROS included
- The estimated between-study heterogeneity tended to be smaller among
RCTs than among NROS (p=0.032)
|
|
15
|
- In 25 of 45 cases, the non-randomized studies showed a larger treatment
effect for the experimental treatment than the randomized trials. The opposite occurred in 14 cases, but
it was a data artifact in 3 of them.
In 6 topics there was either no clear-cut experimental arm or the
effects were similar (p=0.009).
|
|
16
|
- Discrepancies beyond chance were observed in 12 of 45 cases by fixed
effects and in 7 of 45 cases by random effects
- In these discrepancies, almost always the treatment effect was more
favorable in NROS
- When limiting analyses to prospective studies, there were disagreements
in 2 of 26 topics (8%)
|
|
17
|
- Treatment effects in RCTs and observational studies on the same topic
tend to be highly correlated
- Nevertheless, discrepancies do occur in about 1 out of 6 cases, even
when between-study heterogeneity is accounted for
- Typically, discrepant pairs tend to show more favorable results in
observational studies
- Discrepancies in the absolute magnitude of effect (=“how much it works”)
are very common
|
|
18
|
- Observational studies exhibit larger variability in their treatment
effects than RCTs
- Discrepancies are more common when retrospective observational designs
are considered
- Both RCTs and NROS must be carefully scrutinized for sources of genuine
heterogeneity and bias
- RCTs and NROS should not be seen as mutually exclusive domains of
research
|
|
19
|
- Villar et al. Lancet 1995
- Cappelleri et al. JAMA 1996
- LeLorier et al. NEJM 1997
|
|
20
|
- Definition of large (arbitrary, power)
- Source of meta-analyses (why done?)
- Source of large trials
- Types of outcomes ( 1o,
2o )
- Meta-analysis statistics (FEM, REM)
- Definition of agreement (p-value, corr.)
- Reasons for disagreement
|
|
21
|
|
|
22
|
- Cappelleri JC, Ioannidis JPA,
deFerranti SD, Schmid CH, Aubert M, Chalmers TC, Lau J. Large trials versus meta-analyses of
smaller trials: How do their results compare? JAMA 1996; 276:1332-38.
|
|
23
|
|
|
24
|
- By random effect calculations, agreements found between large and
smaller trials in:
- 90% selected by sample size approach (1,000); 82% by statistical power
approach
- Twice as many disagreements appeared when the variability among large
studies and the variability among smaller studies was not considered
(fixed effects calculations).
|
|
25
|
|
|
26
|
|
|
27
|
- Of 15 disagreements between results of large and smaller trials using
the random effects model, plausible explanations were identified in 10
meta-analyses:
- 5 with differences in the
control rate between large
- and smaller trials
- 4 with specific protocol or study differences
- 1 with potential publication bias
- 2 other disagreements were not clinically important
- tentative reasons could be identified for 2 of the remaining 3
disagreements
|
|
28
|
- Meta-analyses of smaller studies are generally comparable with results
from large studies.
- Differences can be attributed to insufficient sample sizes, control
rates, or protocols.
- These reasons are not mutually exclusive.
- Publication bias is a possibility but has never been proven to be a
factor.
- Need to explore reasons for heterogeneity.
|
|
29
|
|
|
30
|
|
|
31
|
- “megatrial” defined as >1,000 patients
- 289 pairs identified in Cochrane Library
- 79/289 (27%) pairs were statistically significantly different from each
other
- 133 comparisons in LeLorier article
- 36/133 (27%)were statistically significantly different
|
|
32
|
- Agreement among megatrials was approximately as large as that reported
between meta-analyses and megatrials
- If we were to base the recommendation for the treatment in question on
the primary outcome, 53% (Cochrane set) and 31% (LeLorier set) of the
treatment recommendation by a megatrial was not confirmed by a later
megatrial.
- On the other hand, 30% to 47% of the treatments once found ineffective
or harmful in a megatrial were shown to be beneficial by a later
megatrial.
|
|
33
|
- Heterogeneity of treatment effects is common among clinical trials,
whether they are large or small; RCTs or observational studies
- Meta-analysis of small trials (dis)agree with large trials approximately
as often dis(agreement) among large trials themselves
- We need to understand the cause of heterogeneity in clinical trials and
learn how to handle them in meta-analysis
|
|
34
|
- Gotszche and Olsen. Lancet 2000;355:129
- A 1999 study found no decrease in breast cancer mortality in Sweden,
where screening has been recommended since 1985
- Reviewed methodological quality of mammography trials and repeated a
meta-analysis
|
|
35
|
- 8 trials identified
- Baseline imbalances were found in 6 of 8 trials
- 2 adequately randomized trials found no effect of screening on on breast
cancer mortality
- pooled risk ratio 1.04 (95% CI 0.84 - 1.27)
- 6 inadequately randomized trials found significant effect
- Pooled risk ratio 0.75 (95% CI
0.67 – 0.83)
|
|
36
|
|
|
37
|
|
|
38
|
- Based on Randomization adequacy
- Based on minor differences in mean age
- Failed to consider other explanations for difference in mean ages
- Failed to consider other measures of quality
|
|
39
|
|
|
40
|
|
|
41
|
|
|
42
|
|
|
43
|
|
|
44
|
|
|
45
|
|
|
46
|
|
|
47
|
|