|
1
|
- David F. Ransohoff, MD
Departments of Medicine and Epidemiology
University of North Carolina - Chapel Hill
|
|
2
|
|
|
3
|
|
|
4
|
|
|
5
|
- 1. Lessons from history
• evaluating diagnostic markers (‘rules of evidence’)
- 2. ‘Threats to validity’ from
• chance
• bias
- 3. Problems in current research
• genomics - RNA arrays/BrCa prognosis
• proteomics - serum/ovarian cancer screening
- 4. Lessons for future
• exploring ‘-omics’ fields
|
|
6
|
- 1. Non-invasive markers: Holy Grail of cancer diagnosis
- CEA, CA19-9, CA125
- MRI of blood
- 2. CEA
- initial report (PNAS): ~100% sensitivity, specificity for
CRC
- high expectations
- disappointment when expensive ACS/CCS study did not
reproduce initial
results
- Disappointment would have been
predicted and avoided
if ‘rules of evidence’ were available.
|
|
7
|
- 3. Lessons from CEA, for field of clinical epi/HSR
- led to methodology to
evaluate diagnostic tests
-- rules of evidence
-- concepts of bias, validation, ‘spectrum’
(Ransohoff and Feinstein. NEJM 1978)
Methodology is still
underdeveloped in 2005.
- 4 Past problems due to ‘culture clash’
- fields of laboratory medicine, clinical epidemiology have
different ways of thinking, methods, rules of evidence,
- ‘Culture clash’
continues to be problem in 2005.
|
|
8
|
- Knowledge of molecular biology provides targets to measure
- past: knew little about
what to target
- now: know DNA ‘path’
from normal.. adenoma.. CRC
- Assays to measure targets
- past: ‘one dimensional’ assays, like CEA, FOBT, PSA
- now: multi-dimensional assays (measure almost any target)
-DNA - primers and probes;
amplify signal
-protein - mass spectroscopy
|
|
9
|
- But Mother Nature closely guards (her) secrets.
- New reductionist methods mean more data, but not necessarily more
knowledge.
- Rules of evidence: not changed.
- Our job:
- -- to efficiently explore new technologies/fields
-- to avoid predictable mistakes, inflated expectations.
- Exploration must be interdisciplinary, translational:
molecular biology, clinical epidemiology, biostatistics
|
|
10
|
- 1. Definitions of “validation” are diverse, confusing.
- 2. But main concepts
• are simple
• can be usefully applied, right now
|
|
11
|
|
|
12
|
- 1. Chance
Does chance explain ‘discrimination’?
- 2. Bias
Does bias explain ‘discrimination’?
- 3. Generalizeability
Does discrimination occur in clinically useful groups?
- (1) and (2) must be addressed in every study
• or the study is not worth publishing
• and (3) is not worth asking.
|
|
13
|
|
|
14
|
|
|
15
|
|
|
16
|
|
|
17
|
- for clinical practice
- “... gene-expression patterns of primary tumours are better than
available clinicopathological methods for determining the prognosis of
individual patients.6,10,11”
- Ramaswamy and Perou, Lancet
2003;361:1576-7
- for biological research
- “... compelling evidence... genetic program of a cancer cell at
diagnosis defines its biologic behavior many years later, refuting a
competing hypothesis....”
Wooster and Weber, NEJM 2003;348:2339-47
|
|
18
|
- Definition: In multivariable predictive models, overfitting occurs when
a large number of predictor variables is fit to a small N of
subjects. A model may ‘fit’ well
or perfectly by chance, even if no real relationship. Simon, JNCI 2003
- Consequence: results not reproducible in new set of data
- Method to check for: assess reproducibility of model in independent
validation group (done <10% of reports.
Ntzani, Lancet 2003)
|
|
19
|
|
|
20
|
- to the editor:
- “In research to validate a prognostic system, the inclusion of 61
patients [of the 295 in the ‘validation group’] from the original
training group... [means] the validation group is not independent....
[and] the degree of prognostic discrimination may have been
inflated....”
- NEJM 2003;348:1716.
|
|
21
|
- Purpose: “to look at...
utility... of the Amsterdam 70-gene prognostic signature (Van de Vijver
et al, N Engl. J Med, 2002)... external, independent validation of the
signature.”
- Results: “...overall performance
of the 70-gene prognostic signature was inferior... compared to the
original Amsterdam series”
[emphasis added]
|
|
22
|
- RNA expression and prognosis: Does it work?
-It may work, in some tumors (e.g., lymphoma)
-But perhaps not as well as claimed for tumors like BrCa
that are heterogeneous
(discuss Paik NEJM 2004; did avoid chance)
- 2. “Validation”: Is it
necessary?
-Yes; must be appropriately addressed in
every study
- See, in press:
- Bias as a threat to validity of cancer molecular-marker research. Nat Rev Cancer 2005.
Lessons from controversy: ovarian cancer screening and serum
proteomics. JNCI 2005.
|
|
23
|
- This is only one example of chance/ overfitting as ‘threat to validity’
in ‘-omics’ research.
- Are many other examples (Ntzani, Lancet, 2003); but problems will be
easy to avoid, by checking for reproducibility in ‘independent’ group of
subjects.
|
|
24
|
- In contrast, bias is:
• less appreciated (and even less commonly addressed)
• more difficult to handle (as in any observational epi.)
- Bias: systematic difference
between compared groups, causing ‘erroneous’ comparison. (example...)
- Bias is a “plague upon the house of epidemiology.” (Cole)
- Biases are:
• numerous (also no agreed-on nomenclature)
• difficult to avoid
• some are difficult to identify
|
|
25
|
- Proving innocence involves process:
a. in design - to avoid bias
b. in conduct - to measure if it occurred
c. in interpretation - to determine if important
- Included in process also is to report a,b,c in detail in
Methods, Results, and Discussion.
- See Ransohoff DF. Bias as a
threat to validity of cancer
molecular-marker research.
Nat Rev Cancer 2005;5:in press
|
|
26
|
|
|
27
|
- • claims for multiple cancers (ovary, prostate, breast)
-sensitivity: 95-100%
-specificity: 95-100%
- • claims appear in Lancet, JNCI, WSJ, NBC, PBS, Redbook, etc.
- • Correlogic - patent filing for ‘pattern recognition’ process
- • LabCorp - license to use patented process in lab test
|
|
28
|
|
|
29
|
- Purpose
to diagnose ovarian cancer vs no cancer
- Methods
• ovarian cancer, controls
• serum assessed by mass spectroscopy (SELDI-TOF)
• spectra analyzed by ‘genetic algorithm’ (Correlogic)
- Results
‘patterns’ discriminate
|
|
30
|
|
|
31
|
|
|
32
|
- “The discriminatory pattern correctly identified all 50 ovarian cancer
cases in the masked set... This result yielded a sensitivity of 100%…
specificity of 95%…”
|
|
33
|
|
|
34
|
|
|
35
|
- (from news
report of analysis of Keith Baggerly, Nature 04)
- Bias (one example) can occur if ‘signal’ is introduced by
‘run order’ of specimens on mass spectroscopy machine.
- E.g., If cancers and non-cancers
are run on different days and different chips and if the mass spec
machine ‘drifts’ over time, then non-biologic ‘signal,’ associated with
Ca vs no-Ca, is hard-wired into results.
Signal is not removed by ‘splitting’ sample into ‘training’ and
‘validation’.
|
|
36
|
- Bias is not ‘icing on the cake’; it is the cake.
- Bias is a large topic, difficult*:
-multiple biases; require different methods to address
(e.g., randomization, blinding, uniform handling, etc)
-some methods not available in observational research
-some biases may be impossible to identify
-even 1 bias may be fatal
-bias, and process to address, is routinely ignored
by authors, reviewers, editors in ‘omics’ research
- *See Ransohoff, DF. Bias as a threat to validity of
molecular-marker research. Nat Rev Cancer;2005;5:xx-yy
|
|
37
|
|
|
38
|
|
|
39
|
- 1. Lessons from history
• evaluating diagnostic markers (‘rules of evidence’)
- 2. ‘Threats to validity’ from
• chance
• bias
- 3. Problems in current research
• genomics - RNA arrays/BrCa prognosis
• proteomics - serum/ovarian cancer screening
- 4. Lessons for future
• exploring ‘-omics’ fields
|
|
40
|
- 1. An exciting era, because we:
• know so much biology
• have such powerful tools to measure biology
- 2. But rules of evidence have not changed.
- 3. The degree of hope and hype in 2005 is greater than results will
likely support.
- 4. Disappointment may occur that, in retrospect, will have been
predictable, and is due in part to culture clash.
- 5. We must improve scientific process (e.g., handle bias).
- 6. Our job: Go back to #1... and figure out how to avoid predictable
disappointment and wasted effort... and how to generate useful knowledge
about new markers.
|
|
41
|
- 1. Sullivan Pepe M, Etzioni R, Feng Z, Potter JD, Thompson ML,
Thornquist M, et al. Phases of biomarker development for early detection
of cancer. JNCI 2001; 93:1054-61.
- 2. Ransohoff DF. Developing molecular biomarkers for cancer. Science
2003; 299:1679-80.
- 3. Ransohoff DF. Rules of evidence for cancer molecular-marker discovery
and validation. Nat Rev Cancer 2004; 4:309-14.
- 4. Ransohoff DF. Bias as a threat to validity of cancer molecular-marker
research. Nat Rev Cancer 2005; in press.
- 5. Ransohoff DF. Lessons from controversy: ovarian cancer screening and
serum proteomics. JNCI 2005; in press.
|