Part I
Phase I Trials
CONTENTS
LIST OF ABBREVIATIONS
AE adverse event
ADR adverse drug reaction
AT acceptable toxicity
AUC area under the curve
(b)CRM (bivariate) continual reassessment method
CTC Common Toxicity Criteria of the National Cancer Institute
DLT dose limiting toxicity (often CTC grade 3 and higher)
EORTC European Organization on Research for Treatment of Cancer
EWOC escalation with overdose control
GCP good clinical practice
ICH International Conference on Harmonization
LD10 lethal dose 10%
MELD10 minimal effective dose level for 10% death
MFDE modified Fibonacci dose escalation
MTD maximum tolerated dose
NCI U.S. National Cancer Institute
PGDE pharmacokinetically guided dose escalation
RW random walk
SAM stochastic approximation method
(S)TER (strict) traditional escalation rule
TDL toxic dose low
UaD up-and-down design
UAT unacceptable toxicity (often CTC grade 4)
WHO World Health Organization
1.1 INTRODUCTION
The phase I clinical trial constitutes a research methodology for the search and esta-
blishment of new and better treatment of human diseases. It is the first of the three
phases, IIII, that became a gold standard of medical research during the second
half of the 20th century.1 The goal of the phase I trial is to define and to character-
ize the new treatment in humans to set the basis for later investigations of efficacy.
Therefore, the safety and feasibility of the treatment are at the center of interest. A
positive riskbenefit judgment should be expected such that possible harm of the
treatment is outweighed by possible gains in cure, suppression of the disease and its
symptoms, and improved quality of life and survival. The phase I trial should define
a standardized treatment schedule to be safely applied to humans that is worthy of
further investigation for efficacy.
For non-life-threatening diseases, phase I trials are usually conducted on human
volunteers, at least as long as the expected toxicity is mild and can be controlled
without harm. In life-threatening diseases such as cancer, AIDS and so on, phase I
studies are conducted with patients because of the aggressiveness and possible harm-
fulness of cytostatic treatments, possible systemic treatment effects, and the high
interest in the new drugs efficacy in those patients directly. After failure of standard
treatments or in the absence of a curative treatment for seriously chronically ill
patients, the new drug may be the small remaining chance for treatment.
The methodology presented below is restricted to the treatment of cancer
patients. The phase I trial is the first instance where patients are treated experimen-
tally with a new drug. Therefore, it has to be conducted strictly under the regulations
of the Declaration of Helsinki2 to preserve the patients rights in an extreme experi-
mental situation and to render the study ethically acceptable. Section 1.2 provides an
outline of the task including the definition of the maximum tolerated dose (MTD),
which is crucial for the design and analysis of a phase I trial. Basic assumptions
underlying the conduct of the trial and basic definitions for the statistical task are
given. The presentation of phase I designs in Section 1.3 distinguishes between the
determination of the dose levels (action space) and the choice of the dose escalation
scheme (decision options). This constitutes the core of this chapter. Phase I designs
proposed during the past fifteen years are introduced there. The sample size per dose
level is discussed separately. Validations of phase I trials rely mostly on simulation
studies because the designs cannot be compared competitively in practice. Practical
aspects of the conduct of a phase I trial, including the choice of a starting dose, are
presented in Section 1.4. Individual dose adjustment and dose titration studies are
also addressed. Further, basic pharmacokinetic methods are outlined, and regulatory
aspects and guidelines are dealt with. The methodology for phase I trials is far from
being at an optimal level at present, and work on the improvement of design used at
present as well as on the development of designs addressing new medical hypothe-
ses is ongoing. Therefore, Section 1.5 will address practical needs, problems occur-
ring during the conduct of a phase I trial, and future research topics.
1. Establishment of an MTD
2. Determination of qualitative and quantitative toxicity and of the toxicity
profile
3. Characterization of DLT
The primary goal is the determination of a maximum safe dose for a specified mode
of treatment as the basis for phase II trials. Activity against tumors is examined and
assessed, but tumor response is not a primary endpoint.
Inherent in a phase I trial is the ethical issue that anticancer treatment is poten-
tially both harmful and beneficial to a degree that depends on dosage. The dose-
dependency is at that stage of research not known for humans.12 To be on the safe
side, the treatment starts at low doses that are probably not high enough to elicit a
beneficial effect. Even worse, after having passed the clinical drug development pro-
gram, the experimental drug may appear as inefficacious or of harm only. The
dilemma of probably unknowingly underdosing patients in the early stages of a phase
I trial has been of concern and has challenged the search for the best possible method-
ology for the design and conduct of a phase I trial. The goal is to obtain the most
information on toxicity in the shortest possible time with the fewest patients.13
Important clinical issues in phase I trials are patient selection, identification of
factors that determine toxicity, drug schedules, and the determination and assess-
ment of target toxicity.6,11 Important statistical issues are the design parameters
(starting dose, dose levels, and dose escalation) and the estimation of the MTD.
1.2.2 ASSUMPTIONS
Most designs for dose finding in phase I trials assume that there exists a monotone
dosetoxicity relationship and a monotone dose(tumor) response relationship.14
This idealized relationship can be depicted as
Methods considered below apply to adult cancer patients with a confirmed diagnosis
of cancer not amenable to established treatment. Usually excluded are leukemia and
tumors in children.9 Phase I studies in radiotherapy may require further consideration
because of long delayed toxicity. The conduct of a phase I trial requires an excellently
equipped oncological center with high quality means for diagnosis and experimental
treatment, for detection of toxicity, and for fast and adequate reaction in the case of
serious adverse events. Furthermore, easy access to a pharmacological laboratory is
needed for timely pharmacokinetic analyses. These requirements indicate the advisa-
bility of restricting a phase I trial to one center or a very few centers.
1.2.3 DEFINITIONS
Throughout this article we denote the set of dose levels at which patients are treated
by D {xi, i 1, . . .} assuming xi xi1. The dose unit is usually mg/m2 body
surface area,15 but depends on the methods or route of application. It is assumed that
patients enter the study one after the other numbered by j, j 1, 2, . . . and that treat-
ment starts immediately after entry under informed consent. Denote by x(j) the dose
The CTC grade 5 is not used in the sequel because death usually is taken as a very seri-
ous adverse event and is preceded by a CTC grade 4 toxicity. Of course, a death related
to treatment has to be counted as DLT. The extensive list of NCICTC criteria has
replaced the list of the WHO17 based on an equivalent 0 4 scale. Investigators plan-
ning a phase I trial have to identify in the CTC list a subset of candidate toxicities for
dose limitation, and they have to fix the grades for which that toxicity is considered to
be dose limiting such that either treatment has to be stopped or the dose has to be
reduced. Usually, a toxicity of grade 3 or 4 is considered as dose limiting. That identi-
fied subset of toxicities from the CTC list and the limits of grading define the DLT for
the investigational drug. Sometimes the list of DLTs is open such that any AE from the
CTC catalog of grade 3 and higher related to treatment is considered as a DLT. Notice
the recent change of the Common Toxicity Criteria, version 2.0 (published April 30,
1999), to the Common Terminology Criteria for Adverse Events (CTCAE), version 3.0
(published June 10, 2003), now containing 28 main categories with a total of more than
1000 single AEs. Changes in the CTC grade classification have to be taken into
account when toxicities are assessed in studies graded under different CTC versions.
During cancer therapy, patients may show symptoms from the candidate list of
DLTs not caused by the treatment but by the cancer disease itself or by concomitant
treatment. Therefore, the occurrence of any toxicity is judged by the clinician or
study nurse for its relation to the investigational treatment. A commonly used assess-
ment scale is as follows:18
(realized in I-MTDs) to describe this distribution. The probability that x exceeds the
random MTD is
and describes the proportion of the population showing a DLT when treated by dose
x. This then becomes the adequate statistical model for describing the MTD. This
probabilistic approach, known as a tolerance distribution model for a quantal
doseresponse relationship,22 allows any reasonable cumulative distribution function
F for the right side of Eq. (1.2). F is a nondecreasing function with values between
0 and 1. In practice one should allow F(0)0 as a baseline toxicity and also
F(xmax) 1 for saturation of toxicity. Classes of well-known tolerance distributions
are the probit, logit, and Weibull models.22 On this probabilistic basis, a practicable
definition of an MTD of a phase I trial is obtained as a percentile of the statistical
distribution of the (random) MTD as follows.
Determine an acceptable proportion 0 1 of tolerable toxicity in the patient
population before accepting the new anticancer treatment. Define the MTD as that
dose xMTD for which the proportion of patients exceeding the DLT is at least as large
as : FMTD (xMTD) or xMTD F 1 MTD ( ). Obviously, there is a direct correspon-
dence between F in Eq. (1.2) and in Eq. (1.1) as
MTD = 1(
, a) (1.4)
Figure 1.1 shows how the MTD is calculated. The choice of depends on the nature
of the DLT and the type of the target tumor. For an aggressive tumor and a transient
and non-life-threatening DLT, could be as high as 0.5. For persistent DLT and less
aggressive tumors, it could be as low as 0.1 to 0.25. A commonly used value is
1/3 0.33.
1.0
0.9
0.8
0.7
0.6
(x, a)
0.5
0.4
0.3
0.2
MTD
0.1
0
1 2 4 6 8 10
x (dose in mg/m2)
F 1( ) a0
MTD (1.6)
a1
1.3 DESIGN
A phase I trial design has to determine which dose levels are applied to how many
patients and in which sequel given the underlying goal of estimating the MTD. This
implies three tasks: determining of the possible set of dose levels, choosing the dose
levels sequentially, and determining the number of patients per dose level.
xi x1 (i 1) x i 1, 2, . . . (1.7)
xi x1 f i 1 i 1, 2, . . . (1.8)
where f denotes the factor by which the starting dose x1 is increased. A pure multi-
plicative set cannot be recommended and is not used in phase I trials because of its
extreme danger of jumping from a nontoxic level directly to a highly toxic level.
Modifications of the multiplicative scheme that start with a few large steps and slow
down later are in use. Such a modified action space could be the result of a mixture
of Eq. (1.7) and Eq. (1.8), where the first steps at low doses are obtained multiplica-
tively and the remaining steps additively. Another, smoother set of doses is obtained
when the factors are decreasing with higher doses, for example:
xi fi 1 xi 1 i 1, 2, 3, . . . (1.9)
where {fi} is a nonincreasing sequence of factors that may start with f1 2 as dou-
bling dose from x1 to x2 and continues with 1fi2 for i2. The modified Fibonacci
scheme described next is of this type. It has been in use from the beginning of sys-
tematic phase I research (see Edler).24
The weakness of the foundation of the MFDE for phase I trials becomes even more
evident when one looks deeper into history.
Fibonacci (Figlio Bonaccio, the son of Bonaccio), also known as Leonardo da
Pisa or Leonardo Pisanus, lived from 1180 to 1250 as a mathematician at the court
of Friedrich II in Sicily working in number theory and biological applications. The
sequence of numbers named after Fibonacci is created by a simple recursive additive
rule: each number is the sum of its two predecessors: fn1 fn fn 1 and it starts
from f1 1 using f0 0 (Table 1.1). Fibonacci related this sequence to the breed-
ing of rabbits problem in 1202 and also to the distribution of leaves about a stem.
The sequence fn grows geometrically and is approximately equal to a[(1 5)/2]n,
where a (1 5)/5. The ratio of successive numbers fn/fn 1 converges to
(1 5)/2 (1 2.236)/2 1.618, the golden section, a famous principle of
ancient and Renaissance architecture. The Fibonacci numbers have been used in
optimization and dynamic programming in the 1950s31 for determining the maxi-
mum of a unimodal function. One application can be illustrated as follows: How
many meters long can a bent bridge be, such that one can always locate its maximum
height in units of meters by measuring at most n times? The solution is given by
Bellmans theorem31 as fn meters. The results say nothing on the placement of the
measurements but only on the needed number of measurements.
In his article on methods for early clinical trials research, Schneiderman10
showed that he was familiar with this work on optimization and cites Bellmans
result but gives the wrong page number (correct is (31) page 34, not page 342). The
optimization result is even better explained in32 on page 152. Schneiderman10 tried
to transpose the Fibonacci search by fixing an initial dose x1, a maximum possible
dose xK , and the number n of steps for moving upwards from x1 to xK. By taking a
Fibonacci series of length n 1, inverting the order, and spacing the doses in pro-
portion to the n intervals in the series,10 Schneiderman obtained an increasing
sequence of doses. In contrast to the MFDE, which is based on a multiplicative set
of doses, this approach is somehow still additive but leads to smaller and smaller
steps toward higher doses similar to the MFDE. However, the steps obtained by
TABLE 1.1
Fibonacci Numbers and Consecutive Ratios of Fibonacci Numbers as well as
Modified Fibonacci Dose Escalation Schemes Used in Clinical Practice24
Fibonacci Numbers Fibonacci Multiples Modified Smoothed Modified
fn , n 1, 2, . . . fn
1 / fn Fibonacci Fibonacci
1
2 2.0 2 2.0
3 1.5 1.65 1.67
5 1.67 1.52 1.50
8 1.60 1.40 1.40
13 1.63 1.33 1.30 1.35
21 1.62 1.33 1.30 1.35
34 1.62 1.33 1.30 1.35
55 1.62 1.33 1.30 1.35
Schneidermans inversion are at the beginning very large and later very small. This
escalation is fixed to K doses and does not open to higher doses if the MTD is not
reached at xK. Schneiderman discussed a restarting of the scheme if no toxicity is
seen at xK and concludes that then no guide seems to exist for the number of steps.
And he confesses that he has not seen it in any published account of preliminary
dose finding. The number of steps in this reversed Fibonacci scheme is strongly
related to the escalation factor and so provides no guidance for dose escalation. In
the same paper, however, he cites a paper of de Vita et al.33 in which a dose escala-
tion with the factors of 2, 1.5, 1.33, and 1.25 is used. A hint to the use of the inverse
of a Fibonacci scheme where the dose increments decrease with increasing numbers
is also given by Bodey and Legha,8 who refer to26 who examined the usefulness of
the modified Fibonacci method as a guide to reaching the MTD.
In summary, it seems that the idea of the so-called MFDE came up in the 1960s
in the NCI where the early clinical trials programs started and that it was promoted
by the scientists mentioned above. They searched for a dose escalation scheme that
slows from doubling the dose to smaller increases within a few steps. The MFDE
(Table 1.1), slowing down the increase from 65% to 33% within the first 5 steps,
seemed reasonable enough to be used in many trials. The method has been success-
ful to the extent that MTDs have been determined through its use. From empirical
evidence and the simulation studies performed later, the MFDE seems now to be too
conservative in too many cases.
from them to Bayesian rules and the continual reassessment method (CRM). Methods
for the determination of the MTD have to be addressed for each of the rules.
TABLE 1.2
Example of a Phase I Study Performed According to TERa
Dose Level Dosage Escalation Factor N DLT Grade
1 0.45 3 0 000
2 0.9 2 6 1 111311
3 1.5 1.67 6 1 041111
4 3 2 3 0 012
5 4.5 1.5 4 0 1122
6 6.0 1.33 3 0 111
7 7.5 1.25 3 1 113
8 10.5 1.4 3 0 111
9 13.5 1.29 4 1 0320
10 17.5 1.29 3 0 010
11 23.1 1.33 6 1 123122
12 30.0 1.3 5 4 33331
13 39.0 1.3 1 1 3
a
Columns show dose level, dosage (units), escalation factor, number of patients N, number of cases with dose
limiting Toxicity (DLT) defined as grade 34, and the actually observed toxicity grades of the N patients.37
Escalation stops
Escalation stops
If the escalation stops at a dose level xi, that level would be the first level of unac-
ceptable DLT, and one would conclude that the MTD is exceeded. The next lower
level xi 1 is then considered as the MTD. It is common practice that at the dose level
of the MTD at least six patients are treated. For this reason, options 2 and 3 above
are often applied at the end of a phase I trial. Therefore, the MTD can be character-
ized practically as the highest dose level below the stopping dose level xi at which at
least six patients have been treated with no more than one case of DLT. If no such
dose level can be identified, the starting dose level x1 would be taken as, MTD.
UaD-C: Proceed as in UaD but escalate only if two consecutive patients are
without DLT.
No DLT DLT
UaD-D: Three patients are treated at a new dose level. Escalate if no DLT and
deescalate if more than one DLT occurs. If exactly one patient shows a DLT,
another three patients are treated at the same level and the rule is repeated.
The two designs were combined with the elementary UaD to Storers BC and BD
two-stage designs:
UaD-BC: Use the UaD until the first toxicity occurs and continue with the
UaD-C at the next lower dose level.
UaD-BD: Use the UaD until the first toxicity occurs and continue with UaD-D
design. For a detailed description of this two-stage design see.38
Simulations revealed a superiority of the UaD-BD over the UaD-BC and the ele-
mentary UaD.39 Although the single-stage designs UaD, UaD-B, and UaD-C are not
considered as sufficient and only the two-stage combinations are proposed for use,
unfortunately, proposals of new designs have been calibrated mostly on the one-
stage designs instead of on the more successful two-stage designs. Therefore, rec-
ommendations for practice are hard to deduct from those investigations.
A new sequential RW is the so-called biased coin design (BCD)40 applicable for
an action space DK {x1. . .xK}. The BCD is shown in Figure 1.5. Given patient
j has been treated on dose level x( j) xi , the next patient j 1 is treated at the next
lower level xi 1 if a DLT was observed in patient j, otherwise at xi with some prob-
ability p# not larger than 0.5 or at xi1 with probability 1 p# (not smaller than 0.5).
When reaching boundaries x1 and xK , the procedure must stay there. This design cen-
ters the dose allocation unimodally around the MTD for any
for 0 0.5 if p#
is chosen as /(1 ) and if the MTD is within the action space DK. Interestingly, the
33rd percentile MTD1/3 is obtained with a nonbiased coin of probability p# =
(1/3)/(2/3) = 0.5.
No DLT DLT
thus enter into the particulars of the ethical dilemma of phase I trials. Bayesian
designs are best suited when patients are treated one at a time and when toxicity
information is available quickly. A full Bayesian design44 is set up by Eq. (1.1), the
prior distribution g(a) of the model parameter a, and a continuous set actions D. A
gain function (negative loss) G is needed to characterize the gain of information if an
action is taken given the true (unknown) value of a, or equivalently the MTD.
The sequentially collected dosetoxicity information up to the j 1 th patient is
summarized by
j = {y1, . . . ,yj 1}. The information upon the parameter a is described
by the posterior density distribution of the parameter a given
j. Using the dose toxi-
city function (1.1), the likelihood of the data
j for the first j 1 patients is given by
j 1
(x(s), a) ys [1 (x(s), a)]1 ys
fj 1(
j , a) = s1 (1.10)
fj 1(
j , a)g(a)
g(a|
j) = .
fj 1(
j , a)g(a)da
(1.11)
The next dose x( j) is determined such that the posterior expected gain E[g(a) |
j] is
maximized with respect to x( j). For details see.44 Gatsonis and Greenhouse45 used a
Bayesian method to estimate directly the dosetoxicity function and the MTD in
place of the parameter a. The probability P(DoseMTD
) of overdosing a
patient was used as the target parameter for Bayesian estimation in the so-called esca-
lation with overdose control (EWOC) method.43 Whitehead and Brunier44 use the pre-
cision of the MTD estimate obtained from the next patient as the gain function.
updated with each patients observation with regard to the absence or presence of DLT.
Each patient is treated at the dose level closest to the currently estimated MTD.
Assume a finite action space DK {x1. . .xK} of dose levels, a fixed sample size
N, and a one-parameter dosetoxicity function (x, a) as defined in Eq. (1.1).
Estimation of the MTD is therefore equivalent to the estimation of a. Assume that there
exists an unique solution a0 at the MTD satisfying (MTD, a0)
. Using the nota-
tion of the previous section, assume fj 1(
j, a) is normalized by its integral over a.
Using CRM, the next dose level is determined such that it is closest to the cur-
rent estimate of an MTD
. For this, the probability of a toxic response is calculated
for each xi DK as
P(xi ,a) (xi, a) f (a,
j)da (1.12)
0
and the dose level x x( j) for the jth patient is selected that minimizes the distance
from P(x, a) to the target toxicity rate
. After observing toxicity Yj at dose level x( j),
the posterior density of the parameter a is updated by the likelihood of the jth obser-
vation. After the planned N patients have been treated, the MTD is estimated as
MTD = x(N + 1), the dose for the (N + 1)st patient. Consistency in the sense that the
recommended dose converges to the target level has been shown even under model
misspecification.47
xCRM ( j) if xCRM ( j) x( j 1)
x( j) xi1 if xCRM ( j) x( j 1) and if yj 1 0 (no DLT)
xi if xCRM ( j) x( j 1) and if yj 1 1 (DLT)
1. Modified CRM but stay with x( j) always one dose level below xCRM ( j)
(version 1 in)49
2. Modified CRM but x( j) is not allowed to exceed the MTD estimate based
on
j (version 2 in)49
3. Modified CRM but use the starting level of the CRM and enter there three
patients (version 4 in)49
4. CRM with escalation restricted within DK {x1 . . . xK} to one step
only (restricted version in)50
5. Modified CRM but stopping if the next level has been already visited by
a predetermined number of patients. Korn et al.48 introduce additionally a
dose level x0x1 for an estimate of the MTD if a formally proposed MTD
equal to x1 would exhibit an unacceptable toxicity.
6. Modified CRM run in three variants of one, two, or three patients per
cohort.52 This version is identical to modification 5 except that a dose
level cannot be passed after a DLT was observed.
7. CRM but allow more than one patient at one step at a dose level and limit
escalation to one step.51 A simulation study showed a reduction of trial
duration (5070%) and reduction of toxicity events (2035%) compared
to the CRM.
see also.46 Unfortunately, the results are based on different simulation designs and dif-
ferent criteria of assessment such that a comparison is rather limited.
Roughly summarizing the findings of those studies it appears that the TER is
inferior to the UaD-BC and the UaD-BD39 in terms of the fraction of successful tri-
als. The UaD-BD is superior to UaD-BC at least in some situations. The BCD design
from the RW class and the CRM performed very similary.40 By examining the dis-
tribution of the occurrence of toxic dose levels, percentage of correct estimation, or
percentage treated at the MTD or one level below, the single-stage UaD designs were
inferior to the CRM;23 the UaD recommended toxic levels much more often than the
CRM. STER designs were inferior to some modified CRMs with respect to the per-
centage of correct estimation, the percentage treated at the MTD or one level below,
and the percentage of patients treated with very low levels.48,49 The CRM and some
modified CRMs needed on the average more steps than the STER, and they showed
a tendency of treating more patients at levels higher than the MTD. The STER rec-
ommended more dose levels lower than the true MTD. The STER provided no effi-
cient estimate of the MTD and did not stop at the specified percentile of the
doseresponse function.60
estimate of a standard error of the toxicity rate at the chosen dose level is impaired
by the small number of cases (6) and also by the design. A general method for ana-
lyzing dosetoxicity data is the logistic regression of the Yj on the actually applied
doses x(j), j 1, . . . . , n, of all patients treated in the trial.63 This would, however,
disregard any dependency of the dosetoxicity data on the design that had created
the data and may therefore be biased. If the sampling is forced to choose doses below
^
the true MTD, MTD may be shifted toward lower values.
AUC(LD10, mouse)
F
AUC(StartingDosehuman) (1.13)
was used to define a range of dose escalation. One tenth of MELD10 is usually taken
as the starting dose x1. Two variants have been proposed. In the square root method,
the first step from the starting dose x1 to the next dose x2 is equal to the geometric
mean between x1 and the target dose F x1, i.e., x2 xF
1
x1 = x1F. Subsequent dose
escalation continues with the MFDE. In the extended factors of two method, the first
steps are achieved by a doubling dose scheme as long as 40% of F has not been
attained. Then one continues with the MFDE. Potential problems and pitfalls were
discussed in Ref. 69. Although usage and further evaluation were encouraged and
guidelines for its conduct were proposed, the PGDE has not often been used. For a
more recent discussion and appraisal see Newell.70
more than one phase I trial for the same drug. This poses, however, ethical concerns
about the premises of the phase I being the trial under which the first-time patients
are treated. The repeat of phase I trials has not found much methodological consider-
ation in the past.
Given multicentricity and enough patients per center, some restricted type of
randomization may be feasible and should be considered, for example, randomly
choosing the center for the first patient for the next dose level. Further improvement
was put into the perspective of increasing the sample size of phase I trials and so
increasing the information.13 The definition of the toxicity criteria and their assess-
ment rules are mostly beyond the statistical methodology but are perhaps more cru-
cial than any other means during the conduct of trial. Care must be taken that no
changes in the assessment of toxicity occur progressively during the trial. The recent
changes of the CTC criteria from version 2 to version 3 may pose some challenges
on the comparability of study results.
Statistical estimation of the MTD has been restricted above to basic procedures
leaving aside the question of estimation bias when neglecting the design.
Babb et al.43 noted that they can estimate the MTD using a different prior than used
in the design and refer to work of others who suggested to use a Bayesian scheme
for design and a maximum likelihood estimate of the MTD. Laboratory toxicity is,
however, available on a quantitative scale, and this information could be used for the
estimation of an MTD.76 Mick and Ratain77 proposed therefore ln(y) a0 a1 x
a2 ln(z) as dose-toxicity model where y was a quantitative measure of toxicity (nadir
of white blood cell count) and where z was a covariate (pretreatment white blood cell
count). Further research and practical application are needed to corroborate the find-
ings. A multigrade dose escalation scheme was proposed in78 that allows escalation
as well as reduction of dosage using knowledge of the grade of toxicity. They
showed that the multigrade design was superior to the standard design and that it
could compete successfully with the two-stage designs. But it was not
uniformly better. The authors also admitted that a multigrade design is harder to
comprehend and use, and to these authors knowledge that design has never been
used in practice, although it may have influenced recent research in titration designs.
ACKNOWLEDGMENTS
This work could not have been done without the long-standing extremely fruitful
cooperation with the Phase I/II Study Group of the AIO in the German Cancer
Society, the work done with Wolfgang Queier, Axel Hanauske, Heiner Fiebig, and
the more recent work for the Biometric Center of the Central European Society for
Anticancer Drug Research (CESAR) guided by Christian Dittrich and Bernhard
Keppler. We are indebted also for the support and statistical input from the Biometry
Departments of Martin Schumacher (Freiburg) and Michael Schemper (Vienna),
Harald Heinzl (Vienna), and our next-door colleague, Annette Kopp-Schneider. For
technical assistance we have to thank Gudrun Friedrich for analyses and Regina
Grunert and Renate Rausch for typing and for the bibliography. Finally, we would
like to thank John Crowley for all the help and encouragement.
REFERENCES
1. Simon RM. A decade of progress in statistical methodology for clinical trials. Stat.
Med. 1991;10:17891817.
2. World Medical Association. Declaration of Helsinki, (http://www.wma.net/e/policy/
b3.htm).
3. Carter SK. Study design principles for the clinical evaluation of new drugs as devel-
oped by the chemotherapy programme of the National Cancer Institute. In: Staquet
MJ, ed. The Design of Clinical Trials in Cancer Therapy. Brussels: Editions Scient.
Europ., 1973:242289.
4. Schwartsmann G, Wanders J, Koier IJ, Franklin HR, Dalesio O, Hornstra HW,
et al. EORTC New Drug Development Office coordinating and monitoring pro-
gramme for phase I and II trials with new anticancer agents. Eur. J. Cancer.
1991;27:11621168.
5. Spreafico F, Edelstein MB, Lelieveld P. Experimental bases for drug selection. In:
Buyse ME, Staquet MJ, Sylvester RJ, eds. Cancer Clinical Trials; Methods and
Practice. Oxford: Oxford Univ. Press, 1984:193209.
6. Carter SK, Bakowski MT, Hellmann K. Clinical trials in cancer chemotherapy. In:
Carter SK, Bakowski MT, Hellmann K, eds. Chemotherapy of Cancer. 3rd ed. New
York: John Wiley, 1987:2931.
7. EORTC New Drug Development Committee. EORTC guidelines for phase I trials
with single agents in adults. Eur. J. Cancer Clin. Oncol. 1985;21:10051007.
8. Bodey GP, Legha SS. The phase I study: general objectives, methods, and evaluation.
In: Muggia FM, Rozencweig M, eds. Clinical Evaluation of Antitumor Therapy.
Nijhoff, The Netherlands: Dordrecht, 1987:153174.
9. Leventhal BG, Wittes RE. Phase I trials. In: Leventhal BG, Wittes RE, eds. Research
Methods in Clinical Oncology. New York: Raven Press, 1988:4159.
10. Schneiderman MA. Mouse to man: statistical problems in bringing a drug to clinical
trial. In: Proc. 5th Berkeley Symp. Math. Statist. Prob. 4th ed. Berkeley: Univ. Calif.
Press, 1967:855866.
11. Von Hoff DD, Kuhn J, Clark GM. Design and conduct of phase I trials. In: Buyse ME,
Staquet MJ, Sylvester RJ, eds. Cancer Clinical Trials; Methods and Practice.
Oxford: Oxford Univ. Press, 1984:210220.
12. Ratain MJ, Mick R, Schilsky RL, Siegler M. Statistical and ethical issues in the
design and conduct of phase I and II clinical trials of new anticancer agents. J. Nat.
Cancer Inst. 1993;85:16371643.
13. Christian MC, Korn EL. The limited precision of phase I trials. J. Nat. Cancer Inst.
1994;86:16621663.
14. Kerr DJ. Phase I clinical trials: adapting methodology to face new challenges. Ann.
Oncol. 1994;5:S67S70.
15. Pinkel D. The use of body surface area as a criterion of drug dosage in cancer
chemotherapy. Cancer Res. 1958;18:853856.
16. National Cancer Institute (NCI). Cancer Therapy Evaluation Program, Common
Terminology Criteria for Adverse Events. Version 3.0. DCTD, NCI, NIH, DHHS
March 31, 2003 (http://www.ctep.cancer.gov) Publication date June 10, 2003.
17. World Health Organization (WHO). WHO Handbook for Reporting Results of Cancer
Treatment. WHO Offset Publication No.48. Genf: WHO, 1979.
18. Kreuser ED, Fiebig HH, Scheulen ME, Hanauske AR, Keppler BK, Mross KB, et al.
Standard operating procedures and organization of German phase I, II, and III study
groups, New Drug Development Group (AWO), and Study Group of Pharmacology
41. Dixon WJ, Mood AM. A method for obtaining and analyzing sensivity data. J. Am.
Statist. Assoc. 1948;43:109126.
42. Robbins H, Monro S. A stochastic approximation method. Ann. Math. Stat. 1951;
29:400407.
43. Babb J, Rogatko A, Zacks S. Cancer phase I clinical trials: efficient dose escalation
with overdose control. Stat. Med. 1998;17:11031120.
44. Whitehead J, Brunier H. Bayesian decision procedures for dose determining experi-
ments. Stat. Med. 1995;14:885893.
45. Gatsonis C, Greenhouse JB. Bayesian methods for phase I clinical trials. Stat. Med.
1992;11:13771389.
46. OQuigley J. Phase I and Phase I/II dose finding algorithms using continual reassess-
ment method. In: Crowley J, ed. Handbook of Statistics in Clinical Oncology. New
York, Marcel Dekker, 2005: This volume.
47. Shen LZ, OQuigley J. Consistency of continual reassessment method under model
misspecification. Biometrika. 1996;83:395405.
48. Korn EL, Midthune D, Chen TT, Rubinstein LV, Christian MC, Simon RM. A com-
parison of two phase I trial designs. Stat. Med. 1994;13:17991806.
49. Faries D. Practical modifications of the continual reassessment method for phase I
clinical trials. J. Biopharm. Stat. 1994;4:147164.
50. Moller S. An extension of the continual reassessment methods using a preliminary
up-and-down design in a dose finding study in cancer patients, in order to investigate
a greater range of doses. Stat. Med. 1995;14:911922.
51. Goodman SN, Zahurak ML, Piantadosi S. Some practical improvements in the con-
tinual reassessment method for phase I studies. Stat. Med. 1995;14:11491161.
52. Ahn C. An evaluation of phase I cancer clinical trial designs. Stat. Med. 1998;
17:15371549.
53. Hanauske AR, Edler L. New clinical trial designs for phase I studies in hematology
and oncology: principles and practice of the continual reassessment model.
Onkologie 1996;19:404 409.
54. OQuigley J, Reiner E. A stopping rule for the continual reassessment method.
Biometrika 1998;85:741748.
55. Heyd JM, Carlin BP. Adaptive design improvements in the continual reassessment
method for phase I studies. Stat. Med. 1999;18:13071321.
56. OQuigley J. Continual reassessment designs with early termination. Biostat. 2002;
3:8799.
57. Rademaker AW. Sample sizes in phase I toxicity studies. ASA Proc. Biopharm. Sect.
1989;137141.
58. Geller NL. Design of phase I and II clinical trials in cancer: a statisticians view.
Cancer Invest. 1984;2:483491.
59. OQuigley J, Chevret S. Methods for dose finding studies in cancer clinical trials: a
review and results of a Monte Carlo study. Stat. Med. 1991;10:16471664.
60. Storer BE. Small-sample confidence sets for the MTD in a phase I clinical trial.
Biometr. 1993;49:11171125.
61. Simon RM, Freidlin B, Rubinstein LV, Arbuck SG, Collins J, Christian MC.
Accelerated titration designs for phase I clinical trials in oncology. J. Nat. Cancer Inst.
1997;89:11381147.
62. Benner A, Edler L, Hartung G. SPLUS support for analysis and design of phase I trials
in clinical oncology. In: Millard S, Krause A, eds. SPLUS in Pharmaceutical Industry.
New York: Springer, 2000.
63. Finney DJ. Statistical Method in Biological Assay. London: C. Griffin, 1978.