Doubly adaptive biased coin designs

Two-arm case. We come back to the PW rule and RPW rule. As it is known, the PW rule is too deterministic and is not applicable when we have delayed responses from patients of treatments. The RPW rule and its generalizations seem solve this problem. However, in using the RPW rule, when qi+q2 < 0.5, the limiting distributions of the proportions of patients assigned to each treatments are unknown. But in practice, both q\ and <j2 are usually very small. So, the RPW rule is not practical in such cases.

The asymptotic variation of the proportion becomes a big problem in using the RPW rule (even the adaptive designs based on the GPU model) for 9i + ?2 < 0.5. Even in the case that q\ + 92 > 0.5, if 91 + 92 is near 0.5, aRPW is much larger than apw That is to say, the RPW rule is too random so that the asymptotic variance of proportion of patients assigned to each treatment is very large when the cure rates are large (near 1) and so it is much less stable than the PW rule. Also, in using the multi-arm RPW, when K > 3, the expression of A becomes very complex, it is very hard (even impossible in more general cases) to verify the condition A < 1/2.

Now, with keeping the desired allocation proportions V\ = 92/(91 +92) and v2 = 91/(91+92)) just as the case of the PW rule and RPW rule, our goal is to reduce the asymptotic variance. A natural way is as follows. At the (m+l)-th stage, we assign a patient to a certain treatment by comparing the value Nmi/m with v\, or Nm2/m with v?,. If Nmi/m is larger than v\, then we assign a patient to the treatment 1 with a probability less than vi; If Nmi/m is less than v\, then we assign a patient to the treatment 1 with a probability larger than v\\ If ATml/m equals v\, then we assign a patient to the treatment 1 with probability v\ and to the treatment 2 with probability vi- By choosing suitable allocation function, we may minimize the asymptotic variance. However, v\ and V2 are unknown, so they should be replaced by their estimators based on the sample of the previous m stages. So, the following adaptive design of clinical trial is considered and proposed.

At the first stage, a patient is assigned to each treatment with the same probability 1/2. After m assignments, we let Smk be the number of successes of all the Nmk patients on the treatment k in the first m assignments, k = 1,2, as usual. And let pmk = (Smk + l/2)/(Nmk + 1) be the sample estimate of pk, and write qmk = 1 — pmh, k = 1,2. At the (m + l)-th stage, the (m + l)-th patient is assigned to the treatment 1 with probability g(Nmi/m,vmi), and to the treatment 2 with probability

1 - g(Nmi/m,vmi), where vm\ — qm2/(qmi + 9m2) is the sample estimate of Vi = 92/(91 + qi). The function g(x,p) is called allocation rule. A large class of functions can be chosen as a allocation rule. If it is of the following form:

9(0, p) = 1, 5(1, P) = 0, g{x, p) = where a > 0, then we have

P(f)fl

p(f)ai + (i-p)(TE£)a'

W„i /log log n /-/-^"i

-vi=0{\ ) o.s. and Vn( —— ~ *>i) -* N(0,erDABC) n V n n where

5 _ 2 _ glg2(Pl+P2) , 2gig2 , „n

*ZMBC - *ô - {qi+q2)3 + (i + 2a)(ft+ft)3 ( 2 1 ) (c.f., Hu and Zhang [20]). It should be noted that the asymptotic normality holds for all 0 < pi < 1 and 0 < P2 < 1. Recall that ap\y and CTRPW are defined in (1) and (16), respectively. It is easily seen that a\ is a strictly monotonous decreasing function of a > 0 and a% —> crpw as a —• +00.

Also, a2a < a\pW for all a > 1, whenever 91+92 > 1/2. Furthermore, if 91 + 92 is near 1/2, then aa is much smaller than ORPW SO, this adaptive design is more stable than the RPW rule. This design is compromise between the stability in the PW rule and the randomization in the RPW rule. Such design can keep the spirit of the RPW rule in that it assigns more patients to the better treatment and allows delayed responses by the patients (c.f., Hu, Zhang, Chan and Cheung [22]).

In such kind of designs, the assignments are adapted by both the re- sults of responses and the current proportions of patients assigned, and its original idea came from Efron's [13] biased coin design. So, they are called doubly adaptive biased coin (DABC) design, first introduced by Eisele [10]

and Eisele and Woodroofe [12] in the two-arm case. Hu and Zhang [20]

filled a gap of their proof and studied a general multi-arm case.

Multi-arm case. Now, consider an K-treatment clinical trial. Assume the response sequence {£m} is a sequence of i.i.d random vectors. Suppose the desired allocation proportion of patients assigned to each treatment is a function of some unknown parameters of the response {£„}• That is, the goal of the allocation scheme is to have Nm/ m —• v = p ( 0 ) , where P(v) = (pi(y), • • •, PK{Y)) '• R ^ ~* (0, l)K is a vector-valued function sat- isfying p(y)l' — 1, & — (0I,...,6K) is a vector in RK, and 8k is an unknown parameter of the distribution of £1^, k — 1,...,K. Without

loss of generality, we assume that Ok — E£i,k, k = 1,...,K. Choose a

®o = (#o,i, • - •, ®O,K) S KK as the first estimate of 0 . If m patients are assigned and the responses are observed, we use the sample means 9m,k, which are based on the responses observed, to estimate the parameters Ok as we do in Adaptive Design 2.1, k — I,... ,K, i.e.,

°m,k- Nm,k+1 ' K-l,...,n, and write 0m,fc = (0m,i, • • •, 0m,/r)-

Here, adding 1 in the denominator is for avoiding the case of 0/0, and adding 0o,fe m the numerator is for using 0o,fc to estimate 8k when no patient is assigned to the treatment fc, k — 1 , . . . ,K. Usually, the 0 o is chosen for avoiding pfc(0m) = 0, k = 1 , . . . , K. In practice, ©0 is the guessed value of 0 , or an estimate of 0 from other early trials. The following is the general BADC design.

General D A B C Design Let g(x, p) = {gx (x, p),...,gK{x, p)) :J0,1]* x [0,1]* -> [0,1]* be the allocation rule with g ( x , p ) l ' = 1. Let 0O = 0O

and 0m be estimated as in (22) from the first m observations, m = 1,2,....

Then the (m + l)-th patient is assigned to the treatment fc with probability Pm,k = 3 f c ( Nm/ m , pm) , k = l,...,K, where

Pm = P{®m) (23) is the sample estimate of v = (vi, •.., VK) = p ( 0 ) based on the responses

observed from the first m patients.

Theorem 4 . 1 . Suppose for some e > 0 and 0 < 5 < 1, £]|£n||2 + e < oo and

K dp

p(y) = p(©) + £(yfc-0fc)~f |e + o(l|y-©ll1+*) asy^e.

fe=l yk

Let S i = diag(v) — v ' v ,

a2k = Var{^k), k = l,...,K, St = diag{o-\/vi,..., a2K/vK),

fc=i

Choose the allocation function g(x, p) to be

where a > 0 and L > 1 are constants.

Then in a possibly richer underlying probability space in which there exists two independent K- dimensional standard Brownian motions { Wt} and { Bt} , we can redefine the sequence { Xn, £n} , without changing its distribution, such that

and

where

Nn - nv = Gn + o(n1 / 2 _ K) a.s.

R vi / 2

= B£213 _+ / 2_K ) ^ n

is a solution of the equation:

Gt = Wts l /2 + (l + a ) ^ £ f - a r ^ , i > 0 , Go = O.

t 6 Jo x

In particular,

V ^ ( N „ / n - v , pn- v ) ^ i V ( 0 , A ) , where

A = ( l ^ ^ i + ^ T ^ ^ S s j (26)

Maxima and boundary crossing probabilities of

Cramer type large deviations for independent random