statistical optimality

J P Morgan J.P.Morgan at qmul.ac.uk
Tue Apr 15 12:57:20 BST 2003


Hello All,

Below is a proposal regarding how statistical optimality criteria should be 
incorporated into the external representation. First a few comments on the 
breadth of allowable optimality functions. In the talk on 3rd April I showed 
the definition of a type-I optimality function, essentially a convex function 
of the eigenvalues plus a few extra conditions. The point of this is not that 
any particular function in that family would actually be used or indeed would 
hold any substantive statistical interest. Rather, the notion is employed 
because

a) It provides a minimal mathematical framework within which general 
optimality results can be obtained. The generality can help clarify (even 
simplify) the underlying mathematical problem. For instance, useful bounds 
for criteria values can be derived using only the type-I conditions. The 
bounds can then be applied to determine optimality for any particular 
criterion of interest. In the work with irregular BIBDs, this approach 
results in conditions for A- and D-optimality in a unified framework.

b) Occasionally a design or class of designs can be shown optimal w.r.t. all 
type-I criteria. Such a strong result is highly desirable - not just because 
it shows the tremendous statistical "strength" of the designs in question, 
but also in precluding the need for separate arguments for each criterion of 
genuine interest. It is not, however, an endorsement of more than cursory 
statistical relevance for each and every individual criterion in the family.  

As another example, statisticians also consider Schur-optimality: optimality 
w.r.t. all Schur-convex functions of the eigenvalues. One may be able to show 
that the vector of eigenvalues for a certain information matrix is majorized 
by those of all others, and thus it corresponds to a Schur-optimal design. 
But again this is no endorsement of all Schur-convex functions as 
individually interesting criteria on their own. It is simply reflective of 
the fact that many of the interesting criteria can be embedded in larger 
families for which we can sometimes determine quite general results. When, as 
is often the case, optimality w.r.t. such a general family does not hold, 
then we turn to individual criteria of interest and try to separately 
establish results for each.

What are these individual criteria of interest? On the one hand, the formal 
optimality theory, starting in earnest from the mid 1970s, has largely (not 
exclusively) concentrated on these four: (A,D,E,MV). Each corresponds to a 
desirable statistical property. On the other hand, earlier statistical block 
design work concentrated on various notions of balance as reflected in BIBDs 
and PBIBDs. Interpretability of results is also an important consideration, 
usually aligned with balance notions such as (for example) number of distinct 
variances, range of variances, etc. Another related issue concerns the 
pattern of variances as it relates to the structure of the treatment set.

A full notion of optimality should reflect the statistical usefulness of a 
design, and consequently should incorporate all of these ideas: efficiency 
w.r.t. alphabetic optimality (such as the four horsemen above) combined with 
notions of balance and structure as is appropriate.

That's my brief take on the background from which we proceed. For the 
external representation we face two main issues: what criteria to include, 
and in what form to express them. A proposal follows.

We currently have a category named efficiency_factors. I propose that we add 
a category under that heading named "optimality_functions_of_efficiency 
_factors" (or something similar), to include geometric mean, harmonic mean, 
etc as Rosemary skecthed out last Wednesday, but limited solely to functions 
of the efficiency factors.  Each is optimized if maximal.

We also include a category at the same level as efficiency_factors named 
optimality_criteria_values. Here we put functions that are minimized for 
optimality. Included are selected functions of the eigenvalues \mu_i of the 
information matrix and other selected criteria not based on eigenvalues. Here 
is my suggested list:

D = \phi_0
A = \phi_1
\phi_2
E = \phi_\infty = 1/\mu_1 = E1
E2 = E1+1/\mu_2 
E3 = E2 + 1/\mu_3
 .
 .
 .
E(v-1) = E(v-2) + 1/\mu_{v-1}
MV = max pairwise variance
# distinct  \mu_i
var_ratio = \mu_{v-1}/\mu_1
# distinct variances for pairwise comparisons
pairwise_var_ratio = MV/min pairwise variance

A,D,E,MV are all there, of course, along with several other ways of thinking 
about dispersion of variances and extremal behavior. The Ej's, which are 
components of a Schur-optimality argument, have also been studied on their 
own, albeit not extensively.


Rationale: 

1. For equireplication, criteria in terms of efficiency factors have the best 
intuitive appeal, so we should have them. They are also what many 
practitioners are more likely to look for.

2. For reasons including 

(i) ruling out nonequireplcate designs as optimality competitors when 
equireplication is possible, 
(ii) robustness studies of equireplicate designs under losses of plots and/or 
blocks, and 
(iii) direct optimality studies of settings (v,b,k) where v does not divide bk

we must include optimality criteria expressed in ways that seemlessly 
encompass both equireplicate and nonequireplcate designs, which is not the 
case when they are based on efficiency factors. This formulation will appeal 
less to practitioners, but holds great appeal to many statisticians working 
in design optimality.

3. There is redundancy between the two proposed categories, but this is not 
unlike redundancy we have incorporated elsewhere, and in this case is 
additionally supported by serving communities of users that only partially 
overlap.

4. The resource we are creating has the potential to make a significant 
impact on the design optimality community. It is thus very important, I 
believe, that we formally consider measures of statistical balance as 
optimality criteria. Why? So to nudge this community away from attitudes like 
"here is A-best and that's the end of story" and towards those like "after 
sifting through all the reasonably efficient designs, the best for this 
experiment has been selected based on a judicious balancing of efficiency, 
dispersion in the variances of estimated contrasts, the number of distinct 
variances, and the structure of the treatment set."  In doing so we enrich 
block design optimality as a field of study while bringing the mathematical 
optimality theory closer to the pragmatic optimality considerations of real 
experiments. Essentially I am saying that optimal design theory is more 
relevant to practice if the class of criteria considered is sufficiently 
rich, and I do not think A,D,E,MV alone make a sufficiently rich class. Why 
not? Because they do not incorporate balance notions that are important from 
multiple perspectives in so many actual experiments. 

Comments/discussion welcome.

JP





More information about the Developers mailing list