Indicators

John P. Morgan jpmorgan at chef.stat.vt.edu
Thu Oct 2 17:45:11 BST 2003


On Thu, 2 Oct 2003 14:44:01 +0100
Peter Cameron <p.j.cameron at qmul.ac.uk> wrote

>
>1. Affine
>---------
>I begin with a story. In the 1970s I was at Westfield College,
>where Marion Kimberley was teaching a course on design theory.
>"Design" meant "2-design" there. In a homework exercise, she asked
>students to define an affine design, expecting them to say "a 
resolvable
>design in which any two non-parallel blocks meet in \mu points". One
>student said, "a resolvable design in which any two non-disjoint 
blocks
>meet in \mu points". Marion pointed out that this is not the same 
thing
>for designs with \lambda=1 (e.g. Kirkman systems). For \lambda>1, 
nobody
>knows yet whether the two conditions are equivalent; I was able to 
show
>that there are very strong restrictions. (The first open case is 
that of
>a 2-(70,10,6) design. It is not necessary to assume resolvability; a 
single
>parallel class is enough. Without that, there are counterexamples, 
e.g. the
>3-(22,6,1) Witt design.
>
>The point is that "affine" (as usually interpreted) is more 
restrictive than
>"resolvable and block intersections 0 and \mu", and there is strong
>possibility of confusion here.
>
>The other reason I don't like this is that "affine" is so 
well-established
>and has such a body of theory that I would be reluctant to change 
the
>usage. On the other side, designs with block intersections of size
>\alpha or \beta are well studied: they are called "quasi-symmetric" 
(at
>least if they are 2-designs, and I think the generalisation here is 
>unexceptionable), and for example Shrikhande and Sane have a book 
about
>them.
>
>On the point of principle, I view indicators as being for humans 
rather than
>for computers, and would rather have indicators for concepts for 
which a
>body of theory exists than for "atomic" concepts, whataever that 
means.

The naming problem is a real concern (not just here - more below) and 
Peter's points are well taken. The quasi-symmetry is essentially 
Leonard's suggestion (not retricted to 2-designs). The suggested 
indicator or generalization could well be given a different name, to 
avoid confusion with other usages. 

There is still the separate question of whether the concept is 
sufficiently important to warrant indicator status. A list of reasons 
for its inclusion have been offered, and these should be addressed. 
If the "atomic" concept is rejected in this case, then the next step 
is to consider what is the appropriate combination. In some way I 
think this concept should be given indicator status, for reasons 
already explained. I am less sure about the more general notion of 
(generalized) quasi-symmetry. In any case, let's now focus on the 
concept, after which we can select the appropriate term.

On the more general issue (which Peter addresses in an email that 
followed the one here) here are some thoughts, and the "atomic" issue 
plays in here in a very useful way. We are taking a very general view 
in what we are building. Ours is a database for all block designs, 
including unequal replication, unequal blocksizes, and at a later 
date, nonbinarity. Had design theory historically developed with such 
a broad view, our task would be much simpler, for all of the concepts 
that we wish to report would already be named and accepted for our 
fully general setting. But of course this is not at all the case, and 
Peter's story illustrates the point well: even the term "design" 
still means (as it did then) different things to different people.

Without bothering to strictly define the term, I do support the 
"atomic" concept. First, I do not see where any indicator proposed or 
already included is not understandable by humans. The only hitch is 
that many people are not accustomed to thinking about certain 
concepts outside of certain narrowly defined (relative to our 
endeavor) setups. Our job is to define the general setup well. 
Providing simple, atomic indicators that are natural and obvious (two 
debatable terms) extensions of the general case, that reduce to the 
established concepts in the "usual" cases, is part of what we can 
provide.

All this is saying is "Think of the set of properties you want. We 
provide an indicator of each one for a general block design. Combine 
these as you will to get designs with exactly the properties you 
want." In taking this approach, we make our product *more* user 
friendly, we create the ability to look at many more combinations of 
properties than would otherwise be possible, and consequently we 
provide a vehicle for enhancing research. Will we miss out indicators 
that people will need? Yes, and we will add them as needed. But that 
is no argument for not including obvious, simple generalizations now. 


>
>2. Partially balanced
>---------------------
>For general block designs, there are (at least) three different 
kinds of
>balance: pairwise balance, variance balance, and efficiency balance. 
If
>the block size is constant these all reduce to the same thing.
>
Reduction to the same also requires equireplication and binarity.

>May something similar be true for partial balance? Thus, "pairwise 
partial
>balance" means that the concurrence matrix is in the Bose-Mesner 
algebra
>of an association scheme; "variance partial balance" means that the 
>information matrix is (do I have this right JP?) and there may be a 
kind of 
>"efficiency partial balance" as well.
>
>My proposal is that we do not in ignorance assume that only one of 
these
>three concepts is important. We could easily have three indicators 
as
>outlined above. 
>

I am going to argue that the suggestion is not at all a 
recommendation for proceeding in ignorance. My proposal as made a few 
days ago is that we implement a concept that is already used, and 
that we use the terminology that is already given to that concept. We 
could talk about "pairwise partial balance" and "efficiency partial 
balance" (which can be defined as the weighted information matrix - 
see the ext rep doc - as falling in the Bose-Mesner algebra) too, but 
as far as I known this is truly striking into new territory. Is there 
any mention anywhere in the literature of either of these concepts? 
If we do not know of any, then this may not be the right time to 
implement (they can always be added later), and they may not warrant 
indicator status.  But we have a concept that has seen considerable 
usage, it is named "partial balance", and it covers the full 
generality of what we are doing. Why are we defining it only for 
equireplicate, binary, constant blocksize when usage does not make 
any of these restrictions?

Here is a bit to clarify my earlier remarks. The typical way in which 
"partial balance" appears in the statistical literature is through 
one or more design constructions, the results of which are then shown 
to have partial balance by establishing that their information 
matrices are in the Bose-Mesner algebra. Typically not even the B/M 
term is used, it is just shown that the elements of the info matrix 
take values in accordance with some association scheme. This is done 
for many classes of designs (nested, row-column, nested row-col, 
test/control, etc). This is a large part of the reason for bringing 
it up now -  we either ignore the partial balance concept when we 
turn our attention to other classes of designs, or we come back to 
the same issue, leaving one of two possible questions
1) why did they define partial balance for block designs but not for 
class X?
2) why did they define partial balance in general for class X but 
only for a special case of block designs?

Of course, we could also remove all mention of partial balance from 
the current external rep. Probably not the move we will make, but 
tenable if we are concerned with proceeding in ignorance and if we do 
not know the relevant literature and usage of the term. I have tried 
to explain above that the suggested avenue is not a path of 
ignorance, and that there is established usage of the term.

Although it disrespects the current usage of the term "partial 
balance" as explained above, I do prefer Peter's term "partial 
variance balance" in being more descriptive. We also face the issue 
of statistical versus mathematical terminology, and although one 
field may have laid claim to a particular term, in creating a 
resource for all researchers interested in block designs 
(mathematicians, statisticians, computer scientists, etc) we bear 
some responsibility to select terms that are understandable and 
acceptable to all (in so far as this is possible). Sometimes this 
requires modification of existing terminology. 

Having thought about Peter's comments and made responses above, I 
still support changing the current "partial_balance" indicator as 
previously suggested, which adopts the "atomic" principle (removing 
any mention of blocksize or replication, addressing only the 
association scheme) but would rename it something like 
"information_matrix_association_scheme". Add a parallel indicator 
"concurrence_matrix_association_scheme." The terms themselves are 
easily understandable by humans and give us all what we want. They 
allow thinking about this concept from the combinatorial and the 
statistical perspectives, so address our main audiences and the 
differences among them. Indeed, they make more clear the ways in 
which those two audiences think differently about this concept 
(though my ignorance is that I do not know if combinatorialists 
consider it at all for unequal blocksizes, etc). Forget about any 
concept of partial efficiency balance, which will not be of interest 
in our lifetime (efficiency balance in itself is a marginal concept). 
The implementation of the partial_balance_properties element should 
include an optional way to explore the concept of partial variance 
balance when the design is *not* binary, equireplicate, 
equiblocksize; this option would be omitted in the binary, 
equireplicate, equiblocksize case.
 


>I assume it will turn out to be the case that for equireplicate 
designs
>with constant block size, all three concepts will coincide and 
become the
>classical notion of partial balance.
>
>By the way, I take the "diagonal" to be one of the associate classes 
in an
>association scheme. This implies that a pairwise partially balanced 
design
>is automatically equireplicate. Is this agreed?
>

Do we really want to exclude nonbinarity? I hope not! Isn't it our 
plan to allow nonbinarity at some unspecified future time? With 
nonbinarity we need not have equireplication. The same is true for 
variance partial balance even for equal blocksizes.

JP





More information about the Developers mailing list