ext-rep file naming
Peter Dobcsanyi
p.dobcsanyi at designtheory.org
Fri May 7 14:49:47 BST 2004
Dear All,
We are going to have hundreds of ext-rep files floating around in our
system. To make this situation manageable I recommend the introduction
of an "ext-rep file naming convention". The purpose of such a scheme is
twofold. On one hand, it makes the system administration and the
organization of data processing (including external contributions) much
easier. On the other hand, it can serve as a base for a simple but
nevertheless useful web presentation method. This means that we can put
our design collections online within a few days and long before the
"real" design database actually comes alive.
My proposal for ext-rep file naming follows. Before the details, let's
start with an example. An ext-rep file file which contains the Fano
plane can be named:
t2-v7-b7-r3-k3-L1.icga.xml.gz
An ext-rep file's name has the following structure:
parameter_section[.content_indicators].type[.compression_type]
parameter_section:
Describe the parameters of the designs in the given file over
the parameter domain of t,v,b,r,k,L, where L stands for lambda.
The format is xN-... where x is one of the parameter's symbol
and N is the constant value of the parameter within the file. If
some of the parameters are not relevant to the given list of
designs and/or are not constant then they are left out.
content_indicators: (optional)
One letter symbols to indicate which of the subtrees of a
<block_design> tree are present for each block design in the
file. The possible values:
i <indicators>
c <combinatorial_properties>
g <automorphism_group>
r <resolutions>
s <statistical_properties>
It is not required that any of the indicated subtrees is fully
expanded.
Finally there is a special indicator:
a indicates that all pairwise non-isomorphic designs
of the given parameters are present in the file.
type:
xml for XML ext-rep
sex for Lisp S-expressions
compression_type: (optional)
gz gzipped file
bz2
As you know each design should have a ID. This ID is created from the
"parameter_section" of the file name by appending '-N' where N is a
serial number within the file starting 0. (Note, we use 0-based indexing
elsewhere in the ext-rep.)
For the time being, we should arrange our design collection (i.e.
ext-rep files) in a simple and logical directory structure, like:
designs/
t-designs/
...
Some remarks are due to place the proposed naming into the correct
perspective:
- This naming scheme is not a replacement for the database and it
cannot (should not) provide the same level of consistency. In
particular, the IDs won't be necessarily unique.
- There can be overlaps between files, that is OK.
- Although it is human readable, the more important design aim
behind it was to be able to parse/process/search this system by
simple automatic tools.
Please comment,
-- ,
Peter Dobcsanyi
More information about the Developers
mailing list