HAL Id: hal-02705097
https://hal.inrae.fr/hal-02705097
Submitted on 1 Jun 2020
HAL
is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire
HAL, estdestinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.
Principle of a fractional factorial design for qualitative and quantitative factors: application to the production of Bradyrhizobium japonicum in culture and inoculation
media
S. Cliquet, C. Durier, André Kobilinsky
To cite this version:
S. Cliquet, C. Durier, André Kobilinsky. Principle of a fractional factorial design for qualitative
and quantitative factors: application to the production of Bradyrhizobium japonicum in culture and
inoculation media. Agronomie, EDP Sciences, 1994, 14 (9), pp.569-587. �hal-02705097�
Plant breeding
Principle of a fractional factorial design for qualitative
and quantitative factors: application to the production
of Bradyrhizobium japonicum in culture
and inoculation media
S Cliquet C Durier A Kobilinsky
1
INRA,
laboratoire de microbiologie des sols, F21034 Dijon cedex;2
INRA,
laboratoire de biométrie, F78026 Versailles cedex, France(Received 13 August 1993; accepted 8 November 1994)
Summary —
To improve the production of Bradyrhizobium japonicum in liquid culture media, different carbon andnitrogen
substrates at different concentrations were tested. In order to study simultaneously these qualitative and quan- titative factors, a suitable experimentaldesign
was necessary. We develop here the principle leading to such fractional factorialdesigns.
The specificdesign
used allowed us to decrease the theoretical number of treatments from 1 024 to 128 and to get estimates of factorial main effects and 2-factor interactions. Five of the 7 tested factors, pH, carbon and organicnitrogen
sources, yeast extract concentration andorganic
nitrogen concentration, were found to have asignifi-
cant effect on optical density. They also were found to interact with each other. The design allowed us to select 2
media that produced more than 1010bacteria/ml.
Bradyrhizobium japonicum
/ carbon and nitrogen substrate / pH / factorial fractional design / microbial pro- ductionRésumé — Principe d’un plan factoriel fractionné
intégrant
des facteurs qualitatifs et quantitatifs. Applicationà la production d’un milieu de culture et d’inoculation pour Bradyrhizobium japonicum. Afin d’améliorer la pro- duction de Bradyrhizobium japonicum en milieu de culture liquide, nous avons comparé les performances de plusieurs
substrats carbonés et azotés à différentes concentrations. La prise en compte simultanée de ces facteurs qualitatifs et quantitatifs a nécessité l’utilisation de plans factoriels adaptés, dont nous développons ici le principe de construction.
Le plan factoriel fractionnaire retenu a permis de réduire le nombre de traitements étudiés de 1 024 à 128, tout en
autorisant l’estimation des effets principaux et des interactions entre 2 facteurs.
L’analyse
a montré que, sur les 7 fac- teurs testés, 5 d’entre eux : le pH, la source de carbone, la source d’azote organique, la dose d’extrait de levure et la dose d’azote organique, ont un effet principalsignificatif
sur la densité optique du milieu etprésentent également
desinteractions significatives. Le plan d’expérience a permis de sélectionner 2 milieux de culture produisant plus de 1010
germes/ml.
Bradyrhizobium
japonicum / substrat carboné et azoté/ pH / plan
factoriel fractionnaire/ production
microbienne
* Correspondence and reprints
† Present address: NCAUR-USDA, 1815 North University, Peoria, 61604 IL, USA.
INTRODUCTION
Soybean yield
in Francerequires
the inoculation of asymbiotic
bacterium,Bradyrhizobium japon- icum,
which fixesatmospheric nitrogen.
Inoculation is
usually performed
with sterilepeat containing
a culture of Bjaponicum,
and mixedwith
soybean
seeds onsowing.
Thistechnique
isexpensive
however(peat grinding, packaging,
and
sterilization)
andjudged
as dullby
farmers.Bacterial survival can also be
severely
reducedby
desiccation in the hoursfollowing
inoculation.A
high
bacterialdensity
is thusrequired during growth
andstorage.
Therefore it is necessary touse a culture medium that allows for a
high
bac-terial
density.
To achieve this, the carbon andnitrogen
substrates metabolizedby B japonicum
should be
carefully
tested and alarge
number ofculture media
experimented.
Several studies(Duménil
et al,1975;
Phan-Tan-Luuet al,
1979;Fannin
et al, 1981)
concerned fractional factorialdesigns taking
into account several factors simul-taneously
withpossible
interactions between them. Thesedesigns
involvedonly quantitative
factors and some were
applied
to theoptimiza-
tion of fermentation
(Deshayes,
1980; De Méo etal, 1985).
We introduce here the
principle
of fractionaldesigns
forqualitative
andquantitative
factors at2 and 4 levels. The
design
builtalong
theseprin- ciples
allowed us to decrease the theoretical number of treatments from 1 024 to 128 in a1/8th fractional
design.
It is also shown how anefficient 1/16th fractional
design,
with 64 unitsonly,
could have been obtained had the number of availableexperimental
units been morerestricted.
MATERIAL AND METHODS
Strain and culture medium
The strain of B japonicum G49
(Indian Agricultural
Research Institute
(IARI)
SB16, New-Delhi) used is the current strain for inoculation of soybean in France. The basal medium(Bergersen,
1961), modified from Sherwood(1972)
and Balatti and Mazza(1978)
con-tained
(per
I): NaCl, 0.1 g;MgSO 4 ·7H 2 O,
0.492 g;CaCl 2
·2H 2
O,
0.147 g;FeCl 3
at 28%, 30 μl.Nitrogen, carbon substrates and addition of a
growth factor (yeast extract) were the factors studied in the experimental
design.
The pH was adjusted to 6 or7 with
KH 2 PO 4
andK 2 HPO 4
at 10 mM. The mediacontaining carbon, nitrogen sources and yeast extract
were autoclaved at 120°C except glucose which was
sterilized by filtration.
Growth conditions
Because numerous runs were made, bacteria were
grown in tubes (18 by 180 mm) previously matched for
optical density
(OD)
readings. Tubes were placed on a rotary shaker (200 rpm) and incubated at 28°C for 6 d.In these conditions, bacterial production in flasks was
6 x 109bacteria/ml (standard deviation: 2 x
10 9 )
and ofthe same order of magnitude as microbial production
in tubes (5 x 109 bacteria/ml; standard deviation: 3 x
10 9 ).
Analysis
The optical density was read at 600 nm after growth.
On some cultures, viable B japonicum were counted by plate dilution on Bergersen medium
(Bergersen,
1961) using theSpiral
system.Choice of factors and their levels
The choice of factors and their levels was determined
by preliminary trials
(Cliquet,
1990) and from biblio-graphic data. Four carbon sources were selected: glu-
cose,
gluconate,
mannitol and glycerol (Burton, 1967;Elkan and Kwik, 1968; Keele et al, 1969, 1970). It is
believed that bacterial
growth
is not limitedby
the car-bon dose, so
studying
it at 2 levels seemed to be suffi- cient. On the other hand, yeast extract, an importantgrowth
factor, might show toxicity at high concentra-tions (Date, 1972; Sherwood, 1972).
Yeast extract concentration therefore had to be at least a 3-level factor to detect the curvature implied by
its toxic effect. Construction of fractional designs is
however easier for mixtures of 2-level and 4-level fac- tors than for 2-level and 3-level factors (Kobilinsky and Monod, 1991). Four doses were therefore chosen, 1, 2, 3 and 4
g/I
for yeast extract, and 0.1, 0.2, 0.3 and 0.4g/l
for organic nitrogen dose, as either Nagluta-
mate or casein
hydrolysate (Elkan
and Kwik, 1968;Chakrabarti et al, 1981).
As simultaneous mineral and organic
nitrogen
might also benefit growth(Elkan
and Kwik,1968), NH 4 Cl
was sometimes added. Finally, initial pH was adjusted
to 6 or 7, the usual range for this type of culture
(Jordan, 1982).
There were thus 7 factors in this factorial
design,
the levels of which are given in table I. The number of combinations of levels of the 7 factors was 4 x 4 x 4 x
2 x 2 x 2 x 2, that is 1 024 combinations also called the treatments.
Only
a fraction of these, made up of 128 treatments, was tested. The method we used to choose the fraction will be explained, using a plain 2-factor example.
PRINCIPLE OF THE EXPERIMENTAL DESIGN
Quantitative factor case:
the
polynomial
effectsConsider the case in which 2
quantitative
factorsare to be studied:
Factor Number of levels Coded levels
The
quantity
Ymeasuring
the microbialgrowth
is assumed to follow the model:
In this
context, Y
is often called theexperimental
response or
just
the response. It differs fromf(A,B),
the theoretical response(Phan-Tan-Luu
etal, 1983),
because of the error ϵ.Quantities called
polynomial
factorialeffects,
or more
simply polynomial effects,
which charac- terize the influence of the various factors andtheir interactions on the response, are deduced from the theoretical response.
More
precisely,
these effects are defined from the values of the theoretical response f on thegrid
of 4 x 2points
associated with the different combinations of levels of the factors A and B.Thus,
the mean factorial effect stands for themean of this set of 8 values. It is denoted
by e(1).
Consider a
given factor,
say A. For each of its levels, we cancompute
thecorresponding
meanof the values of
f,
ie in this case the mean overthe 2 levels of B. If A is used to denote the level
(as
well as thefactor),
the associated mean is denotedby f(A, ·):
The
slope
of the bestfitting straight
line between these means and the factor A is called the linear factorial effect of factor A and denotede(linA) (John, 1971).
The coefficient of A2 in the best fit-ting parabola (fig 1)
is called thequadratic
factori-al effect and denoted
e(quadA) (John, 1971).
Finally,
if the curverepresenting
the theoreticalmean response
f(A, ·)
as a function of A exhibitsan inflection
point,
one has to consider the cubiceffect
e(cubA),
defined as the coefficient of A3 in the cubic that best fits the 4points
considered onthat curve
(fig 1).
Since factor B has
only
2levels,
theonly poly-
nomial effect of B which can be considered is the linear effect
e(linB).
Tostudy
the combined influ-ence of the 2 factors A and
B,
interaction terms denotedby e(linA ·linB), e(quadA ·linB)
ande(cubA.linB)
are introduced. With each interac- tion term is associated apolynomial degree
which is defined as the sum of the
degrees
ineach effect. The
degree
is 2 fore(linA ·linB),
3 fore(quadA
·linB),
and 4 fore(cubA ·linB).
To
define,
forinstance, e(linA ·linB),
one looksfirst at the variation of the theoretical response in function of A for each level of B and
deduces,
aspreviously,
a linear effect of A for this level ofB,
denotedby e(linA(B)) (fig 2).
The variation of this effect as a function of B thenyields e(linA.linB),
which is here the
slope
of thestraight
linethrough
thepoints
associated with B = -1 and B = +1 infigure
3. The effects obtained in this way aregenerally
normalizedby multiplication by
an
appropriate
constant in order to make themcomparable
with each other.There is a very
simple
way ofobtaining
thesenormalized effects from the values of the theoret- ical response f on the 8
points
of thegrid.
Oneintroduces the
orthogonal polynomials
in A and Bdenoted
by
mean,linA, quadA,
cubA and linB whoseexpressions
as functions of A and B aregiven
in theAppendix.
Theseorthogonal polyno-
mials,
which havelong
since been used tosimpli- fy
thecomputations
inpolynomial regressions,
are defined and tabulated in Fisher and Yates
(1957)
and Pearson andHartley (1976). Although
their interest for calculation has now
disap- peared,
their use is stillhighly
recommended in thegeneral
context ofpolynomial regression
toobtain
meaningful parameters
which are the least correlatedpossible (Durier
andKobilinsky,
1989;Kobilinsky, 1989).
The values of the
orthogonal polynomials
andtheir
products linA ·linB,
..., cubA·linB calculatedon the 8
grid points
aregiven
in table II. Theseare the coefficients to
apply
to the 8 responsesf(-3, -1),
...,f(3, 1)
to obtain thecorresponding
effects
(up
to normalization coefficientsgiven
inthe last
row).
For instance:The normalization coefficients are chosen in order to obtain the same variance for all the columns in table II.
The
polynomial
effects are defined from the theoretical responses on the 8 initialgrid points only.
Tointerpolate
between thesepoints,
f isgenerally
assumed to have apolynomial form,
forinstance,
apolynomial
ofdegree
2 like:This function can be written as a function of the
orthogonal polynomials:
The absence of B2 and
quadB
indicates that B has noquadratic effect,
anassumption
which iscoherent with the choice of
only
2 levels for B.The coefficients α0, α1, α2,
β 1 ,
γ1appearing
inthe latter form are then
precisely
the effectse(1), e(linA), e(quadA), e(linB),
ande(linA ·linB).
Theother effects
e(cubA), e(quadA ·linB), e(cubA
·linB),
associated with terms that areexcluded from the
model,
are assumed to bezero.
Since the number of
parameters
in the model for f is less than the number ofpoints
in thegrid,
the
polynomial
effects can be estimated from afraction of the 8
points
instead of the whole set.This fraction must have at least as many
points
as the number of
parameters
in themodel,
if all effects are to be estimable. It must also be care-fully
chosen to make the estimation of effects as accurate aspossible.
Theplain technique
described thereafter leads to very efficient frac- tions for situations
involving
a mixture of 2- or 4- levelqualitative
andquantitative
factors.Decomposition
inpseudofactors
of a 4-level factor.
Definition
of(pseudo)factorial
effectsFractional factorial
design
for 2-level factors arewell known
(Finney, 1945). They
can beeasily adapted
to mixtures of 2-level and 4-levelfactors, by decomposing
each 4-level factor into two 2- levelpseudofactors.
Moreover if a 4-level factor isquantitative,
anappropriate decomposition
leads to
simple
relations between the factorial effects associated with thepseudofactors
and thepolynomial effects,
which make sense in thatcase. These
relationships
can be used to select afraction well suited to the
polynomial
formassumed for the theoretical response.
In the
example,
the 4-level factor A is decom-posed
into two 2-levelpseudofactors A 1
andA 2 .
The
correspondence
between thequantitative
levels of A and the levels of the
pseudofactors A 1
and
A 2
isgiven
in table III. In this table we have alsoreported
the levels of theproduct pseudo-
factor
A 1 A 2 ,
which are theproducts
of the levels ofA 1
andA 2 ,
and thecorresponding
values ofthe
orthogonal polynomials
ofdegree
1, 2 and 3 in A:linA, quadA
and cubA.The
pseudofactors A 1 , A 2
andA 1 A 2
have nospecial meaning
and areonly
used as tools tobuild the
design.
The theoretical responsef(A,B)
can be
expressed
as a functiong(A 1 ,A 2 ,B),
of thelevels of the
pseudofactors A 1 A 2
and the factorB. The factorial effects
(also
calledpseudofactori-
al
effects)
ofA 1 , A 2 ,
B and their interactions canbe obtained from table IV in the same way as the
previously
introducedpolynomial
effects.For
instance,
the factorial effect ofA 1 ,
denotedby e(A 1 )
is defined as the half difference between the means of g forA 1
= 1g(1, ·, ·),
and forA 1
=- 1, g(-1,
·,
·):
Similarly,
the effect of interaction betweenA 1
andA
2
,
denotedby e(A 1 A 2 )
is definedby:
Relationship
betweenpolynomial
factorsand
pseudofactors
and inducedrelationships
between effects
It is
easily
checked in table III that thefollowing relationships
between thepolynomial
factors linA,quadA,
cubA on one side thepseudofactors A
1 ,A
2
andA 1 A 2
on the other exist:They
induce thefollowing
similar relations betweeneffects,
for instanceand
they
areeasily
extended topseudofactorial
effects built as
products,
for instanceQualitative four-level factors
Each
qualitative
factor A is alsodecomposed
into2
pseudofactors A 1
andA 2
but in that case thecorrespondence
between levels is not relevant and can bearbitrarily
chosen. For instance if a1, a2
, a3 and a4 are the 4 levels of A, a
possible
choice is:
This choice induces a
decomposition
of themain effect of A into 3 effects
e(A 1 ), e(A 2 )
ande(A
1 A 2
)
andsimilarly
adecomposition
of theinteraction between A and B into 3 effects
e(A 1
B), e(A 2 B)
ande(A 1 A 2 B).
Construction and
properties
of a fractional
design
The columns
A 1 , A 2
and B of table IVgives
the 8treatments of a
complete
factorialdesign.
Suppose
we canonly
afford toexperiment
4 ofthem. A
simple
way to select them is to chose thosesatisfying
theequation A 1 A 2 B
= 1, which iscalled a
defining
relation(Box et al, 1978).
The fractional
design
thusdefined,
which is ahalf
fraction,
includes the treatments 2, 3, 5 and 8 of table IV which aresingled
out in table V.Some
elementary
results on matrices are usedin the
following development. They
are recalledin the
Appendix for
the use of the reader.The
general
meaneffect,
denotedby e(1),
the main effects of factors
A 1 , A 2
and B andtheir interaction effects are defined from the the- oretical responses on the
complete
factorialdesign by equalities
similar to(E2)
and(E1).
IfX’ is the
transpose
of matrix Xappearing
intable
IV,
these definitions can bepresented
inmatrix form as follows:
Let e and g be the vectors
appearing
on theleft and the
right
of this matricialexpression respectively,
that is the vector ofpseudofactorial
effects and the vector of theoretical responses.
We can write more
concisely:
It is easy to check that XX’ = 8I where I is the
identity
matrix.Premultiplication by
of the 2terms of the above
equality
thenyields
theequality:
which expresses the theoretical responses as functions of the
pseudofactorial
effects and canbe written in the
developed
form asThe
only
responses estimable from the 4points
of the fractional
design
are:The 4
corresponding equations
can be written in thefollowing
manner:From the estimations of the 4 responses, it is
impossible
toget separate
estimates of the 2 fac- torial effects in 1 of the sums on theright,
likee(1)
+e(A 1 A 2 B),
ore(B)
+e(A 1 A 2 ).
If one of themwas increased and the other decreased
by
thesame amount and the observations are not affected. The 2 effects thus associated are said to be confounded or aliased.
The
confounding
of effects is due to theequal- ity
of thecorresponding
columns in table V:These
equalities
can bedirectly
obtainedby multiplying
thedefining relationship
1 =A 1 A 2 B
ofthe fraction
successively by B, A 1 , A 2 ,
since B2 =1 for each of its levels, 1 or -1, and
similarly
A 1 2
=A 2 2
=1.
The
confounding
of effects renders the inter-pretation
of the statisticalanalysis
verydifficult,
unless 1 of the 2 confounded effects isnegligible.
Thus if
e(B)
+e(A 1 A 2 )
is foundsignificantly greater
than 0, it isimpossible
to determine with- out further information which of the 2 effects isgreater
than 0. However when A isquantitative, e(A
1 A 2
)
=e(quadA).
Sincequadratic
effects canoften be assumed to be far smaller than linear ones, the sum
e(B)
+e(A 1 A 2 )
≈e(B)
can often beassimilated to the linear effect of B.
The
unambiguous interpretation
of all 4 sumsof aliased effects in this
simple example requires
the
strong assumption
that the 3degrees
of free-dom of the interaction between A and
B,
iee(A 1 A 2
B), e(A 1 B),
and ≈e(A 2 B),
arenegligible
aswell as the
quadratic
effect of A.The same method of fractionation leads to more
interesting
fractions when the number of factors islarger.
It is thenpossible
to confoundimportant
effects with interactions of 3 factors or more, or with effects ofdegree greater
than orequal
to 3. It is thenquite
realistic to assumethat these last effects are
negligible
and thisentails that all
important
effects can be well esti- mated.Complementary
half-fraction. Fraction 1/4 It ispossible
in theexample
to select the other half of thedesign
definedby A 1 A 2 B
= -1. Theconfounded effects are the same but it is their dif- ference which is estimated instead of their sum.
For instance B =
-A 1 A 2 ,
and so it is the differ-ence
e(B) - e(A 1 A 2 ),
which can be estimated instead of the sume(B)
+e(A 1 A 2 ).
If a further fraction of the half-fraction defined
by A 1 A 2 B
= 1 isrequired,
apossible
choice is toselect the 2 treatments, out of the 4 of the half-
fraction,
whichsatisfy:
This
gives
the 1/4 fraction of thecomplete
fac-torial
design
definedby
the 2defining
relationsA
1 A 2
B
= 1 andA 1 A 2
= 1.Making
theproduct
ofthese 2
defining
relationgives:
Thus on this 1/4 fraction
A 1 A 2 B
=A 1 A 2
= B =1.
Multiplying
theseequalities by A 1
thengives A
2
B= A 2
=A 1 B
=A 1 .
Theseequalities imply, exactly
as in the halffraction,
that the corre-sponding
effects are aliased. Moreprecisely
from the two responses it is
only possible
to esti-mate the 2 sums
e(A 1 A 2 B)
+e(A 1 A 2 )
+e(B)
+e(1)
ande(A 2 B)
+e(A 2 )
+e(A 1 B)
+e(A 1 ).
Thisexample,
in which the factor B remains constant, hasclearly
nopractical
interest. But a similar kind of fractionationgoing beyond
the half frac-tion is very
helpful
when the factors are morenumerous.
GENERALIZATION: THE 1/8 FRACTION FOR IMPROVING THE CULTURE OF B JAPONICUM
Construction
We now
present
the 1/8th fractionaldesign,
which was obtained
by
thepreviously
describedmethod for the 7 factors in table I.
Two
pseudofactors A 1
andA 2
are associatedwith the factor carbon source
(denoted by A),
with the
following arbitrarily
chosen correspon- dence between levels.The
organic nitrogen
concentration(B)
andyeast
extract concentration(C)
are also decom-posed
into two-levelpseudofactors B 1
andB 2
onone
side, C 1 , C 2
on the other. These factors arequantitative,
and so thecorrespondence
betweenlevels is chosen as indicated in table III for the factor A of the
previous example.
Thedefining
relations are:
The reasons for this choice will appear more
clearly
in thefollowing
sections.Premultiplications by E,
F, Grespectively give:
To obtain the
design,
the 128 combinations of levels of the 7pseudofactors A 1 B 1 B 2 C 1 C 2
Dare first written down and for each of them the levels of
E, F,
G are deducedusing
the termsunderlined in the above
equalities.
Multiplying
the 3defining
relations between theseyields:
The effects associated with the 8
equal prod-
ucts in
[E4]
are confounded.They
are said to beconfounded with the
general
meane(1).
Consider now any
product
which does not appear in[E4],
for instanceA 1 B 1 B 2 . Multiplying
itby
theequalities
in[E4] gives:
The
corresponding
effectse(C 1 C 2 E)...
e(A 1 A 2 B 1 B 2 C 2
EFG)
are the effects confounded withe(A 1 B 1 B 2 ).
The choice of thedefining
rela-tionships [E4]
relies on therespective importance
of the
polynomial effects,
which we now consider.Classification
of polynomial
effectsThe
meaningful
effects are thepolynomial
effectssuch as
e(A 1 ·quadB), e(A 1 A 2 ·linB)
ande(linB
·linC
·D),
which are defined andcomputed
as
explained
in thesimpler example
in theprevi-
ous section. Two criteria may be used to
classify
these effects: the number of factors
appearing
inthe effects and their
degrees.
Thedegree
is thesum of the
degrees
in eachfactor;
thedegree
attributed to
qualitative
factors isalways
1. These2 criteria are
given
below for the 3polynomial
effects
just
mentioned:The
importance
of an effectdepends
on itsdegree
and on the number of factors in it. Themore
important
effects are those ofdegree
1:e(A 1
), e(A 2 ), e(A 1 A 2 ), e(linB), e(linC), e(D), e(linE), e(linF)
ande(linG).
The effects ofhigh degrees
likee(A 1 ·quadB ·quadC)
ande(A
1
·D·linE·linF·linG),
which are ofdegree
5, aregenerally
assumed to benegligible.
For effects of the samedegree,
theimportance generally
decreases with the number of factors involved.
For instance
e(cubB)
is considered moreimpor-
tant than
e(linB·linC·D).
Rationale for
choosing
thedefining relationships and properties
of thedesign
The above classification can be used in
conjunc-
tion with the
relationships [E3]
betweenpseudo-
and
polynomial
factors to chooseappropriate defining
relations. There are 2possible approaches.
Theapproach
described in this sec-tion
(Bailey,
1982;Kobilinsky, 1985)
was used toobtain the
defining
relations of this 1/8th fraction.It
only
relies on the presence or absence of each term in therelationships linking pseudofactorial
and
polynomial
effects.Pseudofactorial effects are
categorized
asnegligible
or notaccording
to whether theirexpressions
in function of thepolynomial
effectsinvolve
negligible
termsonly.
Thedefining
rela-tionships
are chosen in order to estimate all non-negligible pseudofactorial effects, by only
con-founding
them withnegligible
ones. Forinstance,
assume that the
negligible polynomial
effects arethose of
degree
3 or more. Theestimability
isthen
required
for the 4pseudofactorial
effectse(B
1 C 1
), e(B 1 C 2 ), e(B 2 C 1 )
ande(B 2 C 2 ) carrying
information on
e(linA·linB).
These effects can be confounded with anynegligible
effect such ase(DEF), e(B 1 C 1 C 2 ) = (2e(linB·quadC) - e(cubB·quadC))/√5...
The
importance
ofpolynomial
effectsdepends
on their
degree.
Thissuggests defining
thedegree
of apseudofactorial
effect as the mini-mum
degree
of thepolynomial
effects as a func-tion of which it can be
expressed.
Forinstance,
The
degree
of apseudofactorial
effect is in factthe sum of the
degrees
of each factor in it. It canbe obtained here very
quickly
as the number of letters different from Aplus
1 if it includes the let- ter A(once
ortwice).
The
pseudofactorial
effects can thus be classi-fied as the
polynomial
effects as a function of theirdegree
and of the number of factorsthey
contain.If
polynomial
effects ofdegree greater
than orequal
to n andinvolving
more than m factors areneglected,
then so are thepseudofactorial
effectsof
degree greater
orequal
to ninvolving
morethan m factors. It is therefore very
simple
to findthe
negligible pseudofactorial
effects and to checkthe
admissibility
of a set ofdefining relationships.
For instance:
The effects confounded with
e(B 1 C 1 )
ande(B
1 C 2
), e(B 2 C 1 )
ande(B 2 C 2 ),
obtainedby
multi-plying
theequalities [E4] by
thecorresponding effects,
are interactions between three factors at least. If these interactions are assumed to be zero, these effects are estimable and so ise(linB·linC).
e(quadB)
=e(B 1 B 2 )
is confounded with inter- actions of 4 factors or more and with 1 interactionA 1 C 1 C 2
E,
involveonly
3 factors(of degree 4)
that can be
safely neglected.
More
generally,
it can be noticed that all thepseudofactorial
effects confounded with the gen- eral mean, that isappearing
in[E4],
are ofdegree greater
orequal
to 5(the design
is said tobe of
polynomial
resolution5).
Thisimplies
that:(i)
the factorial effects ofdegree
1 are con-founded with effects of
degree
at least 4; and(ii)
the factorial effects of
degree
2 are confoundedwith effects of
degree
at least 3.Moreover,
the main effects of anydegree
areconfounded with interactions of at least 3 factors and at least
degree
4, and interactions of 2 factors anddegree
2 are confoundedonly
with interactions of 3 factorsexcept e(A 1 ·linE),
confounded with a 2- factor interactione(quadB·quadC). However,
the latter is ofdegree
4 and can be assumed to benegligible.
Interactions of 2 factors anddegree
3are also confounded with interactions of at least 3
factors, apart
fromA 1 B 1 B 2 , C 1 C 2 E, A 1 C 1 C 2
andB
1 B 2
E
which are confoundedby pairs.
The final
properties
come from the fact that theeffects confounded with the mean are interac- tions of 5 factors
except A 1 B 1 B 2 C 1 C 2
E in whichappear
only
4 factors. The smallest number of factors in the effects confounded with thegeneral
mean is the resolution of the
design.
The resolu- tion here isonly
4. It would have been better ofcourse to use a resolution 5
design
but this reso-lution cannot be obtained
by
the fractionationtechniques
considered in this section.Another
approach (Edmondson, 1991)
goes 1step
furtherby considering
therespective impor-
tance of the coefficients in the relations
[E3]
link-ing pseudofactorial
andpolynomial
effects. Wewill show below how this second
approach
canstill
produce
an efficient 1/16th fraction with 64 units instead of 128.REPETITIONS
In order to
get
therepeatability
of results for fixedculture
media,
a subfraction of 16 treatments wasrepeated.
This subfraction was obtainedby adding
thefollowing defining relationships:
These 16
points
made up a resolution 3 frac- tion of thecomplete
factorialdesign
in which B and C were studied at 2 levelsonly (0.2-0.4 g/l
for
B,
2-4g/l
forC).
Viable cells were counted forthe 2
repetitions
of these 16repeated
treatmentsin order to
study
therelationship
between theoptical density (OD)
and the number of viable bacteria.STATISTICAL ANALYSIS
The
polynomial
model used for theanalysis
included all main
effects,
all 2-factor interactions ofdegree
2, somedegree
3 estimable effects andfinally
1 term out of eachpair {e(A 1 B 1 B 2 );
e(C 1 C 2
E)}, {e(A 1 C 1 C 2 ); e(B 1 B 2 E)}
of confounded 2-factor interactions. Note that these last terms stand for thecorresponding
sums of confounded effects.OD = e(1) + e(A1) A1+ e(A2) A2+ e(A1A2) A1A2
+ e(linB) linB + e(quadB) quadB + e(cubB) cubB
Main
effects+ e(linC) linC + e(quadC) quadC + e(cubC) cubC
+ e(D) D + e(linE) linE + e(linF) linF + e(linG)
linG
+ e(A1·linB) A1·linB + ... + e(A1·linG) A1·linG Two-factor inter-
+ e(A2·linB) A2·linB + ... + e(A2·linG) A2·linG actions involving A
+ e(A1A2·linB) A1A2·linB + ... + e(A1A2·linG) A1A2·linG
+ e(linB·linC) linB·linC + e(linB·D) + ... + (linF·linG) linF·linG
linear·linear interactions
+ estimable or confounded degree 3 effects
The
polynomial effects,
the coefficients of this model, were estimatedby least-squares
approx- imation. From the standard deviations of theirestimates,
confidence intervals at different lev- els of confidence werededuced, using
the t-Student distribution. In this model we selected all the terms that were
significantly
differentfrom 0, ie
having
a confidence interval not cov-ering
0. We added the terms ofdegree
smalleror
equal
in each factor to them. For instance, ife(A1·quadB)
wassignificant,
the termslinB, quadB, A 1
andA 1 ·linB
were introduced in addi- tion toe(A 1 ·quadB) .
The confidence level cho-sen was
equal
to 99% to avoidretaining
a lot ofterms that were
negligible
withrespect
to themost
important
effects in the model. The sub-model thus obtained could be used, if it did not include any of the confounded
effects,
topredict
the OD for any values of the factors within the
experimental
domain and inparticular
for eachof the 1 024 treatments of the
complete
factorialdesign.
1/16th FRACTION
If
polynomial
effects ofdegree strictly greater
than 2 are assumed zero, it is
possible
to frac-tionate one
step
furtherusing
Edmondson’s method(1991, 1993),
thusgetting
a 1/16 fractionwith
only
64experimental
units.Possible
defining
relations for this 1/16 fractionare:
These
defining
relations were obtainedby
anautomatic search program Planor
(Kobilinsky, 1994).
In itssimpler form,
theinput
for this pro- gram is apseudofactorial
model and the terms which are to be estimated in it. It is alsopossible
to introduce several
pseudofactorial
models andthe
corresponding
families of terms to be esti- mated. For thissearch,
werequired:
i)
all the main effects to be estimable in a modelincluding
all main effects and 2-factorinteractions;
ii)
all 2-factorinteractions,
notincluding B 2
orC 2 ,
to be estimable in a model
including
the maineffects and these interaction terms.
The
symbolic expression
of theserequire-
ments, asthey
appear in the program, was:Note that a
product
like q·qgives,
afterreplacement
of qby
itsexpression, development
and omission of redundant terms, a model includ-
ing
all main effects and 2-factor interactions.Requirement (i)
thus means that the searcheddesign
is a resolution 4design,
andrequirement (ii)
is anempirical
choicejustified by
the nature ofthe
relationships
betweenpseudofactorial
andpolynomial
effects. This is betterexplained
at theend of this section.
It was
possible,
in that case, to obtain all the sets ofdefining
relationsmeeting
therequire-
ments. There were 1 152 such sets,
which,
afterinvestigation,
were found to be derived from 6 basic onesby permuting
the factors with sym- metricroles,
that isD,E,F
and G first then B and C andfinally
the 3pseudo-factors A 1 , A 2
andA
1 A 2 .
Table VI
gives
these setsand,
for each ofthem,
3global efficiency
criteria. Theseglobal
criteria are defined
by comparison
between the normalized information matrix X’X/N of thedesign
andX’ 0 X 0 /N 0
of thecomplete
factorialdesign (N =
64,No
= 1024).
It is convenient to choose theparametrization
so that the latter isthe
identity: X/ 0 X 0 /N 0
= I. Then, ifλ i ,
i =1,..., 45
are the
eigenvalues
of X’X/N.The column No of
permutations
in table VIgives
the number of other sets obtained
by permuting
the factors with
symmetric
roles. The best set forthe trace and determinant criteria is the one pro-
posed
at thebeginning
of this section. We nowstudy
it morethoroughly by
the same method asabove.
Multiplying
the 4defining
relationsby
eachother
gives
thefollowing
16 whichcorrespond
toall the
pseudofactorial
effects confounded with thegeneral
mean.Multiplication
of the above termsby
anotherone, say