• Aucun résultat trouvé

Experimental design: The joys and caveats of setting up a proper study framework

N/A
N/A
Protected

Academic year: 2022

Partager "Experimental design: The joys and caveats of setting up a proper study framework"

Copied!
10
0
0

Texte intégral

(1)

Article

Reference

Experimental design: The joys and caveats of setting up a proper study framework

WEINKAUF, Manuel

WEINKAUF, Manuel. Experimental design: The joys and caveats of setting up a proper study framework. Newsletter of Micropalaeontology, 2018, vol. 98, p. 57-65

Available at:

http://archive-ouverte.unige.ch/unige:109020

Disclaimer: layout of this document may differ from the published version.

(2)

Experimental design: The joys and caveats of setting up a proper study framework

Manuel F. G. Weinkauf1

1Department of Earth Sciences, Université de Genève, Geneva, Switzerland

Experimental design is an important, but often neglected topic in micropa- laeontology. Fortunately, quite a lot of literature exists to introduce this topic properly to natural scientists (Federer 1955,Hurlbert 1984,Quinn and Keough 2002, Dytham 2011).

But why is experimental design in all its facets important? For starters, having a proper experimental design makes all ensuing quantitative ana- lyses much more easy, because one does not have to tweak statistical methods for an ill-chosen or com- plicated setup that introduces unne- cessary random terms and bias into the analyses. But the problem does not end there, because an inappro- priate experimental design can lead to wrong results, incomparable stud- ies, experiments that cannot be rep- licated, and many other problems. In

this article, I will try to share the very basics of experimental design and show some of the problems and pitfalls that come with it.

Mensurative vs. manipulative experiments

When thinking about experiments, many people tend to think only of manipulative experiments, i.e. exper- iments where something (e.g. a pop- ulation of benthic Foraminifera) is manipulated in the laboratory (e.g.

cultured in water with different sa- linities). I believe this is part of the reason why many palaeontolo- gists tend to think that experimen- tal design need not concern them, since they only work with mater- ial sampled in the field without act- ively manipulating anything. Un-

(3)

TMS Newsletter vol. 98 Methods in micropalaeontology

fortunately, this is wrong (Hurlbert 1984)! We indeed distinguish two types of experiments, and experi- mental design is important for both of them.

The first type, as already men- tioned, are manipulative experi- ments. Those are all experiments which are performed by actively ma- nipulating some of the variables in- volved, so that one has an actively chosen treatment. Often, manip- ulative experiments take place in laboratories, where organisms can be cultured in several vessels that are exposed to different environmental parameters (e.g. culturing popula- tions at a range of different temperat- ures, salinities, light levels, etc.) (see Marschner et al. 2012,Schmidt et al.

2014, for examples of well-designed manipulative experiments). They can, however, also be performed by manipulating populations directlyin situ(e.g.Nomaki et al. 2005).

Mensurative experiments, on the other hand, are experiments where samples are taken in nature across space and/or time and then analysed for similarities and differences. The

‘treatment’ is space or time, i.e. it is known that some environmental parameters changed in a spatial or

temporal pattern, and assemblages or populations are investigated con- cerning their reaction to that change.

This type of experiment is what the majority of micropalaeontological analyses entails (e.g.Aldridge et al.

2012,Weinkauf et al. 2014,Knap- pertsbusch 2016). Mensurative ex- periments are a bit more complic- ated to analyse, since they do not have a control treatment in the strict sense (as manipulative experiments do). They also have a larger error as- sociated with the treatment variable because it is only measured (often with a non-negligible error due to temporal or spatial integration) in- stead of actively chosen by the ex- perimenter. Therefore, although this is often forgotten, mensurative ex- periments often call for more robust statistical analyses than manipulat- ive experiments to be meaningfully analysed (Dytham 2011).

Sampling schemes

The sampling scheme is the most in- tegral part of any design, because it provides the raw material for manip- ulative experiments and comprises the entire dataset of mensurative ex- periments. In order not to make

(4)

things overly complicated, I will present the three most prevalent end- members here, which are equally useful for mensurative and manipu- lative experiments and for sampling in space (area) and time (sediment column) (Fig. 12).

Random sampling scheme

The random sampling scheme is the easiest and on first glance the most robust, because it takes samples at completely random locations. When the environment is very homogen- eous, for instance a sediment column that comprises only one lithology or a mudflat area without changes in environmental parameters or sedi- mentology, this is the easiest design to follow. Sampling locations are depicted by a random generator and are thus ensured to not be biased by any pre-existing assumptions of the experimenter.

Stratified random sampling scheme

It can happen that previous know- ledge of the sediment column or area to be sampled requires to modify the sampling scheme. For instance, one can imagine a sediment column

that mainly consists of limestones but has thin intercalations of sand- stone, or a mudflat that is crossed by a river which delivers coarser sedi- ment and more nutrients to the area in its vicinity. In both cases there is a dominating environment (lime- stone/mudflat) and a rare environ- ment (sandstone/river area). A com- pletely random design runs the risk of by chance only or disproportion- ately often sampling the dominat- ing environment and neglecting the rare environment. This can give a biased picture of the development of e.g. the microfossil community over time or space, because it can be assumed that the under-represented rare environment has a different as- semblage than the dominating envir- onment. The solution for this is the stratified random sampling. Here, it is made sure that each stratum (e.g.

dominating environment and rare en- vironment) is represented with the same number of samples but samples within each stratum are still distrib- uted randomly in space and time.

Systematic sampling scheme Systematic sampling is most often applied in micropalaeontology, be- cause it is easily designed and fol-

(5)

TMS Newsletter vol. 98 Methods in micropalaeontology

Limestone Sandstone

1 2 3 4

metres

Random Stratified random Systematic

Figure 12: Examples of three prevalent sampling schemes, shown on the example of sampling a geological sediment column. Random sampling: For a pure limestone as- semblage, where the environment was stable over the entire time, three randomly dis- tributed samples (black triangles) may be enough for analyses and do not bias the results.

Stratified random sampling:When there are intercalations of sandstone in the limestone, the three original random samples (black triangles) would by chance all fall in the limestone facies, obscuring the short-term environmental changes. A stratified random sampling (black and red triangles combined) with two random samples per stratum gives a bet- ter picture of the true development over time by sampling equally in both lithologies.

Systematic sampling:A systematic sampling every 1 m (black triangles) could lead to the conclusion that the sediment column represents only one environmental gradient, from sandy to calcareous, because each sample is a little bit more calcareous than the one before. In reality, there are three limestone–sandstone cycles, which are obscured because the sampling interval (1 m) shows interference with a naturally occurring pattern (c.1.25 m- cyclicity). A random sampling design (red triangles) would give a less biased picture here, but a systematic sampling with a much finer resolution may be the best choice to fully understand this sediment column.

(6)

lowed in the field. In systematic sampling, samples are taken in pre- defined distances to each other (e.g.

every 1 m along a sediment column or every 1 by 1 m across a mud- flat). Under many circumstances, this scheme is not problematic and eliminates bias by the experimenter as effectively as the random schemes.

However, the systematic sampling can lead to a large bias when the chosen sample distance interferes with a naturally underlying pattern.

In Fig. 12, the sampling interval of 1 m would create as set of samples which show gradually more cal- careous components and fewer sand content. This is because there is a real cyclicity with three limestone–

sandstone cycles ofc.1.25 m thick- ness. In effect, each of the three sys- tematic samples comes from another cycle, but because the cycles are a little bit thicker than the sampling in- terval, each sample is located a little bit further down in its cycle than the previous sample was in the previous cycle. In absence of further know- ledge, only based on these three sys- tematic samples, one may interpret the sediment column as one large sandstone–limestone trend (e.g. due to slow subsidence), when in effect

it is composed of three limestone–

sandstone cycles (e.g. indicating in- tervals of slow uplift phases termin- ated by rapid subsidence). System- atic sampling on that scale would thus lead to a completely wrong observation and interpretation of the real situation, where random sampling would at least show you that the pattern is rather complex and may warrant further investiga- tion. In a spatial sampling, an equi- valent example may be sampling in a mudflat in a 20 by 20 m grid, where there are waste-water exhausts at the coast in 20 m intervals. When by chance samples are taken in the drainage channels, assemblages ob- served there will indicate much more eutrophic conditions in the mudflat than is really the case.

Laboratory setup

There is a vast variety of more or less established experimental setups (for which analytical methods are well understood) for manipulative experiments, which are partly de- rived from the sampling schemes dis- cussed above (Dytham 2011,Corn- wall and Hurd 2015). The important thing about a proper laboratory setup

(7)

TMS Newsletter vol. 98 Methods in micropalaeontology

is first and foremost an unbiased dis- tribution of treatment and control units. There are different types of controls, but the two most important are (1) the procedural control and (2) the experimental control (Dytham 2011).

The experimental control is what many scientist consider the only con- trol in an experimental setup. It is a population, that is not exposed to the treatment (e.g. increased salin- ity) of the experimental group, but is instead cultured at natural condi- tions. This allows to evaluate, if the treatment had any effect on the in- vestigated parameter, or if the con- trol group shows the same patterns as the experimental group.

The procedural control is another type of control group that may be worthwhile when the treatment in- volves a lot of disturbances (e.g. spe- cimens have to be taken out of the aquaria each day for measurements and then put back). In such cases, the stress from the manipulation alone can have an effect, independent of the actual treatment. A procedural control is a control group that is dis- turbed in the same way as the exper- imental group (e.g. taken out of the aquarium into measurement vessels

each day) but otherwise cultured un- der the same conditions as the exper- imental control group.

When designing the setup of con- trol and experimental groups, it is important to make sure that there are no confounding factors. For in- stance, having all the experimental groups stand closer to the window and all the control groups further in the room closer to the air condition- ing in an experiment studying the effect of salinity levels can lead to an unobserved influence of light at- tenuation and temperature. It can thus lead to an observed difference between control and experimental group, that has nothing to do with salinity but is misinterpreted as such, because salinity was the only factor that was actively manipulated. The two most common base forms of proper experimental setup are the Latin square design and the random- ized block design (Hurlbert 1984, Dytham 2011). In the Latin square design, all treatments are applied within one block, but the different treatments have different positions in each row to avoid confounding of treatments. In the randomized block design, treatments are equally dis- tributed across blocks (e.g. incubat-

(8)

ors) but their position per block is randomized (Fig. 13).

A B C D B C D A C D A B D A B C

A C D B

B A C D D B

A C

C D B A

Block 1 Block 2

Block 3 Block 4 Latin square Randomized block

Figure 13:Examples of the two most com- mon experimental setups. In the Latin square design all treatments (including con- trols) are set up in one block, and each row and each column contains exactly one of each treatment. The shown arrangement is indeed not optimal, because treatment ‘D’

occurs in the corner twice (in the first and last row). In the randomized block design the experiment is spatially segregated into blocks, and each block contains one of each treatment (the arrangement of treatments is different in each block).

Replication

Replication is a very controversial matter, and far too complex to be discussed here. Ever sinceHurlbert (1984)published his devastating con- clusion about extensive pseudorep- lication in scientific experiments, experimental design was the most dreaded part of work for many scient- ists. Indeed, replication as expected

byHurlbert (1984)requires elabor- ate design, andHeffner et al. (1996) and Cornwall and Hurd (2015) in- dicate that the situation in scientific practice had not changed for the bet- ter since. There are two things to say about this here.

The first is, thatHurlbert (1984) is right in demanding proper rep- lication of experiments. Some of the experimental setups he criti- cises as pseudoreplication (Hurlbert 1984, fig. 1), for instance simple or clumped segregation or no replica- tion, can easily be avoided by just sticking to the guidelines outlined in this article and the literature here cited. Others may be harder to grasp for the reader. For instance, the fact that having only one incubator or aquarium per treatment—so that all control groups are in Incubator 1 and all experimental groups in In- cubator 2—is isolative segregation (Hurlbert 1984, fig. 1) and not rep- lication may be hard to understand.

The simple reason is, that even mod- ern technology does not work per- fectly, and one of the incubators may not work properly, break entirely, or have in the past accidentally been contaminated with a substance that influences the experiment. This can

(9)

TMS Newsletter vol. 98 Methods in micropalaeontology

lead to erroneous results, and it is not only in the interest of proper rep- lication but also of saving time in the case of catastrophic events (e.g.

one incubator breaks down entirely, destroying the entire experiment if there is no replication in another in- cubator) to avoid this.

The second thing is, that there are those who argue thatHurlbert (1984) might be a bit too restrictive in his approach (Hargrove and Pickering 1992, Oksanen 2001). Especially when dealing with mensurative and manipulative field experiments (nat- ural experiments), the high standards of replication demanded byHurlbert (1984) can hardly be met, or if so only by strict spatial or temporal lim- itation of the study. In such cases, a large range of treatments that offers properly robust results takes priority over replication, if replication would mean that the potential range of treat- ments is so small that any potentially observable effect is ambiguous.

To make this clear, one should try to properly replicate ones ex- periments as far as possible and avoid pseudoreplication whenever possible. But the wish to replicate must never limit the experiment in a way that makes its results ambigu-

ous. As Oksanen (2001, p. 33) put it:

‘Let us face it. If the concept pseu- doreplication is used in the broader sense, including compound treat- ments, then all experiments are pseu- doreplicated, though we do not al- ways have enough information to un- derstand how.’.

References

Aldridge D, Beer CJ, and Purdie DA (2012) Calcfication in the planktonic Foramini- feraGlobigerina bulloideslinked to phos- phate concentrations in surface waters of the North Atlantic Ocean.Biogeosciences 9: 1725–39. doi:10 . 5194 / bg - 9 - 1725 - 2012.

Cornwall CE and Hurd CL (2015) Experi- mental design in ocean acidification re- search: Problems and solutions. ICES Journal of Marine Science73 (3): 572–

81. doi:10.1093/icesjms/fsv118.

Dytham C (2011)Choosing and Using Stat- istics: A Biologist’s Guide. 3rd ed. (Ox- ford, Chichester, and Hoboken: Wiley–

Blackwell). 298 pp.

Federer WT (1955) Experimental Design.

(New York and London: Macmillan &

Co.). 544 pp.

Hargrove WW and Pickering J (1992) Pseu- doreplication: Asine qua nonfor regional ecology.Landscape Ecology6 (4): 251–

8. doi:10.1007/BF00129703.

Heffner RA, Butler MJ, and Reilly CK (1996) Pseudoreplication revisited.Eco- logy 77 (8): 2558–62. doi:10 . 2307 / 2265754.

(10)

Hurlbert SH (1984) Pseudoreplication and the design of ecological field experiments.

Ecological Monographs54 (2): 187–211.

doi:10.2307/1942661.

Knappertsbusch M (2016) Evolutionary pro- spection in the Neogene planktic fo- raminiferGloborotalia menardiiand re- lated forms from ODP HOLE 925B (Ceara Rise, western tropical Atlantic):

Evidence for gradual evolution superim- posed by long distance dispersal?Swiss Journal of Palaeontology 135: 1–44.

doi:10.1007/s13358-016-0113-6.

Marschner L, Triebskorn R, and Köhler H.-R (2012) Arresting mantle formation and redirecting embryonic shell gland tissue by platinum2+leads to body plan modi- fications inMarisa cornuarietis(Gastro- poda, Ampullariidae).Journal of Mor- phology273 (8): 830–41. doi:10.1002/

jmor.20019.

Nomaki H, Heinz P, Nakatsuka T, Shiman- aga M, and Kitazato H (2005) Species- specific ingestion of organic carbon by deep-sea benthic Foraminifera and meiobenthos: In situ tracer experiments.

Limnology and Oceanography 50 (1):

134–46. doi:10.4319/lo.2005.50.1.0134.

Oksanen L (2001) Logic of experiments in ecology: Is pseudoreplication a pseudois- sue?Oikos94 (1): 27–38. doi:10.1034/j.

1600-0706.2001.11311.x.

Quinn GP and Keough MJ (2002)Experi- mental Design and Data Analysis for Bio- logist. (Cambridge, New York, Port Mel- bourne, Madrid, and Cape Town: Cam- bridge University Press). 537 pp.

Schmidt C, Kučera M, and Uthicke S (2014) Combined effects of warming and ocean acidification on coral reef Foraminifera Marginopora vertebralis and Hetero- stegina depressa. Coral Reefs 33 (3):

805–18. doi:10.1007/s00338-014-1151- 4.

Weinkauf MFG, Moller T, Koch MC, and Kučera M (2014) Disruptive selection and bet-hedging in planktonic Foramini- fera: Shell morphology as predictor of ex- tinctions.Frontiers in Ecology and Evol- ution2: Article 64. doi:10 . 3389 / fevo . 2014.00064.

Références

Documents relatifs

Realizability as presented in [15] is checked as follows: one first checks that a set of peers obtained via projection from the choreography is synchronizable. If the

Curvature and distance measurements for the buckling actuator: initial state at ambient temperature (left), actuated state with resistive heating (right).. We define the

Please find the revised version of our manuscript entitled Which pitfall traps and sampling efforts should be used to evaluate the effects of cropping systems on

The first one dealt with an inverse problem of non-linear heat transfer to estimate the thermal properties (storage and transport coefficients), con- sidering a uniform

Whereas binary and secant ternary items can be solved using cell-based strategies, nonsecant ternary and quaternary items require shape-based strategies, as can be seen in

La capacité croissante à contrôler des systèmes quantiques toujours plus simples a pu être mise à prot pour étudier, tester et comprendre plus profondément encore cette

Pour un système d’exploitation standard, du type UNIX ou Linux, où les tables de pages utilisées par les TLBs ne sont pas répliquées, cette réduction du taux de Miss entraîne

Faculty Annual Funding = Operating + Overhead + Capital + Research Operating funds are distributed annually to cover the costs of staff salaries, administration