Explore key concepts in experimentation:
Every study should follow a somewhat similar workflow that involves:
Every study should follow a somewhat similar workflow that involves:
Hypothesis formalize our belief about the outcome of the study BEFORE we conduct it.
Hypothesis must be testable and proved right OR wrong, but not both.
After collecting data, I should be able to say that the data provides evidence in support OR against my original hypothesis.
It is ok if you get to the end of your study and your data shows your original hypothesis was wrong, this is part of science!
We hypothesize that increasing nitrogen (N) and potassium (K) fertilizer rates will increase corn grain yield.
Is this testable? Why?
We hypothesize that increasing N and K fertilizer rates may increase corn grain yield.
Is this testable? Why?
Our hypothesis is that greater N and K fertilizer rates can increase corn grain yield.
Is this testable? Why?
Your objectives should stem directly from your hypothesis:
Hypothesis: We hypothesize that increasing N and K fertilizer rates will increase corn grain yield.
Objectives: Our objective is to assess the effect of different N and K fertilizer rates on corn grain yield.
A study design is comprised of two components:
Treatment design + Experimental design
Treatment design is the part of the study design related to what treatments you need to answer your hypothesis/objectives, and how they are related to one another.
Note
Our goal is to select a treatment design that contains the treatment factors and their respective levels necessary to properly address the research question(s).
Let’s stop for a second and think about it.
If I want to find the optimum amount of an input, I really need to treat the crop with different levels of that input so I can estimate at which level the yield response is maximized.
Since we have two inputs (N and K), we need to do that for both of them.
Also, it may be that N and K interact in affecting corn yield, so we are interested in this effect too.
We have 2 treatment factors (N and K), and now let’s decide on their levels:
Note
Treatment factors: K and N fertilizer
Treatment levels: the rates chosen within the treatment factors (e.g., 0, 30, 60 kg K/ha)
Since we are interested in how N and K interact, we need to have in our treatment design all the combinations between N and K fertilizer levels.
That leaves us with 3 N levels x 3 K levels = 9 total treatment combinations.
In other words, we need all the above 9 treatments to find the joint optimum N and K fertilizer rates that optimize corn yield.
This is called a crossed 2-way factorial treatment design, or, more informative, crossed 3 N rate x 3 K rate factorial treatment design.
Note that our treatment factors are crossed, meaning that both N and K are at the same hierarchical level in the treatment design.
In contrast, treatment factors can also be nested, where the levels of one treatment factor are allocated within the levels of the other. Split-plot is an example where treatment factors are nested, meaning that one factor is at a higher hierarchy than the other (this is subject of a future lecture).
Now, notice that the treatment design has only addressed our objectives.
So what is the use of the experimental design?
Experimental design is the part of the study design related to how your treatments are assigned to the experimental units.
The decision of which experimental design to use should be guided by two factors:
Our goal is to use the simplest experimental design that accommodates all our treatments and their replications in homogeneous experimental units.
Experimental unit: the smallest unit in an experiment that is randomly and independently assigned to a treatment, normally is a plot in field research context. Unit of true replication.
Observational unit: entity of which the response variable is measured. If not the same as EU, then watch out for sub-sampling (not true replicate).
Replicate: different experimental units receiving the same treatment. Replicates exist in most designs.
Block: a set of experimental units grouped into homogeneous conditions where each EU is randomly assigned to a different treatment, and randomization happens independently for each block.
Continuing with our example of N and K levels, let’s assume that we would like to have 4 replications. That brings us to 9 treatments x 4 replications = 36 experimental units. The question that follows then is
“Do I have enough homogeneous experimental material to accommodate all 36 experimental units?”.
Two of the most common experimental designs in agriculture are:
“Do I have enough homogeneous experimental material to accommodate all 36 experimental units?”.
If the answer is Yes, then you can use CRD as the experimental design, and treatments will be randomized to the entire area (no restrictions in randomization).
In the plot layout here, all treatments (1 through 9) were randomly assigned to any experimental unit (plot) in the study area.
Treatment 1 and its replicates are highlighted.
Note how, due to the unrestricted randomization, treatment 1 appears twice in the first column, and does not appear on the third column. The same happened with other treatments.
Because the experimental material is homogeneous (e.g., same soil texture class), this should not be an issue when estimating treatment means and performing comparisons. 👍
“Do I have enough homogeneous experimental material to accommodate all 36 experimental units?”.
If the answer is No, then you should consider what types of limitation you have and which experimental design can be used.
Let’s say we answered No in our example. We have enough area to allocate 36 plots on the field, but we know that the area has heterogeneity in soil texture, a feature that likely impacts corn yield.
If we were to continue using a CRD, here’s what it would look like:
Suppose that dark soil texture has a positive effect on corn yield, and light soil texture has a negative effect.
In this case, using a CRD as in the example above, treatment 1 would likely have an average yield that is overestimated due to the unbalanced number of reps in each soil texture class.
Because of how treatments were randomized (no restriction), it is impossible to separate the effect of treatment from that of the soil texture class.
All these have a negative effect on both treatment means (biased) and on the analysis standard error (inflated), which makes treatment comparisons unfair and inaccurate (more difficult to detect real differences).
But, we can fix this by choosing an appropriate experimental design!
For that, we could use an RCBD, where blocks will be entirely confined within a given soil texture class.
Note
With RCBDs, each treatment appears once per block, and a set of 9 different treatments are randomized and assigned to each individual block separately.
Note how each treatment appears once and only once in every block.
The increased variability in corn yield caused by soil texture will be confined to the block effect, and can be properly dissected and not affect our inference on the treatment design variables.
There are other experimental designs that account for increasingly complicated limitations in experimental material.
Some of these include latin square, balanced incomplete block, partially confounded factorial, cyclic, lattice, supersaturated, response surface, and other designs.
We will not get into those in this class as they are less common and field-specific.
It all starts with curiosity and a well defined, testable hypothesis.
Treatment design: addresses research question(s).
Experimental design: addresses lack of homogeneity and/or limitations (funds, size, etc.) of experimental material.
The proper choice of experimental design will allow for unbiased and accurate treatment comparisons.