Repeated measurements

Transcription

Repeated measurements
Repeated measurements
Bo Markussen
bomar@math.ku.dk
Department of Mathematical Sciences
June 4, 2014
LAST (MATH)
Statistical methods for the Biosciences
Day 7
1 / 27
Outline
Introduction to SmB-II.
Analysis of repeated measurements.
LAST (MATH)
Statistical methods for the Biosciences
Day 7
2 / 27
Statistical data analysis project
Product:
I
I
I
I
Article style project (10–15 pages, say) with biological background and
objective. Presentation of statistical method and results. Preferably
written such that it easily may be altered into a submission paper.
Oral project presentation on June 18–19 in form of 20 minutes “Power
Point”. Again with both biological and statistical content.
Project handed in by email, preferably in Word, possibly in pdf.
Deadline June 16.
If neither June 18 nor 19 are possible for you, we may find an
alternative dates for handing in project and doing the presentation.
Supervision:
I
I
I
I
You receive comments and suggestions on your project synopsis.
You are entitled to 2 supervisions meetings (≈ 45 minutes) during the
3 weeks project period.
I’ll answer methodological and technical questions by email.
I’ll also act as a “statistical reviewer” on the written report.
Software: May be chosen freely.
LAST (MATH)
Statistical methods for the Biosciences
Day 7
3 / 27
Repeated measurements models: Why?
Example from course day 6: Color of pork meat.
10 pigs from both old and new breed: 7 chops from each pig.
Storage
Dark
Light
0 days
chop 1: data
not used
Time
1 days 4 days
chop 2 chop 3
chop 5 chop 6
6 days
chop 4
chop 7
Random effect model for chops 2 to 7 (in total 2*10*6=120 observations):
rednessi = α(storagei , timei , breedi ) + A(pigi ) + i
|{z}
| {z }
∼N (0,σA2 )
∼N (0,σ 2 )
But what if the experimental design has been like this?
Storage
Dark
Light
LAST (MATH)
0 days
chop 1
chop 2
Time
1 days 4 days
chop 1 chop 1
chop 2 chop 2
Statistical methods for the Biosciences
6 days
chop 1
chop 2
Day 7
4 / 27
example continued. . .
Alternative design has 4 measurements for each pork chop
Storage
Dark
Light
0 days
chop 1
chop 2
Time
1 days 4 days
chop 1 chop 1
chop 2 chop 2
6 days
chop 1
chop 2
Random effect model for the alternative design:
rednessi = α(storagei , timei , breedi ) + A(pigi ) + B(pigi , chopi ) + i
|{z}
{z
}
| {z } |
∼N (0,σA2 )
∼N (0,σB2 )
∼N (0,σ 2 )
Here the 4 measurements on the same pork chop (from the same pig)
share an additional random effect B(pig,chop).
But perhaps measurements taken close in time are more correlated
than measurements taken fare apart in time! How to model that?
LAST (MATH)
Statistical methods for the Biosciences
Day 7
5 / 27
General remarks on repeated measurements
Repeated measurements originate from study designs where the
experimental units have been measured several times (typically at
different time points or at different spatial positions):
I
I
I
I
“Economic” necessity, e.g. when experimental units are expensive.
Experimental units may serve as their own controls.
Response profile (i.e. response over time) is of scientific interest.
Repeated measurements are analysed either to gain power or to
investigate the response profile.
A summary measure is a single number capturing the important
feature of the response profile.
I
I
I
Examples: AUC (area under the curve), mean, maximum, minimum,
range between max and min, time under a pre-specified level, slope,
curvature, halving time, slope after the minimum, . . .
Summary measure preferably suggested from the scientific study, not
from the statistical analysis.
Summary measures reduce the repeated measurements to a single
observation =⇒ statistical analysis without repeated measurements.
LAST (MATH)
Statistical methods for the Biosciences
Day 7
6 / 27
Which method to use?
Summary measures vs. Random effects vs. Repeated measurements
Summary measures is always an option.
I
Unless you have particular interest in the response profile I recommend
analysis of summary measures (if it has sufficient power).
With few repeated measurements per subject, say 4 or less, it does
not make sense to estimate the serial correlation structure.
I
Simply use a random effect model.
With many repeated measurements per subject, say 5 or more, you
have enough information to estimate the serial correlation structure.
I
This is necessary to have trustworthy p-values and confidence intervals.
LAST (MATH)
Statistical methods for the Biosciences
Day 7
7 / 27
Case study: Growth of Baobab trees under water stress
Data kindly provided by Henri-Noël Bouda
Baobab seeds from 3 countries and 7 provenances sown in the
beginning of 2009.
Plants grown under 3 water regimes (100%, 75% and 50% field
capacity).
Diameter and height of plants measured monthly.
To measure root weight etc. some plants were harvested in August
2009, some plants in February 2010.
Purpose of experiment:
I
I
How does water drought effect growth of trees.
Is there an interaction with country and/or provenance?
Some plants die (12%). Should this be incorporated in the analysis?
LAST (MATH)
Statistical methods for the Biosciences
Day 7
8 / 27
Individual response profiles (subject profiles)
For 362 different baobab trees
A good plot to make. Gives overview of the data and provides an
impression of the “typical” time-response relationship.
LAST (MATH)
Statistical methods for the Biosciences
Day 7
9 / 27
Average response profiles
Response profiles averaged within the 9 combinations of treatments and countries
A good plot to make. Gives overview of the treatment effects.
We will analyse profiles from the harvested trees (plot to the right).
In particular we ignore trees that died – possibly from water drought.
I
Any comments on this?
LAST (MATH)
Statistical methods for the Biosciences
Day 7
10 / 27
Data organization in Excel sheet
Wide form (also called “horizontal organization”) of responses: diameter, height
Variable
Country
Provenance
Levels
3 (Burkina,Mali,Tanzanie)
7 (Kolangal,. . . ,Samé)
Plant
Block
Treatment
HarvestDate
Dia0209
..
.
362 (BKol-04,. . . ,TNku-76)
3 (1,2,3)
3 (1,2,3)
3 (aug-09,feb-10,missing)
continuous (or missing)
..
.
Description
Country
3 provenances from Burkina,
3 from Mali, 1 from Tanzanie
Plant id
Field blocks
Water regime
Day of harvest
Diameter, February 2009
..
.
Dia0110
Hei0209
..
.
continuous (or missing)
continuous (or missing)
..
.
Diameter, January 2010
Height, February 2009
..
.
Hei0110
continuous (or missing)
Height, January 2010
LAST (MATH)
Statistical methods for the Biosciences
Day 7
11 / 27
Long form: diameter, height for each month
The “long form” is also referred to as the “vertical organization”
Variable
Country
Provenance
Levels
3 (Burkina,Mali,Tanzanie)
7 (Kolangal,. . . ,Samé)
Plant
Block
Treatment
HarvestDate
month
diameter
height
362 (BKol-04,. . . ,TNku-76)
3 (1,2,3)
3 (1,2,3)
3 (aug-09,feb-10,missing)
12 (2,3,. . . ,13)
continuous (or missing)
continuous (or missing)
LAST (MATH)
Description
Country
3 provenances from Burkina,
3 from Mali, 1 from Tanzanie
Plant id
Field blocks
Water regime
Day of harvest
February’09 — January’10
Diameter
Height
Statistical methods for the Biosciences
Day 7
12 / 27
Long form as needed for the statistical analysis
Obs
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
...
...
...
Country Provenance Plant
Block
Burkina Kolangal BKol-11
1
Burkina Kolangal BKol-11
1
Burkina Kolangal BKol-11
1
Burkina Kolangal BKol-11
1
Burkina Kolangal BKol-11
1
Burkina Kolangal BKol-11
1
Burkina Kolangal BKol-11
1
Burkina Kolangal BKol-11
1
Burkina Kolangal BKol-11
1
Burkina Kolangal BKol-11
1
Burkina Kolangal BKol-11
1
Burkina Kolangal BKol-11
1
Burkina Kolangal BKol-78
1
Burkina Kolangal BKol-78
1
Burkina Kolangal BKol-78
1
Burkina Kolangal BKol-78
1
Burkina Kolangal BKol-78
1
Burkina Kolangal BKol-78
1
Burkina Kolangal BKol-78
1
LAST (MATH)
Treat
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
HarvestDate
01AUG2009
01AUG2009
01AUG2009
01AUG2009
01AUG2009
01AUG2009
01AUG2009
01AUG2009
01AUG2009
01AUG2009
01AUG2009
01AUG2009
01AUG2009
01AUG2009
01AUG2009
01AUG2009
01AUG2009
01AUG2009
01AUG2009
Statistical methods for the Biosciences
month diameter height
2
4.33
33.0
3
4.33
33.0
4
5.62
40.0
5
5.65
42.0
6
7.10
52.0
7
10.00
54.0
8
13.35
69.0
9
NA
NA
10
NA
NA
11
NA
NA
12
NA
NA
13
NA
NA
2
4.65
39.0
3
4.65
39.0
4
6.43
39.0
5
6.50
43.0
6
8.00
49.0
7
8.60
62.0
8
13.73
75.0
Day 7
13 / 27
Table of variables
Variable
Country
Provenance
Plant
Type
Nominal
Nominal
Remark:
Nominal
Block
Treatment
month
Nominal (3 levels: 1,2,3)
Nominal (3 levels: 1,2,3)
Nominal (12 levels: 2,. . . , 12,13)
diameter
height
continuous
continuous
(3 levels: Burkina, Mali, Tanzania)
(7 levels: Kolangal,. . . ,Samé)
nested in Country
(362 levels: BKol-04,. . . ,TNku-76)
Usage
fixed effect
fixed effect
random effect
subject id
fixed effect
fixed effect
fixed effect
correlation effect
response
response
The two responses (diameter, height) analysed separately.
R code via lme() in nlme-package:
I
I
I
fixed effects are specified in model formula.
random effects are specified inrandom option.
correlations are specified in corr option.
LAST (MATH)
Statistical methods for the Biosciences
Day 7
14 / 27
Diagram of fixed and random factors
3/
362
7
[plant]335
(
3412
'
/
21
6
[I]2819
/ block32
9
treat*block4
7
treat*provenance12
'
252
treat*month*provenance132
/
'
36
< E 01
1
7
7
provenance6
treat*month22
(
"/
3
treat2
84
month*provenance66
/
'
12
month11
In the model reduction provenance is nested within country.
Are the residuals i , i = 1, . . . , 3412, independent?
LAST (MATH)
Statistical methods for the Biosciences
Day 7
15 / 27
Repeated measurements model
diameteri = α(treati , monthi , provenancei , blocki )
+ A(planti ) + B(planti , monthi ) + i
|{z}
{z
}
| {z } |
∼N (0,σA2 )
∼N (0,σ 2 )
∼N (0,σB2 )
|
{z
[I]-term in the factor diagram
}
Random effect A: Some plants are bigger than others.
Correlated effect B: Correlated within plants (∼ subject id).
Correlation typically decreases with increasing time distance.
Uncorrelated between plants.
I
Possible interpretation is variation between time position of growth
period.
Error term : Possible interpretation is measurement error.
LAST (MATH)
Statistical methods for the Biosciences
Day 7
16 / 27
Three examples of correlation structures for B
diameteri = α(treati , monthi , provenancei , blocki ) + A(planti ) + B(planti , monthi ) + i
(A) The model without the serial correlated effect B.
I
In this case we have a random effect model as treated on Day 6. This
model is also referred to as the random intercept model or the
compound symmetry model.
|monthi −monthj |
d
|monthi −monthj |2
ρ2
(B) Var B(plant, monthi ), B(plant, monthj ) = σB2 exp −
I
Correlation has exponential decrease.
(C) Var B(plant, monthi ), B(plant, monthj ) = σB2 exp −
I
I
Correlation has Gaussian decrease.
When a random effect A and an error term are present, this model is
sometimes referred to as the Diggle model after Peter Diggle.
LAST (MATH)
Statistical methods for the Biosciences
Day 7
17 / 27
R code
# In the data frame ’baobab’ we have the factors:
#
Plant, Block, Treatment, Provenance, Country
long <- reshape(baobab,
varying=list(c("Dia0209","Dia0309","Dia0409","
c("Hei0209","Hei0309","Hei0409","
v.names=c("diameter","height"),
timevar="month",
idvar="Plant",
direction="long")
# Diggle model: two other models in R script
mGauss <-lme(diameter~Treatment*Block+
Treatment*factor(month)*Provenance,
random=~1|Plant,
corr=corGaus(form=~month|Plant,nugget=TRUE),
data=long,na.action=na.omit)
LAST (MATH)
Statistical methods for the Biosciences
Day 7
18 / 27
Which repeated measurements model to use?
There exists many other models than those listed on slide 17
Is the model valid?
I
I
I
Residual plot: Non-random scatter suggests that the explanatory
variables have not been used appropriately, e.g. an interaction or a
quadratic term might be missing.
Normal quantile plots: Residuals not on a straight line suggests that
the response variable perhaps should be transformed.
Semi-variogram: Compares empirical correlation structure (the dots) to
the fitted theoretical correlation structure (the line).
Interpretation?
I
Random effects have a simple interpretation, which speak in favour of
the compound symmetry model. The exponential decrease and the
Diggle model have similar interpretations.
Akaikes Information Criterion (AIC): “The smaller the better”.
I
For the Baobab dataset the compound symmetry model is clearly
rejected by the AIC.
LAST (MATH)
Statistical methods for the Biosciences
Day 7
19 / 27
Exponential vs. Gaussian decrease: Residual plot
Exponential decrease model
Diggle model
4
Standardized residuals
Standardized residuals
4
2
0
−2
−4
2
0
−2
−4
5
10
15
0
Fitted values
5
10
15
Fitted values
plot(mExp,main="Exponential decrease model")
plot(mGauss,main="Diggle model")
LAST (MATH)
Statistical methods for the Biosciences
Day 7
20 / 27
Exponential vs. Gaussian decrease: Normal quantile plot
Note that the axes are interchanged relative to the usual qqnorm()
Diggle model
Quantiles of standard normal
Quantiles of standard normal
Exponential decrease model
2
0
−2
−4
−2
0
2
4
2
0
−2
−4
Standardized residuals
−2
0
2
4
Standardized residuals
qqnorm(mExp,main="Exponential decrease model")
qqnorm(mGauss,main="Diggle model")
LAST (MATH)
Statistical methods for the Biosciences
Day 7
21 / 27
Semi-variogram: γ(h) = 12 var X (t + h) − X (t)
An alternative is to use Chapter 9.4 in the R guide
Exponential decrease model
Diggle model
1.0
0.6
0.8
Semivariogram
Semivariogram
0.5
0.4
0.3
0.6
0.4
0.2
0.2
0.1
2
4
6
8
10
2
Distance
4
6
8
10
Distance
plot(Variogram(mExp),ylim=c(0,0.7))
plot(Variogram(mGauss),ylim=c(0,1.1))
LAST (MATH)
Statistical methods for the Biosciences
Day 7
22 / 27
Choice of repeated measurements model
Residual and normal quantile plots acceptable for all models
Model
AIC
Compound symmetry
10534.887
Exponential decrease
8710.159
Diggle
8806.953
Akaikes Information Criterion (AIC) prefers the exponential decrease
model. However, the semi-variogram suggests that the correlation
structure is better modelled by the Diggle model.
We choose the Diggle model, which we describe as:
“An ANOVA with random effect of plant and residual errors
correlated within plants. The errors consist of an independent
component and a component with Gaussian decreasing
correlation. For the fixed effects we used the concatenation of
the full factorial design of (tretment,month,provenance) and the
full factorial design of (treatment,block)”.
LAST (MATH)
Statistical methods for the Biosciences
Day 7
23 / 27
Overview of steps in a repeated measurements analysis
List and classify the factors and covariates in the design.
I
Done (see slide 14).
Make plots of individual, and perhaps averaged, response profiles.
I
Done (see slides 9 and 10).
Choose and validate a correlation structure.
I
Done (see slides 20 to 23).
Test for reduction of fixed effects: interactions, main effects,
covariates etc. (as usual for ANOVA and ANCOVA models).
I
I
I
Remember to refit models using maximum likelihood (method="ML").
Unfortunately the drop1() function does not work for lme-objects. So
this has to be done by hand (see Chapter 9.5.3 in the R guide).
Automatic model selection based on AIC may be done using
stepAIC() from the MASS-package.
Report estimates and conclusions from the final model (as usual, e.g.
using lsmeans and the multcomp-package).
LAST (MATH)
Statistical methods for the Biosciences
Day 7
24 / 27
Analysis of Summary measures
An alternative to the repeated measurements analysis discussed above
Idea:
I
I
Reduce the curve for each subject to a single value.
Analyze this summary measure as usual (ANOVA, regression, . . . ).
As summary measures we could for example use:
I
I
I
I
I
I
I
Average response over time.
Area under curve (AUC, often used in medicine).
Slope of curve (rate of increase).
Maximal response.
Position (e.g. time) of maximal response.
Halving time since maximal response.
Curvature: fit α + β ∗ time + γ ∗ time2 for each individual and use γ̂.
Note: The summary measures should be computed for each subject — not
on the average profiles!
LAST (MATH)
Statistical methods for the Biosciences
Day 7
25 / 27
Principles for choosing summary measures
Select a measure that addresses the problem under investigation.
Do not choose summary measures on the basis of visual inspection of
the treatment differences — this is cheating.
I
But it is OK to plot all profiles in one graph and select “typical
features” of the curves for further investigation.
You may analyze more than one summary measure. If so, then choose
some that reflect different aspects of the curves. For example:
I
I
AUC and average response is NOT a good combination.
AUC and rate of increase might be a good combination.
But be aware of the associated multiple testing problem.
LAST (MATH)
Statistical methods for the Biosciences
Day 7
26 / 27
Analysis of summary measures: Pros and Cons
Advantages:
I
I
I
Simple analysis, which is more easily communicated.
Often powerful analysis if the summary measure is chosen appropriately.
Model validation more easy and transparent.
Disadvantages:
I
I
I
Each curve is reduced to a single value — loss of information?
Which summary measure should be choose?
No investigation of the “temporal” structure, which might be
important for the problem under investigation.
LAST (MATH)
Statistical methods for the Biosciences
Day 7
27 / 27