7 Missing Data
7.1 Research objective
- To identify if the mean SBP at Week 4 decreased from the baseline value in the Treatment group
7.2 Data
We will use part data to explore this specific objective
Part data are comprised of those in the Treatment group at Week 0 and 4 for GP = 1 only
The modelling approach we have just described is also ideal with missing observation.
The model can also borrow information at the patient’s level
Let’s imagine that five patients did not turn up on Week 4.
For example, the following five patients do not have Week 4 data: 2, 6, 7, 10, 20
The data are shown in the following table.
7.3 Data Summary: Plot
7.4 Data Summary: Table
Time | N | Mean | SD |
---|---|---|---|
0 | 10 | 146.0 | 9.10 |
4 | 5 | 137.4 | 8.26 |
7.5 Model equation
\[y_{ij} = \beta_0 + \beta_1 \times TIME_{ij} + u_{0i} + e_{ij}\]
Here:
\(y_{ij}\) = the SBP value of the i-th patient at the j-th time point
\(\beta_0\) = the intercept at the reference level of time (baseline value)
\(\beta_1\) = the effect at j-th time
\(u_{0i}\) = the effect of i-th patient associated with the intercept
\(e_{ij}\) = the random error
Assumption:
\(u_{0i} \sim N(0, \sigma_P^2)\)
\(e_{ij} \sim N(0, \sigma_e^2)\)
7.6 Hypothesis
\[Null \space hypothesis, H_0: \beta_1 = 0\]
\[Alternative \space hypothesis, H_1: \beta_1 \ne 0\]
7.7 Paired t-test: SPSS Syntax
DATASET ACTIVATE DataSet1.
-TEST PAIRS=SBP0 WITH SBP4 (PAIRED)
T/ES DISPLAY(TRUE) STANDARDIZER(SD)
/CRITERIA=CI(.9500)
/MISSING=ANALYSIS.
7.8 Paired t-test: Summary
The analysis of the data by paired t-test is correct, however, note that it removes all patients where paired data are not available at Week 4 resulting very low degrees of freedom for the test statistic.

7.9 LMM: SPSS Syntax
MIXED SBP BY fTime /CRITERIA = DFMETHOD(SATTERTHWAITE) CIN(95) MXITER(100) MXSTEP(10)
SCORING(1) SINGULAR(0.000000000001) HCONVERGE(0, ABSOLUTE)
LCONVERGE(0, ABSOLUTE) PCONVERGE(0.000001, ABSOLUTE)
/FIXED = fTime | SSTYPE(3)
/METHOD = REML
/PRINT = G R SOLUTION
/RANDOM = INTERCEPT | SUBJECT(ID) COVTYPE(VC).
7.10 Linear mixed model: Summary
The analysis of the data considering the patient as a random effect still accounts for the patients for whom the data were available at Week 0. The estimated difference between Week 4 and 0 is also different as we have more information about the baseline mean SBP than offered by a paired t-test.
Compare the estimate, SE, t-statistic, p-value for both paired t-test and linear mixed model. Although the difference is practically negligible for this small dataset with only 10 patients, in a large dataset with many patients, LMM will handle the data better than the paired t-test.

7.11 Explanation
The estimated mean SBP in the Treatment group at Week 4 decreased from the baseline value by a magnitude of about 12 mmHG.
Note that the denominator degrees of freedom may not be integer. The denominator degrees of freedom is calculated by a Satterthwaite approximation. There is other method of approximation like Kenward and Roger method. The LMM uses an approximation to estimate the denominator df.