8 Terminologies

8.1 Fixed factor

  • For fixed factor, all possible levels of the factor are present in the study

  • Since all levels are included in the study, if we sample the data again, we will obtain the identical levels of the factor

  • For example, we may assign the variable Sex as two levels: Male and Female. If we do collect another sample, the levels of Sex in the new sample will be among these two levels.

  • In general, the fixed factor has few levels


8.2 Fixed effect

  • Fixed effect refers to the inference about the effect size for the levels of fixed factor

  • Fixed effect inference can only be made about the specific level used in the study

  • Fixed effect is an unknown fixed or constant quantity for the given study

  • Numerical variables are always considered as fixed effects


Figure: An illustration of the fixed effect and random error for the observed value of four subjects




8.3 Random factor

  • The factor for which only a random sample of all possible levels of the factor are included in the study, i.e. the study does not include all levels of the factor

  • If we sample the data again, we will obtain different levels of the factor at each sampling stage

  • For example, we may sample patients from the population, and if we sample again, we will get a different set of patients from the population of patients.

  • Generally, random factor has many levels

  • The classification variables that identify the Level 2 and Level 3 units are considered as random factors


8.4 Random effect

  • Random effect refers to the inference about the effect size for the levels of random factor

  • It usually represents random deviations from the relationships described by the fixed effects

  • Random effect assumes the effect is randomly varying with a defined distribution, which is generally a Normal distribution

  • Random effect allows to make inference beyond the specific level included in the study. For example, we can generalise the inference to the population of patients, and therefore, the inference is not specific to patients included in the study

  • Commonly, random effects in a study are not of primary interest; or random factor is considered as a nuisance variable


Figure: An illustration of the distribution of random effect and error components for a single subject




8.5 Nested factors

  • All levels of a factor (first) are measured only with a single level of another factor (second)

  • The first factor is then said to be nested within the levels of the second factor

  • The effect estimated of the nested factor on the response variable is called as nested effect


Figure: An illustration of nested factors where Students are nested within the Class, and Classes are nested within School.


8.6 Crossed factors

  • All levels of a factor (first) are measured across all levels of another factor (second)

  • The effects of these factors on the response variable is called crossed effects


Figure: An illustration of crossed factors where data were measured for all combination of two levels of Sex and three levels of Ethinicity.




8.7 SBP Data

Figure: A schematic representation of the clinical data structure described above


8.8 Linear Model

If we wish to compare the effect of group on the SBP data at week 4 adjusting for the baseline SBP data, the model equation will be:

\[y_{i} = \beta_0 + \beta_1 \times SBP0_{i} + \beta_2 \times GROUP_{i} + \beta_3 \times (TIME_{i} = 4) + e_{i}\]

Here:

\(y_{i}\) = the SBP value of the i-th patient (\(i = 1, ..., 20\)) at Week 4

\(\beta_0\) = the intercept (fixed effect) at the reference level of group and time

\(\beta_1\) = the fixed effect of baseline SBP

\(\beta_2\) = the fixed effect of group

\(\beta_3\) = the fixed effect of time

\(e_{i}\) = the random error

Assumption:

\(e_{i} \sim N(0, \sigma_e^2)\)


8.9 Linear Mixed Model

If we wish to compare the effect of group on the SBP data at all time points adjusting for the baseline SBP data, the model equation will be:

\[y_{i} = \beta_0 + \beta_1 \times SBP0_{i} + \beta_2 \times GROUP_{i} + \beta_3 \times TIME_{ij} + u_{0i} + e_{ij}\]

Here:

\(y_{ij}\) = the SBP value of the i-th patient (\(i = 1, ..., 20\)) at the j-th time point (\(j = 1, 2, 3, 4\))

\(\beta_0\) = the intercept (fixed effect) at the reference level of group and time

\(\beta_1\) = the fixed effect of baseline SBP

\(\beta_2\) = the fixed effect of group

\(\beta_3\) = the fixed effect at j-th time (\(j = 2\))

\(u_{0i}\) = the random effect of i-th patient (\(i = 1, ..., 20\)) associated with the intercept

\(e_{ij}\) = the random error

Assumption:

\(u_{0i} \sim N(0, \sigma_P^2)\)

\(e_{ij} \sim N(0, \sigma_e^2)\)




8.10 Other terminologies

  • Likelihood

  • Methods of estimation

    • Restricted Maximum Likelihood (REML)

    • Maximum Likelihood (ML)

  • Degrees of freedom

    • Satterthwaite approximation

    • Residual method

    • Kenward-Roger approximation

  • Log-Likelihood convergence

  • Estimated Marginal (EM) Means




8.11 Advantage of mixed model

  • Complex design

  • Complex sampling strategies when simple random sampling within group is not applicable

  • Multistage sampling

  • Clustering of observations

  • Correct estimate of SE

  • Address missing values

  • Account different sources of variability

  • Ask complex research questions

  • Inference beyond the random factors included in the study