Section 47 Regression Model: Next Level


47.1 Checkpoints in Statistical Modelling

  • Understand the system and background; physical or biological

  • Understand the objective and hypotheses

  • Translate the problem into statistical framework

  • Data collection procedures and designs - observation or experimental data

  • Characteristics of response and predictor variables - types of variables

  • Coding of the variables, units of variables

  • Presence and proportion of missing values

  • Exploratory Data Analysis

    • Summary Statistics of individual response and predictor variables
    • Plots to describe the variables
    • Relationship between the variables - Summary Statistics and Plots
    • Identify possible data entry errors or other inconsistencies
    • Identify the possible relationship and outcomes from possible model exploration
  • Statistical modelling


47.2 Advanced Topics in Regression Model

  • Validation, Cross-validation: Model selection based on training and testing data

  • Model selection when p >> n

  • Shrinkage (regularisation) method: Ridge, Lasso and Elastic Net

  • Dimension reduction method: Principal Component Regression (PCR), Partial Least Squares Regression (PLS)

  • Smoothing the relationship between x and y: Generalised Additive Model

  • Non-linear regression model: Growth curve

  • Regression model with Non-Gaussian error: Bernoulli, Binomial, Poisson, Negative Binomial

  • Regression model with pseudoreplication: Linear Mixed Model, Multilevel Model