Section 47 Regression Model: Next Level
47.1 Checkpoints in Statistical Modelling
- Understand the system and background; physical or biological 
- Understand the objective and hypotheses 
- Translate the problem into statistical framework 
- Data collection procedures and designs - observation or experimental data 
- Characteristics of response and predictor variables - types of variables 
- Coding of the variables, units of variables 
- Presence and proportion of missing values 
- Exploratory Data Analysis - Summary Statistics of individual response and predictor variables
- Plots to describe the variables
- Relationship between the variables - Summary Statistics and Plots
- Identify possible data entry errors or other inconsistencies
- Identify the possible relationship and outcomes from possible model exploration
 
- Statistical modelling 
47.2 Advanced Topics in Regression Model
- Validation, Cross-validation: Model selection based on training and testing data 
- Model selection when p >> n 
- Shrinkage (regularisation) method: Ridge, Lasso and Elastic Net 
- Dimension reduction method: Principal Component Regression (PCR), Partial Least Squares Regression (PLS) 
- Smoothing the relationship between x and y: Generalised Additive Model 
- Non-linear regression model: Growth curve 
- Regression model with Non-Gaussian error: Bernoulli, Binomial, Poisson, Negative Binomial 
- Regression model with pseudoreplication: Linear Mixed Model, Multilevel Model