
Model Comparison and Selection
What you’ll build today: a practical framework for comparing models and selecting the one that best supports decisions—not just the one that fits best.
Learning Objectives
By the end of this lesson, you will be able to:
- Explain why multiple models can fit the same data
- Understand criteria for model comparison (AIC, BIC, likelihood)
- Balance model fit vs complexity
- Choose models based on decision relevance
Key Ideas
In pharmacometrics, it is common to have multiple models that fit the data reasonably well.
This raises an important question:
Which model should we use?
Model selection is not just about fit—it is about:
- predictive performance
- interpretability
- decision relevance
Why This Lesson Matters
Different models can:
- produce similar fits
- but lead to different predictions
This means:
Model choice can directly impact clinical and regulatory decisions.
Worked Example: Two Competing Models
Imagine:
- Model A → simpler and easier to interpret
- Model B → more flexible but more complex
Both explain the observed data.
But they may behave differently when predicting new scenarios.
Likelihood-Based Comparison
Models can be compared using likelihood:
\[ L(\theta) = P(\text{data} \mid \theta) \]
Higher likelihood → better fit
But:
- more complex models often have higher likelihood
Objective Function Values (OFV)
In pharmacometrics, likelihood is often reported as an objective function value (OFV):
\[ OFV = -2\log L \]
where:
- \(L\) is the likelihood
- lower OFV indicates better agreement between the model and the data
Conceptually:
Higher Likelihood
↓
Lower OFV
↓
Better Fit
This is the quantity commonly reported by software such as NONMEM and nlmixr2.
Comparing Nested Models
When one model is a modification of another, models can be compared using the change in objective function value:
\[ \Delta OFV = OFV_{old} - OFV_{new} \]
Interpretation:
- Positive ΔOFV indicates improved fit
- Larger ΔOFV indicates stronger evidence that the new model better explains the data
For example:
Base Model
OFV = 1250
New Model
OFV = 1240
ΔOFV = 10
The new model provides a better fit because the OFV decreased.
Information Criteria
To balance fit and complexity:
AIC (Akaike Information Criterion)
\[ AIC = -2 \log L + 2k \]
BIC (Bayesian Information Criterion)
\[ BIC = -2 \log L + k \log n \]
Where:
- \(k\) = number of parameters
- \(n\) = number of observations
Interpretation
- Lower AIC/BIC → stronger statistical support among compared models
- Penalizes overly complex models
Model Selection Is Sequential
Model comparison is rarely based on a single number.
Typical workflow:
- Check convergence and stability
- Inspect diagnostics
- Compare fit statistics (OFV, AIC, BIC)
- Evaluate predictive performance
- Ask whether the model supports the decision
Each step eliminates unsuitable models.
Insight
The best model is not the one that fits best—it is the one that balances fit and simplicity.
Overfitting improves fit but harms prediction.
Beyond Statistics: Decision Relevance
A model should be selected based on:
- ability to answer the scientific question
- predictive performance
- robustness
Example:
Model A: - slightly higher AIC
- stable predictions
- interpretable parameters
Model B: - lower AIC
- unstable extrapolation
👉 Model A may still be preferred.
Strategies
- Compare models using multiple criteria
- Consider simplicity and interpretability
- Focus on prediction and decision impact
- Validate models using diagnostics
Common Mistakes
- Choosing the model with lowest AIC blindly
- Ignoring biological plausibility
- Overfitting noise
- Ignoring predictive performance
Practice Problems
- Why can multiple models fit the same data?
- What is the purpose of AIC?
- Why is the lowest AIC not always the best model?
- Because different structures can explain the same observations
- To balance model fit and complexity
- Because it may still overfit or lack interpretability
Summary
Model selection:
- compares competing explanations
- balances fit and complexity
- focuses on decision relevance
The goal is not the best-fitting model, but the most useful model.
- Fit is necessary, not sufficient
- Use AIC/BIC as guides, not rules
- Prefer simpler models when possible
- Always consider the decision context