Essential techniques for robust machine learning model evaluation
Understanding how to properly split your data is crucial for building reliable machine learning models that generalize well to unseen data.
Purpose: Train the model parameters
Usage: Model learns patterns from this data
Size: Largest portion of your dataset
Purpose: Tune hyperparameters & model selection
Usage: Evaluate different model configurations
Size: Medium portion for reliable estimates
Purpose: Final unbiased performance evaluation
Usage: Used only once at the end
Size: Sufficient for reliable performance estimate
Divides data into k equal folds. Each fold serves as validation set once while others train the model.
Similar to K-Fold but maintains the class distribution in each fold, crucial for imbalanced datasets.
Extreme case where k equals the number of samples. Each sample is used as validation set once.
Respects temporal order by using past data for training and future data for validation.
Gather and clean your dataset
Divide into train/val/test sets
Apply CV strategy on train+val
Train models on each fold
Optimize based on CV results
Test best model on test set
Β© 2025 Machine Learning for Health Research Course | Prof. Gennady Roshchupkin
Interactive slides designed for enhanced learning experience