Fix modelling issues by zmek · Pull Request #129 · UCL-CORU/patientflow

zmek · 2026-02-06T15:39:25Z

Fix Modelling Issues

Summary

This PR addresses several modelling issues to improve the robustness and correctness of the classifier training and prediction pipeline.

Changes

Bug Fixes & Improvements:

Boolean encoding: Fixed issue where boolean columns were being one-hot encoded unnecessarily
Validation: Added validation and error message when no snapshots match the prediction time
Missing columns handling: Improved handling of missing columns inside the classifier pipeline
Model persistence: Now saves exclude_from_training_data list with the model for better reproducibility

Files Changed

src/patientflow/train/classifiers.py - Core classifier training improvements
src/patientflow/predict/emergency_demand.py - Prediction pipeline updates
src/patientflow/prepare.py - Data preparation updates
src/patientflow/viz/ - Updated visualization modules (calibration, estimated_probabilities, madcap, shap)
Updated notebooks to reflect changes
Updated tests

- do not one-hot encode boolean - add validation and error message when no snapshots match prediction time Please enter the commit message for your changes. Lines starting

zmek added 5 commits January 20, 2026 15:26

Address modelling issues:

2d74b4a

- do not one-hot encode boolean - add validation and error message when no snapshots match prediction time Please enter the commit message for your changes. Lines starting

handle missing cols inside classifier pipeline

b762087

save exclude_from_training_data list with model

631bd1d

move exclude_with_training_data param to end of param list

5d7b69e

fix: handle SettingWithCopyWarning removal in pandas 3.0

4eaacb6

zmek merged commit 4ce8dbc into main Feb 6, 2026
6 checks passed

zmek deleted the fix-modelling-issues branch February 6, 2026 15:48

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix modelling issues#129

Fix modelling issues#129
zmek merged 5 commits intomainfrom
fix-modelling-issues

zmek commented Feb 6, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

zmek commented Feb 6, 2026