Skip to content

Fix modelling issues#129

Merged
zmek merged 5 commits intomainfrom
fix-modelling-issues
Feb 6, 2026
Merged

Fix modelling issues#129
zmek merged 5 commits intomainfrom
fix-modelling-issues

Conversation

@zmek
Copy link
Copy Markdown
Collaborator

@zmek zmek commented Feb 6, 2026

Fix Modelling Issues

Summary

This PR addresses several modelling issues to improve the robustness and correctness of the classifier training and prediction pipeline.

Changes

Bug Fixes & Improvements:

  • Boolean encoding: Fixed issue where boolean columns were being one-hot encoded unnecessarily
  • Validation: Added validation and error message when no snapshots match the prediction time
  • Missing columns handling: Improved handling of missing columns inside the classifier pipeline
  • Model persistence: Now saves exclude_from_training_data list with the model for better reproducibility

Files Changed

  • src/patientflow/train/classifiers.py - Core classifier training improvements
  • src/patientflow/predict/emergency_demand.py - Prediction pipeline updates
  • src/patientflow/prepare.py - Data preparation updates
  • src/patientflow/viz/ - Updated visualization modules (calibration, estimated_probabilities, madcap, shap)
  • Updated notebooks to reflect changes
  • Updated tests

zmek added 5 commits January 20, 2026 15:26
- do not one-hot encode boolean
- add validation and error message when no snapshots match prediction time

 Please enter the commit message for your changes. Lines starting
@zmek zmek merged commit 4ce8dbc into main Feb 6, 2026
6 checks passed
@zmek zmek deleted the fix-modelling-issues branch February 6, 2026 15:48
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant