Get Started:
hw7_governance.pyNote: This is an intentionally lighter assignment to allow you to focus on your final project. It emphasizes thinking and writing over coding.
By completing this assignment, you will:
You’ve built models. Now what? The gap between a working model and a safely deployed clinical tool is enormous. This assignment focuses on the “boring” but critical work of governance:
This is the stuff that determines whether your model helps patients or harms them.
You’re a data scientist at a community hospital. The hospital wants to deploy a sepsis early warning model that predicts which patients in the general ward are at risk of developing sepsis in the next 6 hours.
The model was developed at a large academic medical center and published with impressive results (AUC 0.89). Your job is to:
Before deploying any external model, you need to validate it works at your institution.
1.1 Define Your Test Cohort (10 pts)
Design a validation study:
Write this as a 1-paragraph protocol.
1.2 Choose Evaluation Metrics (10 pts)
Select 3-4 metrics that matter for this use case. For each:
Consider: The model will generate alerts for nurses. False alarms cause alert fatigue. Missed sepsis cases can be fatal.
1.3 Acceptance Criteria Document (10 pts)
Create a formal acceptance criteria document with:
Once deployed, you need to know if the model is still working.
2.1 Choose Monitoring Metrics (10 pts)
Select 2-3 statistics to monitor continuously. For each:
Think about: input drift, prediction drift, outcome drift.
2.2 Implement Drift Detection (15 pts)
Write code that:
Test your implementation with simulated data showing:
2.3 Monitoring Dashboard Sketch (10 pts)
Create a simple visualization or sketch showing:
This can be a matplotlib figure or a hand-drawn sketch (photo) — the goal is to show what you’d want to see daily.
Documentation that nobody reads is useless. Create governance artifacts that busy clinicians might actually use.
3.1 One-Page Model Card (15 pts)
Create a model card (max 1 page) including:
3.2 Escalation Protocol (10 pts)
Define what happens when things go wrong:
Write this as a simple flowchart or numbered steps.
3.3 Review Cadence (10 pts)
Propose a review schedule:
| File | Description |
|---|---|
hw7_governance.py |
Code for Part 2 (drift detection) |
acceptance_criteria.md |
Part 1.3 document |
model_card.md |
Part 3.1 one-page model card |
escalation_protocol.md |
Part 3.2 flowchart/steps |
outputs/ |
Any figures (monitoring dashboard, etc.) |
| Component | Points |
|---|---|
| Part 1: Acceptance Testing | 30 |
| 1.1 Test cohort design | 10 |
| 1.2 Evaluation metrics | 10 |
| 1.3 Acceptance criteria document | 10 |
| Part 2: Monitoring | 35 |
| 2.1 Monitoring metrics | 10 |
| 2.2 Drift detection implementation | 15 |
| 2.3 Dashboard sketch | 10 |
| Part 3: Governance | 35 |
| 3.1 Model card | 15 |
| 3.2 Escalation protocol | 10 |
| 3.3 Review cadence | 10 |
| Total | 100 |