The midterm project is your opportunity to apply what you’ve learned in Modules 1-4 to a clinical AI problem end-to-end. You will build a risk prediction or outcome model using structured clinical data, and write a mini field guide documenting your work.
Deliverables:
Task: Build a risk prediction or outcome model using tabular clinical data.
Suggested Datasets:
Minimum Requirements:
Have a different structured clinical dataset in mind? Propose it!
To get approval:

Code (50 points):

| Component | Points |
|---|---|
| Data loading and preprocessing | 10 |
| Model architecture (appropriate for task) | 10 |
| Training pipeline (loss, optimizer, logging) | 10 |
| Evaluation metrics (appropriate for task) | 10 |
| Code quality (readable, documented) | 10 |
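
The rubric components above can be illustrated in miniature. The following is a minimal sketch, not a reference solution. It assumes a synthetic two-feature dataset (the `age` and `lactate` features, their coefficients, and every function name are hypothetical and not taken from any suggested dataset), and it uses only the Python standard library so that it runs anywhere. A real submission would more likely use pandas, scikit-learn, or PyTorch on actual clinical data.

```python
import math
import random


def make_synthetic_patients(n, seed=0):
    """Generate hypothetical (age, lactate) features and a binary outcome."""
    rng = random.Random(seed)
    rows, labels = [], []
    for _ in range(n):
        age = rng.uniform(20, 90)
        lactate = rng.uniform(0.5, 6.0)
        # Assumed ground truth: older age and higher lactate raise risk.
        logit = 0.04 * (age - 55) + 0.8 * (lactate - 3.0)
        risk = 1.0 / (1.0 + math.exp(-logit))
        rows.append([age, lactate])
        labels.append(1 if rng.random() < risk else 0)
    return rows, labels


def fit_scaler(rows):
    """Column means and standard deviations, computed on training rows only."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    stds = [
        math.sqrt(sum((r[j] - means[j]) ** 2 for r in rows) / n) or 1.0
        for j in range(d)
    ]
    return means, stds


def transform(rows, means, stds):
    """Standardize each feature with previously fitted statistics."""
    return [[(r[j] - means[j]) / stds[j] for j in range(len(r))] for r in rows]


def predict(weights, bias, rows):
    """Logistic regression: predicted probability of the positive class."""
    return [
        1.0 / (1.0 + math.exp(-(bias + sum(w * x for w, x in zip(weights, r)))))
        for r in rows
    ]


def log_loss(y, p, eps=1e-12):
    """Mean binary cross-entropy."""
    total = sum(
        t * math.log(q + eps) + (1 - t) * math.log(1 - q + eps)
        for t, q in zip(y, p)
    )
    return -total / len(y)


def train(rows, y, epochs=200, lr=0.5):
    """Full-batch gradient descent; records the loss before each update."""
    n, d = len(rows), len(rows[0])
    weights, bias, history = [0.0] * d, 0.0, []
    for _ in range(epochs):
        p = predict(weights, bias, rows)
        history.append(log_loss(y, p))
        errors = [q - t for q, t in zip(p, y)]
        for j in range(d):
            weights[j] -= lr * sum(e * r[j] for e, r in zip(errors, rows)) / n
        bias -= lr * sum(errors) / n
    return weights, bias, history


def auroc(y, scores):
    """Chance that a random positive outranks a random negative (ties = 0.5)."""
    pos = [s for t, s in zip(y, scores) if t == 1]
    neg = [s for t, s in zip(y, scores) if t == 0]
    wins = sum(1.0 if a > b else 0.5 if a == b else 0.0 for a in pos for b in neg)
    return wins / (len(pos) * len(neg))


# Load, split, preprocess, train, evaluate.
X, y = make_synthetic_patients(400, seed=0)
X_train, y_train, X_test, y_test = X[:300], y[:300], X[300:], y[300:]
means, stds = fit_scaler(X_train)  # fit on training data to avoid leakage
w, b, history = train(transform(X_train, means, stds), y_train)
test_auc = auroc(y_test, predict(w, b, transform(X_test, means, stds)))
print(f"final train loss: {history[-1]:.3f}, test AUROC: {test_auc:.3f}")
```

Two design choices are worth carrying over to a real project: the scaler is fitted on the training split only, and the metric (AUROC here) is computed on held-out data. For imbalanced clinical outcomes, a discrimination metric alone may not be enough, so consider reporting calibration or precision-recall results as well.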
Write a 2-3 page document covering:
| Section | Points |
|---|---|
| Problem Statement — What clinical problem does this address? | 5 |
| Data Description — What data did you use? Limitations? | 5 |
| Methods — What approach did you take and why? | 10 |
| Results — Key performance metrics with interpretation | 10 |
| Limitations & Next Steps — What would you do differently? | 5 |
5-7 minute presentation covering:
Presentations will take place during class on Wednesday, March 18.
Submit via GitHub (link TBD):
- `code/`: Your notebooks and/or scripts
- `field_guide.pdf`: Your mini field guide
- `slides.pdf`: Presentation slides
- `README.md`: How to run your code

| Date | Milestone |
|---|---|
| Mar 3 | Project released, teams assigned |
| Mar 6 | Deadline for custom dataset proposals |
| Mar 10 | Recommended: Data loaded, baseline running |
| Mar 11 | In-class work session |
| Mar 18 | Presentations in class (Wednesday) |
| Mar 18 | Code + Field Guide due by 11:59 PM |

Grading breakdown:

| Component | Points |
|---|---|
| Code | 50 |
| Mini Field Guide | 35 |
| Presentation | 15 |
| Total | 100 |