| Topic | Research Summary | Link |
|---|---|---|
| Problem Specification | When is prediction suitable in observational data? | Guidelines needed |
| Data Creation | Comparison of cohort vs case-control design | Journal of Big Data |
| Data Creation | Addressing loss to follow-up (right censoring) | BMC medical informatics and decision makingk |
| Data Creation | Investigating how to address left censoring in features construction | BMC Medical Research Methodology |
| Data Creation | Impact of over/under-sampling | Journal of big data |
| Data Creation | Impact of phenotypes | Study Done - Paper submitted |
| Model development | How much data do we need for prediction - Learning curves at scale | International Journal of Medical Informatics |
| Model development | What impact does test/train/validation design have on model performance | BMJ Open |
| Model development | What is the impact of the classifier | JAMIA |
| Model development | Can we find hyper-parameter combinations per classifier that consistently lead to good performing models when using claims/EHR data? | Study needs to be done |
| Model development | Can we use ensembles to combine different algorithm models within a database to improve models transportability? | Caring is Sharing–Exploiting the Value in Data for Health and Innovation |
| Model development | Can we use ensembles to combine models developed using different databases to improve models transportability? | BMC Medical Informatics and Decision Making |
| Model development | Impact of regularization method | JAMIA |
| Evaluation | Why prediction is not suitable for risk factor identification | Machine Learning for Healthcare Conference |
| Evaluation | Iterative pairwise external validation to put validation into context | Drug Safety |
| Evaluation | A novel method to estimate external validation using aggregate statistics | Study under review |
| Evaluation | How should we present model performance? (e.g., new visualizations) | JAMIA Open |
| Evaluation | How to interpret external validation performance (can we figure out why the performance drops or stays consistent)? | Study needs to be done |
| Evaluation | Recalibration methods | Study needs to be done |
| Evaluation | Is there a way to automatically simplify models? | Study protocol under development |