← Back to All Open Problems

Domain Focus · 6 Projects

Clinical AI &
Medical Education

Dual-process reasoning. Automation bias. Neuro-symbolic safety guarantees.
Six open problems at the frontier of clinical AI.

Interested? Email me at my Ashoka address with subject "IML: [Project ID]" — e.g., "IML: I5". I supervise ML/systems aspects; for clinical domain questions I will connect you with appropriate co-mentors. All levels welcome.

Domain Overview

The clinical AI landscape has shifted dramatically in 2025–2026. Adam Rodman's landmark study (Harvard/BIDMC) shows LLMs scoring 10/10 on the r-IDEA clinical reasoning framework — surpassing the 9/10 average for attending physicians. Yet this headline result coexists with a troubling paradox: clinicians augmented with AI decision support sometimes underperform AI alone. Automation bias causes clinicians to delegate reasoning to the system, so when the AI errs on a difficult case, augmented clinicians fail to catch it. Simply providing AI predictions is insufficient — systems must actively engage clinician reasoning. This gap between capability and safe deployment is the central open problem in the domain.

On the education side, AI is reshaping how medicine is taught. Stanford's Fall 2025 medical curriculum now integrates AI literacy throughout, not as an elective. Faculty surveys reveal the scale of the challenge: 91% agree AI will help clinical practice, but only 12% feel "very familiar" with AI capabilities. Adaptive tutoring and virtual patient simulation — grounded in cognitive architectures (ACT-R for illness scripts, IRT for ability estimation, FSRS for spaced repetition, ZPD for scaffolding) — offer a principled approach to closing this gap at scale. Logical Neural Networks (LNNs), demonstrated at AMIA 2025, combine the interpretability of first-order logic rules with learnable weights, outperforming random forests and gradient boosting on clinical diagnosis tasks while remaining auditable by domain experts.

These problems connect directly to Ashish's research portfolio. Cardio-Sahayak is a CVD clinical decision support system calibrated for South Asian populations, where standard Western biomarker thresholds (BMI, HbA1c, lipid profiles) systematically misperform. Virtual Tumor Board is a multi-agent simulation of the multi-disciplinary tumor board process. Onco-Shikshak is Ashish's flagship medical education system: a 6-agent AI tumor board simulation integrating ACT-R, IRT, FSRS, Bloom's Taxonomy, ZPD, and Deliberate Practice to model learner cognition at multiple levels simultaneously. Project B11 feeds directly into Cardio-Sahayak; project I13 extends Onco-Shikshak; project A10 targets the hardest open problem in both systems.

The neuro-symbolic direction — led by collaborator Ishaan Varior (Direction D3) — targets what no existing system achieves: a dual-process clinical reasoning architecture where System 1 (LLM pattern matching) generates rapid hypotheses and System 2 (formally verified symbolic rules) analytically validates them, with metacognitive monitoring of when to escalate. No current clinical AI system has formal safety proofs that safety-critical rules — "never give immunotherapy to a patient with active autoimmune disease" — cannot be overridden by the neural component. That is what A10 attempts to build.

Ashoka Coursework Connection

Entry points by course completed

CS 1101 Intro CS B6 (FDA label NLP), B10 (CDSS rule engine)
MAT 2202 P&S B11 (CVD biomarkers with random forests) — Juneja-compatible
CS 3410 Intro to ML I5 (neuro-symbolic reasoning prototype), I13 (tumor board extension)
CS 5310/MAT 3216 Symbolic Logic I5 (first-order logic rules), A10 (formal verification)
CS 4101 Theory of Computation A10 (Lean4 safety proofs, decidability of clinical rule verification)

All 6 Projects

IDProjectDescriptionPrereqsTier
B6Classifying FDA Drug Labels with NLPBuild TF-IDF and BioBERT classifiers to categorize sentences from FDA prescribing information into structured sections (indications, adverse reactions, contraindications). Demonstrates the gap between rule-based and neural approaches to regulatory text.CS 1101Beginner
B10Simple Clinical Decision Support Rule EngineEncode NCCN lung cancer guidelines as a Python inference engine and compare its recommendations against GPT-4 and Claude on 20 clinical vignettes. The rule engine represents System 2 reasoning; the comparison reveals the complementary failure modes of symbolic and neural approaches.CS 1101, Discrete MathBeginner
B11Biomarker Identification for CVD Using Random ForestsTrain random forest classifiers on the UCI Heart Disease and Framingham datasets, then use SHAP values to identify the most predictive biomarker panel for cardiovascular risk — with attention to which features matter most for South Asian populations.MAT 2202 P&SBeginner Juneja
I5Neuro-Symbolic Clinical Reasoning PrototypeBuild a dual-process clinical reasoning module: System 1 is an LLM performing pattern-based diagnosis; System 2 is a rule-based guideline checker activated when confidence is low or safety-critical patterns are detected. Evaluated on 50 oncology vignettes against four system variants.Intro ML, LogicIntermediate
I13AI-Augmented Tumor Board Simulation ExtensionExtend Onco-Shikshak (Ashish's 6-agent tumor board simulation) with a new cancer type (paediatric GBM or neuroblastoma) and a new specialist agent role. Design and run a pilot evaluation measuring knowledge gain, reasoning quality, and learning outcomes against the original system.Intro ML, DSAIntermediate
A10Formally Verified Dual-Process Clinical ReasoningBuilding on I5, develop a full neuro-symbolic architecture integrating Logical Neural Networks with Lean4-verified safety proofs. Hard clinical rules (e.g., contraindications) are formally shown to be non-overridable by any neural component output. Integrated into Onco-Shikshak and evaluated with oncology residents on 100 vignettes.Theory of Computation, Symbolic Logic, Intro MLAdvanced

Key References

Papers & tools

  • Makani, A. (2026). Onco-Shikshak: AI-native medical education with cognitive architecture integration. medRxiv. — Ashish's flagship system; directly relevant to I13 and A10.
  • Rodman, A. (2025). Towards a Medical Superintelligence. Massachusetts Medical Society. — LLMs scoring 10/10 on r-IDEA; source of the automation bias analysis.
  • Logical Neural Networks for clinical diagnosis (2025). AMIA Annual Symposium Proceedings. PMC12150699. — LNNs outperforming random forests and gradient boosting on clinical tasks; key architecture for I5 and A10.
  • Croskerry, P. (2009). A Universal Model of Diagnostic Reasoning. Academic Medicine, 84, 1022–1028. — Foundational dual-process theory paper; theoretical grounding for I5 and A10.
  • Kahneman, D. (2011). Thinking, Fast and Slow. Farrar, Straus and Giroux. — System 1 / System 2 framework.
  • Li, Z. et al. (2023). Scallop: A Language for Neurosymbolic Programming. PLDI 2023. — Differentiable provenance semirings for end-to-end training through logical rules; relevant to A10. github.com/scallop-lang/scallop
  • IBM Research: Logical Neural Networks. Neural networks where each neuron implements a weighted first-order logic operation with learnable weights. github.com/IBM/LNN
  • Gupta, R. et al. (2019). Cardiovascular risk prediction in South Asians: a review. Indian Heart Journal, 71(2), 143–148. — Population-specific biomarker context for B11 and Cardio-Sahayak.

Ready to work on one of these?

Email with subject "IML: [Project ID]" — all levels welcome, undergrad through PhD.

← All 40 Open Problems

Last updated March 2026. Full project details available on request.