Clinical Trial Eligibility Research
Project Timeline: 2025 – present
Skills: Python, PyTorch, FAISS, sentence-transformers, SQLite, NLP, LLM classification, semantic matching, dual-GPU pipelines
Advisor: Dr. Reza Abbasi-Asl (UCSF / UC Berkeley)
Overview
Evidence-guided clinical trial design and evolution to optimize patient eligibility accrual across 500K+ trials. Includes an LLM benchmark for criteria change prediction and an adoption gap study measuring whether trials implement published broadening recommendations. NLP pipeline with semantic matching, LLM-based classification, and dual-GPU processing.
Key Components
- Eligibility criteria change tracking: version history analysis across 500K+ trials with 800K+ version snapshots
- LLM benchmark: evaluation framework for criteria change prediction tasks using retrieval-augmented generation
- Adoption gap analysis: systematic measurement of whether clinical trials adopt published evidence-based recommendations for broadening their eligibility criteria
- Semantic matching pipeline: directive-to-trial matching using biomedical embedding models
- LLM-based classification: adoption assessment at scale using dual-GPU processing
