Adversarial Robustness of Medical LLMs


Project Timeline: 2024

Skills: PyTorch, adversarial ML, medical NLP

Paper: arXiv: 2402.10527


Overview

Adversarial entity sampling methods for probing medical LLM failure modes under clinical text distributional shifts. Tests how models behave when clinical entities are perturbed in structured and unstructured medical text.