LLM-Powered Dysphagia Screening With Multimodal Physiological Signal Analysis and Medically Informed Prompts.
Liu Y, Wang L, Wang L, Wei X • IEEE transactions on neural systems and rehabilitation engineering : a publication of the IEEE Engineering in Medicine and Biology Society • 2026
An LLM-based framework integrating multimodal physiological signals (laryngeal vibration, nasal airflow, and swallowing sound) with medically-informed prompts achieves 96.3% classification accuracy for dysphagia screening, outperforming baseline models and maintaining robust performance in few-shot learning settings.
Key Findings
Results
The proposed LLM-based dysphagia screening framework achieved a classification accuracy of 96.3%, significantly outperforming baseline models.
The framework integrates multimodal physiological signals including laryngeal vibration, nasal airflow, and swallowing sound.
A medically-informed prompt template was designed to incorporate individual attributes, key biosignal features, and task instructions.
Performance was described as significantly outperforming baseline models.
The model leverages the reasoning capabilities of large language models (LLMs) for signal analysis.
Methods
The study recruited 217 participants generating 2,664 total samples for dysphagia screening evaluation.
A total of 217 participants were recruited, including 109 post-stroke patients with dysphagia and 108 healthy individuals.
The dataset comprised 1,391 dysphagic samples and 1,273 healthy control samples.
Participants included both post-stroke patients and healthy individuals as controls.
The dataset was designed to address challenges of limited data and complex multimodal signals.
Results
The model demonstrated robust performance in few-shot learning settings, indicating strong generalization capabilities.
The few-shot learning evaluation was used to assess model generalization with limited training data.
Robust performance in few-shot settings was explicitly noted as a key strength of the framework.
This characteristic is described as indicating 'strong generalization capabilities.'
The few-shot robustness addresses the challenge of limited clinical data availability.
Background
Traditional diagnostic techniques for dysphagia are limited by accessibility, reliability, and invasiveness, motivating the development of automated screening methods.
Traditional techniques identified as limited include bedside screening and videofluoroscopic swallowing studies (VFSS).
Dysphagia is described as a common complication among stroke patients.
Dysphagia significantly increases the risk of aspiration pneumonia, malnutrition, and mortality.
The framework was proposed specifically to address challenges of limited data and complex multimodal signals in clinical dysphagia detection.
Methods
The medically-informed prompt template was specifically designed to guide the LLM to focus on dysphagia-related patterns by incorporating individual attributes, key biosignal features, and task instructions.
The prompt template incorporated three elements: individual attributes, key biosignal features, and task instructions.
The design goal was to effectively guide the LLM to focus on dysphagia-related patterns.
This prompt-driven reasoning approach was identified as a key component of the framework's effectiveness.
The framework is described as offering 'extensive applicability in clinical practice.'
Liu Y, Wang L, Wang L, Wei X. (2026). LLM-Powered Dysphagia Screening With Multimodal Physiological Signal Analysis and Medically Informed Prompts.. IEEE transactions on neural systems and rehabilitation engineering : a publication of the IEEE Engineering in Medicine and Biology Society. https://doi.org/10.1109/TNSRE.2026.3674934