TY - GEN
T1 - CLONE
T2 - 2025 CHI Conference on Human Factors in Computing Systems, CHI EA 2025
AU - Cha, Seungeon
AU - Park, Jinseok
AU - Choi, Hojin
AU - Ryu, Hokyoung
AU - Seo, Kyoungwon
N1 - Publisher Copyright:
© 2025 Copyright held by the owner/author(s).
PY - 2025/4/26
Y1 - 2025/4/26
N2 - Early diagnosis of mild cognitive impairment (MCI) is essential to prevent its progression to Alzheimer’s disease. Human expert-driven diagnosis provides interpretable rationales but is time-consuming, while machine learning-based approaches offer efficiency but lack human-readable rationales. To address these limitations, we propose CLONE (Clinical Reasoning via Neuropsychologist Emulation), a three-stage framework leveraging large language models (LLMs) for MCI diagnosis: (1) emulating experts through role-playing, (2) synthesizing step-by-step diagnostic guidelines, and (3) performing clinical reasoning using the guideline. CLONE was evaluated on a real-world dataset of 65 subjects, achieving 89.23% diagnostic accuracy and outperforming the few-shot chain-of-thought (CoT) baseline by 6.15%, with specificity improving by 10.71%. Moreover, the synthesized guideline enhanced rationale quality, making rationales more consistent, correct, specific, helpful, and human-like compared to baselines. These findings highlight CLONE’s potential to enable accurate diagnosis and reliable clinical reasoning, addressing challenges in the field of MCI diagnosis. Our code is available at https://github.com/seoultech-HAILAB/CLONE.
AB - Early diagnosis of mild cognitive impairment (MCI) is essential to prevent its progression to Alzheimer’s disease. Human expert-driven diagnosis provides interpretable rationales but is time-consuming, while machine learning-based approaches offer efficiency but lack human-readable rationales. To address these limitations, we propose CLONE (Clinical Reasoning via Neuropsychologist Emulation), a three-stage framework leveraging large language models (LLMs) for MCI diagnosis: (1) emulating experts through role-playing, (2) synthesizing step-by-step diagnostic guidelines, and (3) performing clinical reasoning using the guideline. CLONE was evaluated on a real-world dataset of 65 subjects, achieving 89.23% diagnostic accuracy and outperforming the few-shot chain-of-thought (CoT) baseline by 6.15%, with specificity improving by 10.71%. Moreover, the synthesized guideline enhanced rationale quality, making rationales more consistent, correct, specific, helpful, and human-like compared to baselines. These findings highlight CLONE’s potential to enable accurate diagnosis and reliable clinical reasoning, addressing challenges in the field of MCI diagnosis. Our code is available at https://github.com/seoultech-HAILAB/CLONE.
KW - Clinical Reasoning
KW - Interpretable AI Diagnosis
KW - Large Language Models
KW - Mild Cognitive Impairment
KW - Synthetic Guidelines
UR - https://www.scopus.com/pages/publications/105005763154
U2 - 10.1145/3706599.3720111
DO - 10.1145/3706599.3720111
M3 - Conference contribution
AN - SCOPUS:105005763154
T3 - Conference on Human Factors in Computing Systems - Proceedings
BT - CHI EA 2025 - Extended Abstracts of the 2025 CHI Conference on Human Factors in Computing Systems
PB - Association for Computing Machinery
Y2 - 26 April 2025 through 1 May 2025
ER -