AI Outperforms Glaucoma, Retina Specialists for Diagnostic Accuracy

Medically reviewed by Carmen Pope, BPharm. Last updated on Feb 28, 2024.

By Elana Gotkine HealthDay Reporter

WEDNESDAY, Feb. 28, 2024 -- A large language model (LLM) chatbot outperforms glaucoma and retina specialists for diagnostic accuracy, according to a study published online Feb. 22 in JAMA Ophthalmology.

Andy S. Huang, M.D., from the Icahn School of Medicine at Mount Sinai in New York City, and colleagues conducted a comparative cross-sectional study recruiting 15 participants aged 31 to 67 years, including 12 attending physicians and three senior trainees, to compare the diagnostic accuracy and comprehensiveness of responses from an LLM chatbot with those of fellowship-trained glaucoma and retina specialists. Responses were assessed via a Likert scale for glaucoma and retina questions (10 of each type) in deidentified glaucoma and retina cases (10 of each type).

The researchers found that the combined question-case mean rank for accuracy was 506.2 and 403.4 for the LLM chatbot and glaucoma specialists, respectively; the corresponding mean ranks for completeness were 528.3 and 398.7. The mean rank for accuracy was 235.3 and 216.1 for the LLM chatbot and retina specialists, respectively; the corresponding mean ranks for completeness were 258.3 and 208.7. A significant difference was seen between all pairwise comparisons -- except for specialist versus trainee -- in rating chatbot completeness in the Dunn test. Compared with their specialist counterparts, both trainees and specialists rated the chatbot's accuracy and completeness more favorably, with specialists noting a significant difference in the accuracy and completeness of the chatbot.

"These findings support the possibility that artificial intelligence tools could play a pivotal role as both diagnostic and therapeutic adjuncts," the authors write.

Abstract/Full Text

Editorial (subscription or payment may be required)

Disclaimer: Statistical data in medical articles provide general trends and do not pertain to individuals. Individual factors can vary greatly. Always seek personalized medical advice for individual healthcare decisions.

Posted February 2024

More news resources

Subscribe to our newsletter

Whatever your topic of interest, subscribe to our newsletters to get the best of Drugs.com in your inbox.

Recently Approved

Rytelo
Rytelo (imetelstat) is an oligonucleotide telomerase inhibitor indicated for...
mRESVIA
mRESVIA is a modified RNA vaccine that may protect adults aged 60 years and...
Bkemv
Bkemv (eculizumab-aeeb) is a complement inhibitor interchangeable biosimilar to...

More drug approvals

AI Outperforms Glaucoma, Retina Specialists for Diagnostic Accuracy

Read this next

Planetary Health Diet Index Linked to Lower Total, Cause-Specific Mortality

Prevalence of Iron Deficiency Varies With Different Definitions

AI Blood-Based Lung Cancer Screening Test Developed for Fragmentome

More news resources

Subscribe to our newsletter