Friday, 10 Apr 2026
  • Contact
  • Privacy Policy
  • Terms & Conditions
  • DMCA
logo logo
  • World
  • Politics
  • Crime
  • Economy
  • Tech & Science
  • Sports
  • Entertainment
  • More
    • Education
    • Celebrities
    • Culture and Arts
    • Environment
    • Health and Wellness
    • Lifestyle
  • 🔥
  • Trump
  • House
  • ScienceAlert
  • White
  • VIDEO
  • man
  • Trumps
  • Season
  • star
  • Watch
Font ResizerAa
American FocusAmerican Focus
Search
  • World
  • Politics
  • Crime
  • Economy
  • Tech & Science
  • Sports
  • Entertainment
  • More
    • Education
    • Celebrities
    • Culture and Arts
    • Environment
    • Health and Wellness
    • Lifestyle
Follow US
© 2024 americanfocus.online – All Rights Reserved.
American Focus > Blog > Tech and Science > AI chatbots fail to diagnose patients by talking with them
Tech and Science

AI chatbots fail to diagnose patients by talking with them

Last updated: January 2, 2025 10:31 am
Share
AI chatbots fail to diagnose patients by talking with them
SHARE

Don’t call your favourite AI “doctor” just yet

Just_Super/Getty Images

Artificial intelligence has made significant advancements in various fields, including medicine. However, a recent study has revealed that while AI models excel in professional medical exams, they struggle with one crucial aspect of being a physician – interacting with patients to gather medical information and provide accurate diagnoses.

Research conducted by Pranav Rajpurkar and Shreya Johri at Harvard University introduced a new evaluation benchmark called CRAFT-MD, which assesses clinical AI models’ reasoning abilities through simulated doctor-patient conversations. These conversations were based on 2000 medical cases from US medical board exams, replicating real-life scenarios where patients may not disclose crucial information unless prompted.

The study utilized OpenAI’s GPT-4 model as a “patient AI” interacting with the clinical AI being tested. Results showed that leading AI models such as GPT-3.5, GPT-4, Meta’s Llama-2-7b, and Mistral AI’s Mistral-v2-7b performed significantly worse in conversation-based diagnostics compared to written case summaries.

For instance, GPT-4’s diagnostic accuracy dropped from 82% with structured summaries to 26% in patient conversations. Despite being the top performer, GPT-4 only gathered complete medical histories in 71% of simulated conversations and did not always provide accurate diagnoses.

Eric Topol from the Scripps Research Translational Institute noted that evaluating AI’s clinical reasoning through patient conversations is a more practical approach than traditional exams. However, Rajpurkar emphasized that AI’s success in simulated scenarios does not equate to surpassing human physicians due to the complexities of real-world medical practice.

While AI shows promise in supporting clinical work, it is not a substitute for the holistic judgement and experience of human doctors. The study underscores the importance of ongoing research to enhance AI’s capabilities while acknowledging the irreplaceable role of human healthcare providers.

See also  Daniel H. Wilson on Finding a Native Take on Traditional Alien Invasion Stories

Topics:

TAGGED:chatbotsdiagnoseFailpatientstalking
Share This Article
Twitter Email Copy Link Print
Previous Article A “Both Sides” Approach to Orientalism A “Both Sides” Approach to Orientalism
Next Article How To Draw a Snowman (Free Printable + Video) How To Draw a Snowman (Free Printable + Video)
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *


The reCAPTCHA verification period has expired. Please reload the page.

Popular Posts

Auction Houses: The Luxury Boom Nobody Saw Coming

Auction houses are experiencing a surge in interest from younger bidders, and they are capitalizing…

November 24, 2025

NWSL launching tool to protect players from online harassment : NPR

NWSL Commissioner Jessica Berman speaks with the press during the 2025 NWSL Media Day in…

March 14, 2025

Camp Mystic plans to reopen with new safety protocols after Texas flood killed 27 young campers, staffers

Camp Mystic, the site where 27 young campers and staff tragically lost their lives in…

September 24, 2025

Meet Hyperallergic’s New Editor-in-Chief 

Four weeks ago, I proudly became a new American. This momentous occasion marked the conclusion…

October 5, 2025

19 Best Airbnbs in Italy, From Puglia to the Dolomites

Are you looking for a luxurious getaway in Rome, Lake Como, or the Dolomites mountains?…

January 25, 2026

You Might Also Like

How to watch NASA’s Artemis II splash back down to Earth
Tech and Science

How to watch NASA’s Artemis II splash back down to Earth

April 10, 2026
Mythos autonomously exploited vulnerabilities that survived 27 years of human review. Security teams need a new detection playbook
Tech and Science

Mythos autonomously exploited vulnerabilities that survived 27 years of human review. Security teams need a new detection playbook

April 10, 2026
Scientists Found a Common Brain ‘Fingerprint’ Across 5 Psychedelics : ScienceAlert
Tech and Science

Scientists Found a Common Brain ‘Fingerprint’ Across 5 Psychedelics : ScienceAlert

April 10, 2026
Oppo Find X9 Ultra Colours Leaks
Tech and Science

Oppo Find X9 Ultra Colours Leaks

April 10, 2026
logo logo
Facebook Twitter Youtube

About US


Explore global affairs, political insights, and linguistic origins. Stay informed with our comprehensive coverage of world news, politics, and Lifestyle.

Top Categories
  • Crime
  • Environment
  • Sports
  • Tech and Science
Usefull Links
  • Contact
  • Privacy Policy
  • Terms & Conditions
  • DMCA

© 2024 americanfocus.online –  All Rights Reserved.

Welcome Back!

Sign in to your account

Lost your password?