© 2024 americanfocus.online – All Rights Reserved.

AI chatbots fail to diagnose patients by talking with them

Last updated: January 2, 2025 10:31 am

Don’t call your favourite AI “doctor” just yet

Just_Super/Getty Images

Artificial intelligence has made significant advancements in various fields, including medicine. However, a recent study has revealed that while AI models excel in professional medical exams, they struggle with one crucial aspect of being a physician – interacting with patients to gather medical information and provide accurate diagnoses.

Pranav Rajpurkar and Shreya Johri at Harvard University developed a new evaluation benchmark, CRAFT-MD, which tests a clinical AI model's reasoning through simulated doctor-patient conversations. The conversations were based on 2,000 medical cases drawn from US medical board exams and were designed to mirror real consultations, in which patients may not disclose crucial information unless prompted.

The study used OpenAI’s GPT-4 model as a “patient AI” that conversed with the clinical AI being tested. Leading AI models, including GPT-3.5, GPT-4, Meta’s Llama-2-7b, and Mistral AI’s Mistral-v2-7b, all diagnosed significantly less accurately from conversations than from written case summaries.

For instance, GPT-4’s diagnostic accuracy fell from 82% when it was given structured case summaries to 26% when it had to gather the information through patient conversations. Even GPT-4, the top performer, gathered a complete medical history in only 71% of simulated conversations, and it did not always reach the correct diagnosis.

Eric Topol of the Scripps Research Translational Institute noted that evaluating an AI’s clinical reasoning through patient conversations is a more practical approach than traditional exams. However, Rajpurkar cautioned that even if an AI model succeeded in these simulated scenarios, that would not mean it surpasses human physicians, because real-world medical practice is more complex than a simulation.

While AI shows promise in supporting clinical work, it is not a substitute for the holistic judgement and experience of human doctors. The study underscores the importance of ongoing research to enhance AI’s capabilities while acknowledging the irreplaceable role of human healthcare providers.
