American Focus
© 2024 americanfocus.online – All Rights Reserved.
Tech and Science

AI chatbots fail to diagnose patients by talking with them

Last updated: January 2, 2025 10:31 am
Don’t call your favourite AI “doctor” just yet

Just_Super/Getty Images

AI models can excel at professional medical exams, but a recent study has found that they struggle with one crucial part of a physician's job: talking with patients to gather medical information and reach an accurate diagnosis.

Research conducted by Pranav Rajpurkar and Shreya Johri at Harvard University introduced a new evaluation benchmark called CRAFT-MD, which assesses clinical AI models’ reasoning abilities through simulated doctor-patient conversations. These conversations were based on 2000 medical cases from US medical board exams, replicating real-life scenarios where patients may not disclose crucial information unless prompted.

The study used OpenAI’s GPT-4 model as a “patient AI” that conversed with the clinical AI being tested. Leading AI models, including GPT-3.5, GPT-4, Meta’s Llama-2-7b and Mistral AI’s Mistral-v2-7b, performed significantly worse when diagnosing from these conversations than when diagnosing from written case summaries.

For instance, GPT-4’s diagnostic accuracy dropped from 82% with structured summaries to 26% in patient conversations. Despite being the top performer, GPT-4 only gathered complete medical histories in 71% of simulated conversations and did not always provide accurate diagnoses.

Eric Topol from the Scripps Research Translational Institute noted that evaluating AI’s clinical reasoning through patient conversations is a more practical approach than traditional exams. However, Rajpurkar emphasized that AI’s success in simulated scenarios does not equate to surpassing human physicians due to the complexities of real-world medical practice.

While AI shows promise in supporting clinical work, it is not a substitute for the holistic judgement and experience of human doctors. The study underscores the importance of ongoing research to enhance AI’s capabilities while acknowledging the irreplaceable role of human healthcare providers.
