© 2024 americanfocus.online – All Rights Reserved.
AI chatbots fail to diagnose patients by talking with them

Last updated: January 2, 2025 10:31 am

Don’t call your favourite AI “doctor” just yet

Just_Super/Getty Images

Artificial intelligence has advanced rapidly in many fields, including medicine. However, a recent study found that although AI models excel at professional medical exams, they struggle with a crucial part of a physician’s job: talking with patients to gather medical information and reach an accurate diagnosis.

Pranav Rajpurkar and Shreya Johri at Harvard University introduced an evaluation benchmark called CRAFT-MD, which assesses clinical AI models’ reasoning abilities through simulated doctor-patient conversations. The conversations were based on 2000 medical cases drawn from US medical board exams, and they replicate real-life scenarios in which patients may not disclose crucial information unless prompted.

The study used OpenAI’s GPT-4 model as a “patient AI” that conversed with the clinical AI being tested. Leading AI models such as GPT-3.5, GPT-4, Meta’s Llama-2-7b and Mistral AI’s Mistral-v2-7b all performed significantly worse when diagnosing from these conversations than when diagnosing from written case summaries.
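
The conversational setup described above can be pictured with a small Python sketch. This is not the CRAFT-MD code: the names `Case`, `run_conversation`, `make_patient` and `toy_doctor` are hypothetical, the two cases are invented, and both agents are simple scripted stand-ins rather than language models. It only illustrates the general idea of a tested "doctor" questioning a "patient" that holds the case details, then scoring the final diagnosis.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Case:
    vignette: str   # written case summary (invented for this sketch)
    diagnosis: str  # reference answer


# Hypothetical interface: an agent maps the dialogue so far to its next utterance.
Agent = Callable[[List[str]], str]


def run_conversation(doctor: Agent, patient: Agent, max_turns: int = 10) -> str:
    """Alternate doctor and patient turns until the doctor commits to a diagnosis."""
    dialogue: List[str] = []
    for _ in range(max_turns):
        doctor_says = doctor(dialogue)
        dialogue.append(doctor_says)
        if doctor_says.startswith("DIAGNOSIS:"):
            return doctor_says[len("DIAGNOSIS:"):].strip()
        dialogue.append(patient(dialogue))
    return ""  # no diagnosis within the turn limit


def make_patient(case: Case) -> Agent:
    """Toy patient: shares the case details only when asked a question."""
    def patient(dialogue: List[str]) -> str:
        return case.vignette if dialogue[-1].endswith("?") else "I don't feel well."
    return patient


def toy_doctor(dialogue: List[str]) -> str:
    """Toy doctor: asks one question, then guesses from a keyword."""
    if not dialogue:
        return "What symptoms do you have?"
    return "DIAGNOSIS: migraine" if "headache" in dialogue[-1] else "DIAGNOSIS: unknown"


def accuracy(predictions: List[str], cases: List[Case]) -> float:
    """Fraction of predictions that match the reference diagnosis."""
    correct = sum(p.lower() == c.diagnosis.lower() for p, c in zip(predictions, cases))
    return correct / len(cases)


cases = [
    Case("Throbbing headache with nausea.", "migraine"),
    Case("Itchy red rash on both arms.", "eczema"),
]
predictions = [run_conversation(toy_doctor, make_patient(c)) for c in cases]
conversation_accuracy = accuracy(predictions, cases)
print(predictions, conversation_accuracy)
```

In a real evaluation the two scripted functions would be replaced by calls to the models under test, and matching a free-text diagnosis against the reference would need more care than the exact string comparison used here.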

For instance, GPT-4’s diagnostic accuracy dropped from 82% with structured summaries to 26% in patient conversations. Although GPT-4 was the top performer, it gathered complete medical histories in only 71% of simulated conversations, and it did not always reach an accurate diagnosis.
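
To put the size of that drop in perspective, here is a short Python calculation using only the two accuracy figures reported above. The variable names are illustrative, not taken from the study.

```python
# Accuracy figures for GPT-4 as reported in the article.
summary_accuracy = 0.82       # structured case summaries
conversation_accuracy = 0.26  # simulated patient conversations

# Absolute drop, in percentage points.
absolute_drop_points = round((summary_accuracy - conversation_accuracy) * 100)

# Relative drop, as a share of the original accuracy.
relative_drop = (summary_accuracy - conversation_accuracy) / summary_accuracy

print(f"{absolute_drop_points} percentage points, {relative_drop:.0%} relative drop")
```

The gap is 56 percentage points, which means GPT-4 lost roughly two-thirds of its summary-based accuracy when it had to gather the information through conversation.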

Eric Topol of the Scripps Research Translational Institute noted that evaluating AI’s clinical reasoning through patient conversations is a more practical approach than traditional exams. However, Rajpurkar emphasized that even if an AI succeeds in simulated scenarios, that would not mean it surpasses human physicians, because of the complexities of real-world medical practice.

While AI shows promise in supporting clinical work, it is not a substitute for the holistic judgement and experience of human doctors. The study underscores the importance of ongoing research to enhance AI’s capabilities while acknowledging the irreplaceable role of human healthcare providers.
