Tuesday, 30 Jun 2026
  • Contact
  • Privacy Policy
  • Terms & Conditions
  • DMCA
logo logo
  • World
  • Politics
  • Crime
  • Economy
  • Tech & Science
  • Sports
  • Entertainment
  • More
    • Education
    • Celebrities
    • Culture and Arts
    • Environment
    • Health and Wellness
    • Lifestyle
  • 🔥
  • Trump
  • House
  • White
  • ScienceAlert
  • VIDEO
  • man
  • Trumps
  • Season
  • star
  • Years
Font ResizerAa
American FocusAmerican Focus
Search
  • World
  • Politics
  • Crime
  • Economy
  • Tech & Science
  • Sports
  • Entertainment
  • More
    • Education
    • Celebrities
    • Culture and Arts
    • Environment
    • Health and Wellness
    • Lifestyle
Follow US
© 2024 americanfocus.online – All Rights Reserved.
American Focus > Blog > Tech and Science > AI chatbots fail to diagnose patients by talking with them
Tech and Science

AI chatbots fail to diagnose patients by talking with them

Last updated: January 2, 2025 10:31 am
Share
AI chatbots fail to diagnose patients by talking with them
SHARE

Don’t call your favourite AI “doctor” just yet

Just_Super/Getty Images

Artificial intelligence has made significant advancements in various fields, including medicine. However, a recent study has revealed that while AI models excel in professional medical exams, they struggle with one crucial aspect of being a physician – interacting with patients to gather medical information and provide accurate diagnoses.

Research conducted by Pranav Rajpurkar and Shreya Johri at Harvard University introduced a new evaluation benchmark called CRAFT-MD, which assesses clinical AI models’ reasoning abilities through simulated doctor-patient conversations. These conversations were based on 2000 medical cases from US medical board exams, replicating real-life scenarios where patients may not disclose crucial information unless prompted.

The study utilized OpenAI’s GPT-4 model as a “patient AI” interacting with the clinical AI being tested. Results showed that leading AI models such as GPT-3.5, GPT-4, Meta’s Llama-2-7b, and Mistral AI’s Mistral-v2-7b performed significantly worse in conversation-based diagnostics compared to written case summaries.

For instance, GPT-4’s diagnostic accuracy dropped from 82% with structured summaries to 26% in patient conversations. Despite being the top performer, GPT-4 only gathered complete medical histories in 71% of simulated conversations and did not always provide accurate diagnoses.

Eric Topol from the Scripps Research Translational Institute noted that evaluating AI’s clinical reasoning through patient conversations is a more practical approach than traditional exams. However, Rajpurkar emphasized that AI’s success in simulated scenarios does not equate to surpassing human physicians due to the complexities of real-world medical practice.

While AI shows promise in supporting clinical work, it is not a substitute for the holistic judgement and experience of human doctors. The study underscores the importance of ongoing research to enhance AI’s capabilities while acknowledging the irreplaceable role of human healthcare providers.

See also  Let’s explore the best alternatives to Discord

Topics:

TAGGED:chatbotsdiagnoseFailpatientstalking
Share This Article
Twitter Email Copy Link Print
Previous Article A “Both Sides” Approach to Orientalism A “Both Sides” Approach to Orientalism
Next Article How To Draw a Snowman (Free Printable + Video) How To Draw a Snowman (Free Printable + Video)
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *


The reCAPTCHA verification period has expired. Please reload the page.

Popular Posts

OxygenOS 16 release date confirmed – Will your OnePlus phone get it?

Image: Foundry | Alex Walker-Todd Summary Launch date for OxygenOS 16 announced by OnePlus Starting…

October 7, 2025

Roberto Cavalli Resort 2026 Collection

Fausto Puglisi's Colorful and Joyful Fall 2025 Collection for Roberto Cavalli Tyla and Carla Bruni…

May 27, 2025

Should You Buy the 3 Highest-Paying Dividend Stocks in the Nasdaq?

High-Yielding Stocks to Watch: PepsiCo, Comcast, and Kraft Heinz In the current financial landscape, investors…

September 22, 2025

Isaiah Stokes, actor who appeared on ‘Law & Order,’ convicted of revenge-fueled NYC murder: DA

Isaiah Stokes, a well-known actor who has made appearances on popular TV shows like “Law…

March 10, 2025

Samsung Galaxy A37 Review: Samey but Solid

At a glance Expert's Rating Pros Large, colorful screen Six years of software updates Solid…

May 12, 2026

You Might Also Like

Startup Battlefield Australia application closes in days: Apply before July 6
Tech and Science

Startup Battlefield Australia application closes in days: Apply before July 6

June 30, 2026
This Chernobyl Fungus Seems to Have Evolved an Incredible Ability : ScienceAlert
Tech and Science

This Chernobyl Fungus Seems to Have Evolved an Incredible Ability : ScienceAlert

June 30, 2026
The attack that hijacked Claude Code came through Sentry. Datadog, PagerDuty, and Jira have the same exposure.
Tech and Science

The attack that hijacked Claude Code came through Sentry. Datadog, PagerDuty, and Jira have the same exposure.

June 30, 2026
Chaotic pigeons are helping redefine what we know about learning
Tech and Science

Chaotic pigeons are helping redefine what we know about learning

June 30, 2026
logo logo
Facebook Twitter Youtube

About US


Explore global affairs, political insights, and linguistic origins. Stay informed with our comprehensive coverage of world news, politics, and Lifestyle.

Top Categories
  • Crime
  • Environment
  • Sports
  • Tech and Science
Usefull Links
  • Contact
  • Privacy Policy
  • Terms & Conditions
  • DMCA

© 2024 americanfocus.online –  All Rights Reserved.

Welcome Back!

Sign in to your account

Lost your password?