New research investigates how large language models perform across a range of medical scenarios, including real emergency room cases, where at least one model appears to be more accurate than human doctors.
The study, published this week in the journal Science, is the work of a research team led by doctors and computer scientists from Harvard Medical School and Beth Israel Deaconess Medical Center. The researchers conducted a series of experiments to measure how OpenAI’s models compared to human doctors.
In one experiment, the researchers focused on 76 patients who presented at Beth Israel’s emergency room and compared the diagnoses provided by two attending physicians with those generated by OpenAI’s o1 and 4o models. These diagnoses were then evaluated by two other primary care physicians, who were not told which diagnoses came from humans and which came from the AI models.
“At each diagnostic touchpoint, o1 performed nominally better than or equal to two primary care physicians and 4o,” the study said, adding that the difference was “particularly pronounced at the first diagnostic touchpoint (early ER triage), when the least information is available about the patient and making the right decision is most urgent.”
In a press release from Harvard Medical School about the study, the researchers emphasized that “no data preprocessing was performed.” The AI model was presented with the same information that was available in the electronic medical record at the time of each diagnosis.
Armed with that information, the o1 model was able to provide “accurate or very close diagnoses” in 67% of triage cases. Meanwhile, one doctor was correct or very close to the diagnosis 55% of the time, and the other doctor was right 50% of the time.
“We tested our AI model against nearly every benchmark, and it outperformed both previous models and physician baselines,” Arjun Manraj, director of the AI Lab at Harvard Medical School and one of the study’s lead authors, said in a press release.
To be clear, this study does not claim that AI is ready to make real life-or-death decisions in emergency rooms. Instead, it said the findings demonstrate “an urgent need for prospective clinical trials to evaluate these technologies in real-world patient care settings.”
The researchers also noted that they only studied how the model behaves when given text-based information, and that existing research suggests current foundation models are more limited in their ability to reason over non-text inputs.
Beth Israel physician Adam Rodman, one of the study’s lead authors, told the Guardian that “there is currently no formal accountability framework” for AI diagnostics, and that patients still “want humans to guide them in life-and-death decisions and guide them through difficult treatment decisions.”
