After Launching ChatGPT Health, OpenAI Surpassed by Competitors on Its Own Medical Benchmark

David Kim

OpenAI Faces Competition in Healthcare AI

Chinese company Baichuan Intelligence has surpassed OpenAI’s latest model on OpenAI’s own medical benchmark, just days after OpenAI released ChatGPT Health.

Recent Developments

On January 7, 2026, OpenAI launched ChatGPT Health, a tool that lets users connect electronic health records to receive tailored medical advice. Just five days later, Baichuan announced its M3 model, which scored 65.1 on OpenAI’s HealthBench, making it the highest-rated model in the world on that benchmark. The result puts M3 ahead of OpenAI’s GPT-5.2 High model.

About HealthBench

HealthBench, released by OpenAI in May 2025, was designed to be the authoritative benchmark for evaluating AI in the medical field. It features 5,000 realistic multi-turn medical dialogues compiled by 262 doctors from around the world. Baichuan’s M3 model not only topped the overall score but also excelled in the HealthBench Hard evaluation, which focuses on complex decision-making capabilities.

M3 Model Features and Innovations

  • Low Hallucination Rate: The M3 model demonstrated a hallucination rate of just 3.5%, which is among the lowest globally. The figure is notable because the model achieved it without relying on external search tools, which points to the accuracy of the model itself.
  • Fact Aware Reinforcement Learning: Baichuan incorporated a unique Fact Aware RL technique that minimizes misleading information while enhancing the model’s medical knowledge.
  • SCAN-bench Assessment: Baichuan also created a SCAN-bench evaluation set to further optimize the model’s ability to diagnose and retrieve relevant patient information through dynamic multi-turn interactions.

Implications for the Future of AI in Healthcare

Baichuan aims to improve patient care by addressing critical gaps in lower-tier medical services, a need identified in China’s healthcare reforms. By using AI, the company hopes to distribute top-tier medical knowledge widely, giving communities access to expertise traditionally concentrated in large hospitals.

Moving forward, Baichuan’s focus will be on more complex medical issues, such as oncology, rather than areas perceived as easier, like psychological therapy. This decision stems from their belief that AI can provide more reliable outcomes in fields with stringent scientific foundations.

The success of the M3 model illustrates a significant leap toward integrating AI into serious medical practice. Baichuan’s commitment to resolving high-stakes medical issues suggests a future where AI can not only assist but potentially outperform human specialists in various scenarios.


Editorial note: Cozy Corner Daily summarizes news based on available reporting and updates stories as new details emerge.


David explores the intersection of technology, culture, and digital behavior. With an academic background in political science and digital policy, he writes about how emerging tools and platforms shape real-world habits and societal change. Before joining Cozy Corner Daily, David worked in public sector research and contributed to tech ethics publications. His editorial style is thoughtful and grounded, always focused on relevance over hype.