OpenAI HealthBench Dataset

News

23h

OpenAI releases HealthBench dataset to test AI in health care

OpenAI has unveiled a large dataset to help test how well artificial intelligence (AI) models answer health care questions.

CNET on MSN1d

OpenAI Launches HealthBench, a Dataset That Benchmarks Health Care AI Models

OpenAI, the creator of artificial intelligence chatbot ChatGPT, has a new open-source large language model called HealthBench ...

NewsBytes1d

OpenAI's new dataset evaluates how well AI answers medical questions

OpenAI has launched HealthBench, a comprehensive dataset to assess the performance of AI models in answering health-related ...

Fierce Healthcare1d

OpenAI pushes further into healthcare with release of HealthBench to evaluate AI models

OpenAI, the maker of ChatGPT, released an open-source benchmark designed to measure the performance and safety of large ...

1don MSN

OpenAI’s HealthBench reveals how well AI answers medical questions

OpenAI has launched HealthBench, a new dataset designed to test how accurately AI models respond to real-world health care ...

Becker's Hospital Review1d

OpenAI dives into healthcare: 6 things to know

OpenAI, the company behind ChatGPT, has introduced a new evaluation framework to assess how artificial intelligence systems perform in healthcare settings. Here are six things to know about the new ...

OpenAI's HealthBench shows AI's medical advice is improving - but who will listen?

The HealthBench test can't possibly tell us the critical factor: How humans would respond to chatbots under real-world ...

TechCrunch27d

OpenAI’s latest AI models have a new safeguard to prevent biorisks

OpenAI says that it deployed a new system to monitor its latest AI reasoning models, o3 and o4-mini, for prompts related to biological and chemical threats. The system aims to prevent the models ...

Engadget1mon

OpenAI is phasing out GPT-4.5 for developers

OpenAI has announced its phasing out GPT-4.5 from its developer API in favor of its new GPT-4.1 model. When it launched, OpenAI described GPT-4.5 as its best and most capable model so far, in part ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results