OpenAI HealthBench Dataset

News

23h

OpenAI releases HealthBench dataset to test AI in health care

OpenAI has unveiled a large dataset to help test how well artificial intelligence (AI) models answer health care questions.

CNET on MSN1d

OpenAI Launches HealthBench, a Dataset That Benchmarks Health Care AI Models

OpenAI, the creator of artificial intelligence chatbot ChatGPT, has a new open-source benchmark dataset for large language models called HealthBench ...

NewsBytes1d

OpenAI's new dataset evaluates how well AI answers medical questions

OpenAI has launched HealthBench, a comprehensive dataset to assess the performance of AI models in answering health-related ...

on MSN1d

OpenAI’s HealthBench reveals how well AI answers medical questions

OpenAI has launched HealthBench, a new dataset designed to test how accurately AI models respond to real-world health care ...

The Laconia Daily Sun1d

OpenAI Releases HealthBench Dataset to Test AI in Health Care

Experts call it a major step forward, but they also say more work is needed to ensure safety. The dataset — called HealthBench — is OpenAI's first major independent health care project. It includes ...

STAT2d

OpenAI leaps into health care with AI benchmark to evaluate models

OpenAI on Monday released a large dataset for evaluating how well large ... calling them “unprecedented” in scale and breadth. The project, HealthBench, marks OpenAI’s first foray into ...

marktechpost2d

OpenAI Releases HealthBench: An Open-Source Benchmark for Measuring the Performance and Safety of Large Language Models in Healthcare

OpenAI has released HealthBench, an open-source evaluation framework designed to measure the performance and safety of large language models (LLMs) in realistic healthcare scenarios. Developed in ...

thetechportal.com23d

Third-party tests show OpenAI’s o3 under-delivers

OpenAI had first introduced its o3 reasoning model in December, promoting it as having strong mathematical reasoning capabilities, especially when evaluated on benchmark datasets such as FrontierMath.

Neowin29d

OpenAI announces GPT-4.1, its "smartest model for complex tasks"

OpenAI has announced three models: GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano. These models come with massive context windows of up to 1 million tokens and a knowledge cutoff of June 2024. The ...

SiliconRepublic21d

Washington Post and OpenAI ink content-sharing deal

The Jeff Bezos-owned publication The Washington Post has signed a deal with OpenAI to allow its news to be used by ChatGPT. As part of the deal, ChatGPT will display “summaries, quotes and links ...
