News
OpenAI has unveiled a large dataset to help test how well artificial intelligence (AI) models answer health care questions.
OpenAI, the creator of artificial intelligence chatbot ChatGPT, has a new open-source large language model called HealthBench ...
OpenAI has launched HealthBench, a comprehensive dataset to assess the performance of AI models in answering health-related ...
OpenAI has launched HealthBench, a new dataset designed to test how accurately AI models respond to real-world health care ...
Experts call it a major step forward, but they also say more work is needed to ensure safety. The dataset — called HealthBench — is OpenAI's first major independent health care project. It includes ...
OpenAI on Monday released a large dataset for evaluating how well large ... calling them “unprecedented” in scale and breadth. The project, HealthBench, marks OpenAI’s first foray into ...
OpenAI has released HealthBench, an open-source evaluation framework designed to measure the performance and safety of large language models (LLMs) in realistic healthcare scenarios. Developed in ...
OpenAI had first introduced its o3 reasoning model in December, promoting it as having strong mathematical reasoning capabilities, especially when evaluated on benchmark datasets such as FrontierMath.
OpenAI has announced three models: GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano. These models come with massive context windows of up to 1 million tokens and a knowledge cutoff of June 2024. The ...
The Jeff Bezos-owned publication The Washington Post has signed a deal with OpenAI to allow its news to be used by ChatGPT. As part of the deal, ChatGPT will display “summaries, quotes and links ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results