News
OpenAI has unveiled a large dataset to help test how well artificial intelligence (AI) models answer health care questions.
OpenAI, the creator of the artificial intelligence chatbot ChatGPT, has released a new open-source benchmark called HealthBench ...
OpenAI, the maker of ChatGPT, released an open-source benchmark designed to measure the performance and safety of large ...
OpenAI has launched HealthBench, a comprehensive dataset to assess the performance of AI models in answering health-related ...
OpenAI has launched HealthBench, a new dataset designed to test how accurately AI models respond to real-world health care ...
OpenAI, the company behind ChatGPT, has introduced a new evaluation framework to assess how artificial intelligence systems perform in healthcare settings. Here are six things to know about the new ...
The HealthBench test can't possibly tell us the critical factor: How humans would respond to chatbots under real-world ...
AI models seem to be dropping left, right and center right now, with the latest coming from OpenAI. No, they haven’t finally released GPT-5; instead, they’ve complicated the naming system even ...
OpenAI says that it deployed a new system to monitor its latest AI reasoning models, o3 and o4-mini, for prompts related to biological and chemical threats. The system aims to prevent the models ...
OpenAI has announced it's phasing out GPT-4.5 from its developer API in favor of its new GPT-4.1 model. When it launched, OpenAI described GPT-4.5 as its best and most capable model so far, in part ...