On September 25, 2025, OpenAI dropped a bombshell: its latest model, GPT-5, now “stacks up to humans in a wide range of jobs.”
A new test from OpenAI aims to understand how close AI is to outperforming humans at economically valuable work.
6h on MSN
Claude just beat GPT-5, Gemini, and Grok in real-world job tasks, according to OpenAI’s own study
Surprisingly, the OpenAI study shows that the best-performing model was Anthropic’s Claude Opus 4.1, which outpaced not only ...
According to OpenAI, the tasks were created by professionals with an average of 14 years of experience in relevant fields to reflect "real work products, such as a legal brief, an engineering ...