Math Models Projects - Search News

News

OpenAI Model Earns Gold-Medal Score at International Math Olympiad and Advances Path to Artificial General Intelligence

A few months before the 2025 International Mathematical Olympiad (IMO) in July, a three-person team at OpenAI made a long bet ...

Scientific American5d

Can Writing Math Proofs Teach AI to Reason Like Humans?

OpenAI researchers reveal how their experimental model, devoid of any external aids, powered through hours-long proofs to earn a gold-medal score at the International Math Olympiad—and they discuss th ...

TechRepublic1mon

OpenAI Model Wins Gold at International Mathematical Olympiad – or ...

OpenAI’s latest model has achieved a gold-level score at the 2025 International Mathematical Olympiad. It answered five out of the six questions under exam conditions, scoring 35 out of a ...

Ars Technica9mon

New secret math benchmark stumps AI models and PhDs alike

secret math problems dept. New secret math benchmark stumps AI models and PhDs alike FrontierMath's difficult questions remain unpublished so that AI companies can't train against it.

VentureBeat4y

Researchers find that large language models struggle with math

To measure the problem-solving ability of large and general-purpose language models, the researchers created a dataset called MATH, which consists of 12,500 problems taken from high school math ...

Hosted on MSN7mon

Microsoft says 'rStar-Math' demonstrates how small language models ...

Microsoft has potentially made a breakthrough with small language models (SLMs) after the recent development of a new reasoning technique dubbed rStar-Math. For context, the technique enhances the ...

The Atlantic10mon

We’re Entering Uncharted Territory for Math - The Atlantic

Terence Tao, a mathematics professor at UCLA, is a real-life superintelligence. The “Mozart of Math,” as he is sometimes called, is widely considered the world’s greatest living mathematician.

TechCrunch3mon

DeepSeek upgrades its math-focused AI model Prover

The company recently released an upgraded version of V3, a general-purpose model, and is expected to update its R1 “reasoning” model soon. Topics AI, deepseek, In Brief October 27-29, 2025 ...

Forbes3mon

Big Models, Bad Math: The GenAI Problem In Finance - Forbes

The reason for this is fundamental: ChatGPT, and many models like it, can't actually do math. They rely on sophisticated pattern recognition and statistical memory, not true mathematical computation.

The Real Deal1y

ED1 projects multiply in LA as developers question the math

While the number of ED1 projects is poised to accelerate this year, a growing number of developers and advocates question the math behind it.

Results that may be inaccessible to you are currently showing.

Hide inaccessible results