OpenAI improves inference skills and upgrades smartest AI models

OpenAI today announced an improved version of its most capable artificial intelligence model to date (one that takes more time to think through questions), just one day after Google announced the first model of its type.

OpenAI’s new model is called o3 and replaces o1, which the company introduced in September. Similar to o1, the new model spends time thinking through problems to provide better answers to questions that require step-by-step logical reasoning. (OpenAI chose to abbreviate the designation “o2” because it is already the name of a UK mobile phone company.)

“We see this as the beginning of the next phase of AI,” OpenAI CEO Sam Altman said in a livestream Friday. “These models can then be used to perform increasingly complex tasks that require a lot of inference.”

According to OpenAI, the o3 model scores much higher than previous models on several measures, including those measuring complex coding-related skills and advanced math and science abilities. ARC-AGI is a benchmark designed to test the ability of AI models to reason about the most difficult mathematical and logical problems encountered for the first time.

Google is also conducting similar research. In a post about X yesterday, Google researcher Noam Shazeer revealed that the company has developed its own inference model called Gemini 2.0 Flash Thinking. Google CEO Sundar Pichai called it “the most thoughtful model yet” in a post of his own. Google’s new model achieved a high score on SWE-Bench, a test that measures a model’s agential capabilities.

However, OpenAI’s new o3 model is 20% better than o1. “o3 blew it out of the water,” says Ofir Press, a postdoctoral fellow at Princeton University who helped develop SWE-Bench. “It’s a very surprising increase, but we don’t know how it happened.”

The two head-to-head models show that the competition between OpenAI and Google is fiercer than ever. It is important for OpenAI to demonstrate that it can continue to make progress in order to attract more investment and build a profitable business. Meanwhile, Google is desperate to show that it remains at the forefront of AI research.

The new model also shows that AI companies are increasingly looking beyond just scaling up AI models to squeeze better intelligence out of them.

Source link

What's Hot

I’ve seen all the Marvel movies. Here’s how to save your MCU

London Stock Exchange Group share price rises as PISCES debut nears and financial results approach

Indian Americans largely disapprove of Trump’s first-year performance, but Democrats aren’t benefiting: Survey

OpenAI improves inference skills and upgrades smartest AI models

D Street Massacre, Humanity Milestones, Bangladesh Election Results, PMO Shift, and More

A smarter way for AI to understand text and images

Surprisingly Tough Competition for Meta’s Ray-Ban

20 Most Anticipated Sex Movies of 2025

How to tell the difference between fake and genuine Adidas Sambas

President Trump’s SEC nominee Paul Atkins marries multi-billion dollar roof fortune

Alice Munro’s Passive Voice | New Yorker

D Street Massacre, Humanity Milestones, Bangladesh Election Results, PMO Shift, and More

A smarter way for AI to understand text and images

Surprisingly Tough Competition for Meta’s Ray-Ban

How AI assistance impacts the formation of coding skills \ Anthropic

Our Picks

I’ve seen all the Marvel movies. Here’s how to save your MCU

London Stock Exchange Group share price rises as PISCES debut nears and financial results approach

Indian Americans largely disapprove of Trump’s first-year performance, but Democrats aren’t benefiting: Survey

Most Popular

Anthropic agrees to work with music publishers to prevent copyright infringement

chatgpt makers claim data breach claims “seriously”

Everything you need to know

Subscribe to Updates

What's Hot

OpenAI improves inference skills and upgrades smartest AI models

Related Posts