Close Menu
Karachi Chronicle
  • Home
  • AI
  • Business
  • Entertainment
  • Fashion
  • Politics
  • Sports
  • Tech
  • World

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

What's Hot

The world’s largest air force with the F-35 fleet in 2025

AI systems learn from many types of scientific information and run experiments to discover new materials | MIT News

Among the most troublesome relationships in healthcare AI

Facebook X (Twitter) Instagram
  • Home
  • About us
  • Advertise
  • Contact us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
Facebook X (Twitter) Instagram Pinterest Vimeo
Karachi Chronicle
  • Home
  • AI
  • Business
  • Entertainment
  • Fashion
  • Politics
  • Sports
  • Tech
  • World
Karachi Chronicle
You are at:Home » Researchers have created an open rival for Openai’s O1 “inference” model for under $50
AI

Researchers have created an open rival for Openai’s O1 “inference” model for under $50

Adnan MaharBy Adnan MaharFebruary 5, 2025No Comments4 Mins Read0 Views
Facebook Twitter Pinterest Telegram LinkedIn Tumblr Email Reddit
Share
Facebook Twitter LinkedIn Pinterest WhatsApp Email


AI researchers at Stanford and Washington University were able to train AI “inference” models under $50 in cloud calculation credits, according to a new research paper published last Friday.

Known as S1, the model is a test that measures mathematics and coding abilities and works similarly to cutting-edge inference models such as Openai’s O1 and Deepseek’s R1. The S1 model is available on GitHub and the data and code used to train it is available.

The team behind the S1 said they started with a ready-made base model and then tweaked it through distillation. This is the process of extracting “inference” functionality from another AI model by training its answer.

Researchers said S1 is distilled from Gemini 2.0 Flash Thinking Experimental, one of Google’s inference models. Distillation is the same approach that Berkeley researchers used last month to create an AI inference model that costs around $450.

For some, the idea that researchers with millions of dollars behind them can still innovate in the AI ​​field is exciting. However, S1 raises real questions about commoditizing AI models.

Where is the moat if someone can meticulously replicate a multi-million dollar model with relative pocket changes?

Naturally, Big AI Labs is not satisfied. Openai accuss Deepseek of improperly harvesting data from the API for model distillation purposes.

The researchers behind S1 were trying to find the simplest approach to achieving strong inference performance and “test time scaling”. These were some of Openai’s O1 breakthroughs, and Deepseek and other AI labs have attempted to replicate them through a variety of technologies.

The S1 paper uses a process called Monitored Fine Tuning (SFT), which explicitly directs the inference model to mimic specific behaviors within a DataSet, using a process called Monitored Fine Tuning (SFT), which allows for relatively small datasets. It suggests that it can be distilled.

SFTs tend to be cheaper than the large-scale reinforcement learning methods Deepseek adopted to train competitors on Openai’s O1 model, R1.

Google has daily rate limits via the Google AI Studio Platform, but you can access Gemini 2.0 Flash Thinking Experimentyal for free.

However, Google’s terminology prohibits the model from reverse engineering to develop services that compete with the company’s own AI services. I contacted Google for comment.

The S1 is based on a small, ready-made AI model from the Chinese AI Lab Qwen, owned by Alibaba, and is free to download. To train S1, researchers combine with answers to these questions, as well as the “thinking” process behind each answer in Google’s Gemini 2.0 Flash Thinking Experimential answer, with just 1,000 answers. We created a dataset of carefully curated questions.

According to the researchers, the S1 achieved strong performance on certain AI benchmarks after receiving less than 30 minutes of training using a 16 NVIDIA H100 GPU. Niklas Muenenhuhu, a researcher at Stanford University who worked on the project, told TechCrunch that he could borrow the calculations he needed today for $20.

Using a clever trick, the researchers reaffirmed the work with the S1 and extended its “thinking” time. They told me to wait for it. Adding the word “wait” during S1 reasoning allowed the model to arrive at a slightly more accurate answer according to the paper.

In 2025, Meta, Google and Microsoft are planning to invest hundreds of billions of dollars in AI infrastructure, partially used to train next-generation AI models.

That level of investment may still be necessary to drive the envelope of AI innovation. Distillation has been shown to be a good way to recreate the functionality of AI models at a low cost, but it does not create new AI models that are far better than what is available today.



Source link

Share. Facebook Twitter Pinterest LinkedIn Reddit WhatsApp Telegram Email
Previous ArticleApple suddenly checks “invitation”, the new iPhone app is currently live
Next Article Indian media is about to take part in a lawsuit against Openai chatbot
Adnan Mahar
  • Website

Adnan is a passionate doctor from Pakistan with a keen interest in exploring the world of politics, sports, and international affairs. As an avid reader and lifelong learner, he is deeply committed to sharing insights, perspectives, and thought-provoking ideas. His journey combines a love for knowledge with an analytical approach to current events, aiming to inspire meaningful conversations and broaden understanding across a wide range of topics.

Related Posts

AI systems learn from many types of scientific information and run experiments to discover new materials | MIT News

September 25, 2025

Among the most troublesome relationships in healthcare AI

September 25, 2025

Does access to AI become a fundamental human right? Sam Altman says, “Everyone would want…”

September 23, 2025
Leave A Reply Cancel Reply

Top Posts

20 Most Anticipated Sex Movies of 2025

January 22, 2025456 Views

President Trump’s SEC nominee Paul Atkins marries multi-billion dollar roof fortune

December 14, 2024122 Views

How to tell the difference between fake and genuine Adidas Sambas

December 26, 202486 Views

Alice Munro’s Passive Voice | New Yorker

December 23, 202474 Views
Don't Miss
AI September 25, 2025

AI systems learn from many types of scientific information and run experiments to discover new materials | MIT News

Machine learning models can speed up discovery of new materials by making predictions and proposing…

Among the most troublesome relationships in healthcare AI

Does access to AI become a fundamental human right? Sam Altman says, “Everyone would want…”

Google’s Gemini AI is on TV

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

About Us
About Us

Welcome to Karachi Chronicle, your go-to source for the latest and most insightful updates across a range of topics that matter most in today’s fast-paced world. We are dedicated to delivering timely, accurate, and engaging content that covers a variety of subjects including Sports, Politics, World Affairs, Entertainment, and the ever-evolving field of Artificial Intelligence.

Facebook X (Twitter) Pinterest YouTube WhatsApp
Our Picks

The world’s largest air force with the F-35 fleet in 2025

AI systems learn from many types of scientific information and run experiments to discover new materials | MIT News

Among the most troublesome relationships in healthcare AI

Most Popular

10 things you should never say to an AI chatbot

November 10, 20040 Views

Character.AI faces lawsuit over child safety concerns

December 12, 20050 Views

Analyst warns Salesforce investors about AI agent optimism

July 1, 20070 Views
© 2025 karachichronicle. Designed by karachichronicle.
  • Home
  • About us
  • Advertise
  • Contact us
  • DMCA
  • Privacy Policy
  • Terms & Conditions

Type above and press Enter to search. Press Esc to cancel.