Karachi Chronicle
What is S1, the OpenAI o1 rival AI model trained for under $50? | Technology News

By Adnan Mahar | February 10, 2025 | 3 Mins Read

In January, the world watched Chinese AI startup DeepSeek shake up the industry with its cost-effective, cutting-edge AI models. The company released two models that rival the performance of frontier models from OpenAI and Google: DeepSeek-V3 and DeepSeek-R1. DeepSeek has paved the way for more frugal innovation in AI. Now, a new model has sparked the curiosity of the AI community. Researchers from Stanford University and the University of Washington trained a reasoning model named S1 for under $50 (approximately Rs 4,400) in cloud computing credits.

What is S1?

According to the research paper, S1-32B is an open-source language model focused on reasoning tasks. What sets it apart from other AI models is "test-time scaling", a technique that lets the model iteratively refine its responses by spending additional compute at test time. S1 reportedly competes directly with OpenAI's o1 reasoning model. It also generates answers to prompts by working through related questions, which lets it check its own answers. This method differs from traditional approaches that rely solely on training large language models in advance.
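
To make the idea of spending extra compute at test time concrete, here is a minimal Python sketch. It is a toy under stated assumptions, not the S1 implementation: `toy_model`, the `</think>` stop marker, the `Wait` cue, and the step-based budget are all hypothetical stand-ins chosen for illustration. The loop simply refuses to let the model stop reasoning until a minimum budget has been spent, and cuts it off at a maximum.

```python
from typing import Callable, List

END_OF_THINKING = "</think>"  # hypothetical end-of-reasoning marker
CONTINUE_CUE = "Wait"         # hypothetical cue that nudges the model to keep reasoning


def reason_with_budget(
    step_fn: Callable[[List[str]], str],
    min_steps: int,
    max_steps: int,
) -> List[str]:
    """Collect trace entries, spending at least min_steps and at most max_steps.

    The budget counts every trace entry, including inserted cues.
    """
    trace: List[str] = []
    while len(trace) < max_steps:
        step = step_fn(trace)
        if step == END_OF_THINKING:
            if len(trace) >= min_steps:
                break  # minimum budget spent: allow the model to stop
            trace.append(CONTINUE_CUE)  # too early: drop the stop and nudge it onward
            continue
        trace.append(step)
    return trace


def toy_model(trace: List[str]) -> str:
    """Stand-in for a real model: tries to stop after every two reasoning steps."""
    real_steps = [s for s in trace if s != CONTINUE_CUE]
    if trace and trace[-1] != CONTINUE_CUE and len(real_steps) % 2 == 0:
        return END_OF_THINKING
    return f"step {len(real_steps) + 1}"


print(reason_with_budget(toy_model, min_steps=0, max_steps=8))
# → ['step 1', 'step 2']
print(reason_with_budget(toy_model, min_steps=5, max_steps=8))
# → ['step 1', 'step 2', 'Wait', 'step 3', 'step 4']
```

Raising `min_steps` forces a longer trace from the same model, which is the sense in which accuracy can be traded against compute at test time.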

For example, if you prompt the model to explain how much it would cost to replace an iPhone with an Android tablet, it breaks the question down into several steps.

How was it trained?

The S1 model was trained on a curated, high-quality dataset named S1K, made up of 1,000 questions. These questions were selected based on difficulty, diversity and quality. The dataset includes complex mathematics, reasoning and science problems. Another important aspect of the model's development is supervised fine-tuning (SFT) on this small dataset. According to the research paper, the SFT required 26 minutes of training on 16 Nvidia H100 GPUs. Despite the small dataset size, S1 achieved high reasoning accuracy by drawing on the knowledge built into its pre-trained base model, Qwen2.5-32B-Instruct.
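
The training figures above translate into a small amount of compute. The following Python sketch works through the arithmetic. The article does not state a GPU rental price, so the code only derives the hourly rate at which the run would exactly use up a $50 budget, under the assumption that the whole budget went to this single fine-tuning run.

```python
TRAINING_MINUTES = 26  # reported SFT duration
NUM_GPUS = 16          # reported number of Nvidia H100 GPUs
BUDGET_USD = 50.0      # reported upper bound on cloud credits


def gpu_hours(minutes: float, gpus: int) -> float:
    """Total GPU-hours consumed by a run of the given length on the given GPU count."""
    return minutes / 60.0 * gpus


def break_even_rate(budget_usd: float, hours: float) -> float:
    """Hourly price per GPU at which the run would exactly exhaust the budget."""
    return budget_usd / hours


hours = gpu_hours(TRAINING_MINUTES, NUM_GPUS)
rate = break_even_rate(BUDGET_USD, hours)

print(f"{hours:.2f} GPU-hours")            # → 6.93 GPU-hours
print(f"${rate:.2f} per GPU-hour ceiling")  # → $7.21 per GPU-hour ceiling
```

In other words, the run fits within $50 as long as an H100 can be rented for less than roughly $7.21 per hour.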

S1 builds on an off-the-shelf language model that was taught to reason by studying questions and answers from Google's Gemini 2.0 Flash Thinking Experimental. The Google model shows the thinking behind each answer, which allowed the S1 developers to give their model a relatively small amount of training data. In essence, they taught S1 to mimic Gemini's thinking process.
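
As a rough illustration of what learning to mimic a teacher's thinking can look like in data terms, the Python sketch below turns a question, a teacher-written reasoning trace, and a final answer into one prompt/completion pair for supervised fine-tuning. The article does not describe the actual data format, so the `DistilledExample` fields, the `<think>` delimiters, and the record layout are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict

THINK_OPEN = "<think>"    # hypothetical delimiter marking the start of the reasoning trace
THINK_CLOSE = "</think>"  # hypothetical delimiter marking the end of the reasoning trace


@dataclass
class DistilledExample:
    question: str
    reasoning: str  # thinking trace produced by the teacher model
    answer: str     # final answer produced by the teacher model


def to_sft_record(example: DistilledExample) -> Dict[str, str]:
    """Build one prompt/completion pair in which the student reproduces trace, then answer."""
    completion = (
        f"{THINK_OPEN}\n{example.reasoning.strip()}\n{THINK_CLOSE}\n"
        f"{example.answer.strip()}"
    )
    return {"prompt": example.question.strip(), "completion": completion}


record = to_sft_record(
    DistilledExample(
        question="What is 12 * 13?",
        reasoning="12 * 13 = 12 * 10 + 12 * 3 = 120 + 36 = 156.",
        answer="156",
    )
)
print(record["completion"])
```

Because the completion contains the reasoning as well as the answer, fine-tuning on such records rewards the student for reproducing the intermediate steps, not just the final result.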

In terms of performance, S1 was evaluated on three reasoning benchmarks: AIME24, MATH500, and GPQA Diamond. During testing, the model showed a significant improvement in accuracy, surpassing OpenAI's closed-source model, o1-preview. The S1 model showed performance improvements of up to 27% on competition math problems. While previous models required reinforcement learning and large datasets, S1-32B showed that effective training on only 1,000 samples can produce a competitive reasoning model.

What does it mean for AI?

The S1 model demonstrates the importance of transparency and open-source contributions in AI development. With S1's development process now publicly available, the researchers hope for more collaboration and innovation in the field. They also highlight the need to overcome the limitations of test-time scaling, to explore alternative approaches to budgeting test-time compute, and to further strengthen reasoning capabilities by applying reinforcement learning techniques.

In short, S1 is a groundbreaking model that brings together efficient training, innovative test-time scaling, and open-source principles.

©IE Online Media Services Pvt Ltd
