Race to produce the cheapest top performance Artificial Intelligence (AI) The model is heated with a new inference model from US computer scientists, including the famous Chinese-American “AI Godmother” Li Feifei. Alibaba Group HoldingsFollowing China’s groundbreaking success, open source technology deepseek.
The S1 inference model was developed on the QWEN2.5-32B-Instruct model of China’s e-commerce giant, by researchers from Stanford University where Li works and researchers from Washington University, according to a research paper published last week. I did.
The Alibaba model’s capabilities are fresh evidence of how China narrows down the AI gap with major US players. It attracted global attention. Hong Kong listed its shares in Alibaba, the owner of the South China Morning Post, winning 6% on Monday.
After being trained with 1,000 carefully curated questions and answers to distilled “thinking process” from Google’s Gemini thought experimental model, the S1 model outperformed Openai’s O1-Preview on mathematics and programming skills .
05:00
Does China’s low-cost deepseek arrival mean the end of Nvidia’s chip advantage?
Does China’s low-cost deepseek arrival mean the end of Nvidia’s chip advantage?
The cost of running only a Graphic Processing Unit (GPU) to develop an S1 could be $14 USD based on the calculations described in the study. These tips can be rented for $2 per hour.
Source link