By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
Pratzo - Daily NewsPratzo - Daily NewsPratzo - Daily News
Notification Show More
Font ResizerAa
  • Technology
    • AI & Machine Learning
    • Software & Apps
    • Hardware & Gadgets
    Technology
    Show More
    Top News
    Oppo Find X8 Ultra Key Specifications Revealed; Snapdragon 8 Elite SoC, 6,100mAh Battery Confirmed
    April 9, 2025
    Xiaomi X Pro QLED (2025) First Impressions
    April 10, 2025
    CMF Buds 2 Price, Design and Specifications Leaked Ahead of April 28 Launch
    April 11, 2025
    Latest News
    Red Magic Astra Gaming Tablet Launched With Snapdragon 8 Elite SoC, 8,200mAh Battery
    July 2, 2025
    Samsung Galaxy Z Flip 7 FE Name Appears in Alleged Third-Party Case Listing Alongside Galaxy Z Flip 7
    July 2, 2025
    Poco F7 5G Confirmed to Get Snapdragon 8s Gen 4 Chipset Ahead of June 24 Launch
    July 2, 2025
    Threads Rolls Out DMs With Message Controls, Inbox Filters for Users Aged 18 and Above
    July 2, 2025
  • Digital Marketing
    • Social Media Updates
    • PPC & Ads Insights
    • SEO Trends
    • Content Marketing Strategies
    Digital MarketingShow More
    70% of Senior Marketers Support Google’s Decision to Retain Third-Party Cookies on Chrome
    December 6, 2024
  • Lifestyle & Productivity
    • Personal Productivity Tools
    • Smart Home Tech
    • Wearables
    • Wellness Gadgets
    Lifestyle & ProductivityShow More
    Allu Arjun’s Bail Hearing Postponed to January 3
    December 31, 2024
    Pushpa 2 Full Movie Leaked Online
    Pushpa 2 Full Movie Leaked Online: A Major Setback Despite Record Pre-Sales
    December 5, 2024
    Pushpa 2: The Rule Movie Review – A Gripping Mass Entertainer
    December 5, 2024
  • Automobile
    AutomobileShow More
    New Petrol Price in India: Crude Oil Prices Fall – Check Today’s Rates
    January 25, 2025
    All-New Honda Amaze 2025 Launched in India – Prices Start at ₹7.99 Lakh
    December 5, 2024
    Mahindra XEV 9e Launched In India Priced At ₹ 21.90 Lakh: Check Range, Features, and More
    November 27, 2024
Reading: Researchers Create a Low-Cost Open-Source AI Model to Analyse How OpenAI’s o1 Reasons
Share
Font ResizerAa
Pratzo - Daily NewsPratzo - Daily News
Search
Follow US
Pratzo - Daily News > Technology > Researchers Create a Low-Cost Open-Source AI Model to Analyse How OpenAI’s o1 Reasons
Technology

Researchers Create a Low-Cost Open-Source AI Model to Analyse How OpenAI’s o1 Reasons

admin
Last updated: February 6, 2025 7:02 pm
admin Published February 6, 2025
Share
SHARE

Researchers from Stanford University and Washington University have developed an open-source artificial intelligence (AI) model that is comparable in performance to OpenAI’s o1 model. The main objective of the researchers was not to create a powerful reasoning-focused model but to understand how the San Francisco-based AI firm instructed its o1 series models to perform test time scaling. Notably, the researchers were able to showcase the methodology and replicate the model’s behaviour at an extremely low cost while using far fewer compute resources.

Researchers Develop S1-32B AI Model

The researchers detailed the methodology and process of developing the model in a study published in the pre-print journal arXiv. The process involved creating a synthetic dataset from a different AI model and using several new techniques such as ablation and supervised fine-tuning (SFT). The model is available in a GitHub listing.

It should be noted that the AI model was not built from scratch. The developers used the Qwen2.5-32B-Instruct and distilled it to create the s1-32B large language model (LLM). Released in September 2024, the model is capable but given its size and lack of reasoning capabilities, it cannot match up to OpenAI’s o1.

During the process, the researchers used the Gemini Flash Thinking application processing interface (API) to generate reasoning traces and responses. A total of 59,000 triplets of questions, reasoning traces (the chain of thought or CoT), and responses were extracted from the API. A dataset called the s1K was then created by selecting 1,000 high-quality, diverse, and difficult questions as well as the reasoning traces and the responses.

After creating the s1K dataset, the researchers performed supervised fine-tuning on the Qwen2.5-32B-Instruct model. For this, basic fine-tuning hyperparameters were used. The distillation process took 26 minutes of training on 16 Nvidia H100 GPUs.

Till this point, the researchers had no idea how OpenAI trained the models to “think” and how it managed to stop the thinking process. Without this, a model runs the risk of overthinking indefinitely as it second-guesses its output wasting valuable processing power.

While fine-tuning the model, the researcher found something interesting. They found that they could manipulate the inference time by adding and XML tags. Once a model reaches the end tag, it is told to change its voice to an authoritative tone for the final answer. Notably, inference time is the near real-time responses that a typical AI model generates. Anything more than this would require careful manipulation of the code.

With the s1-32B model, the researchers added a “wait” command to force it to think beyond the usual inference period. Once added, the model began second-guessing and verifying its output. Then, the tag was used to either shorten this test time scaling phase or lengthen it.

Then, the researchers also experimented with several other phrases such as “alternatively”, and “hmm”, but found that the best performance metrics were achieved when using the “wait” tag. By bringing the model close to the performance of o1, the researchers claim that this might be the method used by OpenAI to fine-tune its reasoning models.

A TechCrunch report claims that the researchers were able to create the s1-32B AI model under $50 (roughly Rs. 4,380), highlighting that creating a post-training structure for reasoning models can be done at an extremely low cost.

source

You Might Also Like

Red Magic Astra Gaming Tablet Launched With Snapdragon 8 Elite SoC, 8,200mAh Battery

Samsung Galaxy Z Flip 7 FE Name Appears in Alleged Third-Party Case Listing Alongside Galaxy Z Flip 7

Poco F7 5G Confirmed to Get Snapdragon 8s Gen 4 Chipset Ahead of June 24 Launch

Threads Rolls Out DMs With Message Controls, Inbox Filters for Users Aged 18 and Above

Lumio Arc 5, Arc 7 Projectors Powered by Google TV to Launch in India on July 7

TAGGED:Satellite TechnologySpace TechnologyTechnology
Share This Article
Facebook Twitter Email Print
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Current Gold Rate: 3681.90 INR per gram

Follow US

Find US on Social Medias
FacebookLike
TwitterFollow
YoutubeSubscribe
TelegramFollow

Weekly Newsletter

Subscribe to our newsletter to get our newest articles instantly!

    Popular News
    Technology

    NASA Delays SPHEREx and PUNCH Missions Due to SpaceX Vehicle Checks

    admin admin March 12, 2025
    Sonos Launches Arc Ultra Soundbar, Sub 4 Subwoofer, and Era 100 Pro Speaker in India
    Doom: The Dark Ages Review — Rip and Tear, Medieval Style
    New Study Suggests Dogs May Have Domesticated Themselves for Food
    Trump-Backed USD1 Stablecoin Chosen to Back MGX’s $2 Billion Binance Stake Deal: Report
    - Advertisement -
    Ad imageAd image

    Always Stay Up to Date

    Subscribe to our newsletter to get our newest articles instantly!

      About US

      At News.Pratzo.com, we are shaping the conversation in business and technology with reliable insights and updates. As part of the Pratzo.com brand, we aim to be your trusted source for impactful stories and trends, empowering professionals and enthusiasts alike. Stay informed, inspired, and ahead with us!
      Quick Link
      • Automobile
      • News
      • Cricket
      • Lifestyle & Productivity
      • Entertainment
      • Reviews & Comparisons
      • Digital Marketing
      • SEO Trends
      • Technology
      • AI & Machine Learning

      © Flair Hair & Beauty Salon London 2025

      © Pratzo News Network. Assets of Pratzo.com . All Rights Reserved.
      Go to mobile version