This is how o1 works, OpenAI's model that 'thinks' before responding
Project Strawberry finally unveiled. The new AI model is available to ChatGPT Plus and Team users
2 min read
OpenAI today announced o1, a new series of models designed to tackle increasingly complicated problems, reasoning through complex tasks and solving harder problems than previous versions of the technology. This release is an early preview of the series, comprising o1-preview and o1-mini, available in ChatGPT and in the API.
"We trained these models to spend more time thinking about problems before responding, just as a person would. Through training, they learn to refine their thinking process, try different strategies and recognise their mistakes".
Known by the code name Strawberry, o1-preview is, as OpenAI's blog post describes it, a new large language model trained with reinforcement learning to perform complex reasoning. As they write on the blog: 'o1 thinks before it responds: it can produce a long internal chain of thought before responding to the user'.
'In essence,' the post says, 'through the training process, these models learn to refine their thinking, try different possibilities and recognise their errors.' In practice, this means the model can answer more advanced questions, sometimes faster than a human being would.
Similar to how a human being may think for a long time before answering a difficult question, o1 uses a chain of thought when trying to solve a problem. Through reinforcement learning, o1 learns to refine its chain of thought and the strategies it uses. It learns to recognise and correct its mistakes. It learns to break down difficult steps into simpler ones. It learns to try a different approach when its current one does not work.
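The loop the article describes, trying a strategy, checking the result, and falling back to another approach when it fails, can be sketched in miniature. The toy solver below is purely illustrative: the problem, strategies, and checker are invented for this example, and none of this is OpenAI's actual training or inference code.

```python
# Illustrative only: a miniature "try, check, revise" loop in the spirit of
# the behaviour the article describes. The toy task is finding the integer
# square root of a number; the strategies and checker are made-up examples.

def check(candidate: int, target: int) -> bool:
    """Verify a proposed answer: is candidate the integer square root of target?"""
    return candidate * candidate <= target < (candidate + 1) * (candidate + 1)

def strategy_crude_guess(target: int) -> int:
    """A quick first attempt: just guess half the target."""
    return target // 2

def strategy_binary_search(target: int) -> int:
    """A more careful fallback: binary-search for the integer square root."""
    lo, hi = 0, target
    while lo < hi:
        mid = (lo + hi + 1) // 2
        if mid * mid <= target:
            lo = mid
        else:
            hi = mid - 1
    return lo

def solve(target: int) -> int:
    """Try each strategy in turn, keeping the first answer that passes the check."""
    for strategy in (strategy_crude_guess, strategy_binary_search):
        answer = strategy(target)
        if check(answer, target):
            return answer  # this line of reasoning held up
        # otherwise: recognise the mistake and try a different approach
    raise ValueError("no strategy produced a verified answer")

print(solve(10))  # the crude guess (5) fails the check; binary search returns 3
```

The key idea mirrored here is that the model does not commit to its first answer: each intermediate result is checked, and a failed check triggers a different approach, just as the article says o1 'learns to try a different approach when its current one does not work'.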