London Daily

Focus on the big picture.
Sunday, Sep 14, 2025

OpenAI's o3 AI model reaches human-level performance on a general intelligence assessment.

OpenAI's o3 AI model hits a significant milestone by achieving human-level performance on the ARC-AGI benchmark, igniting discussions about the potential of artificial general intelligence.
In a major development, OpenAI's o3 system reached human-level performance on a test assessing general intelligence.

On December 20, 2024, o3 achieved an 85% score on the ARC-AGI benchmark, surpassing the previous top AI score of 55% and equaling the average human score.

This is a pivotal moment in the quest for artificial general intelligence (AGI), with the o3 system excelling at tasks that evaluate AI's ability to adapt to new situations with limited data, a crucial measure of intelligence.

The ARC-AGI benchmark assesses AI's "sample efficiency"—its capacity to learn from minimal examples—and is considered a fundamental step toward AGI.

Unlike systems like GPT-4 that depend on large datasets, o3 appears to perform well with minimal training data, a significant challenge in AI development.

Although OpenAI has not fully revealed the technical specifics, o3’s success might derive from its ability to discern "weak rules" or simpler patterns that can be generalized to solve new problems.

The model likely explores various "chains of thought," choosing the most effective strategy based on heuristics or basic rules.

This strategy is similar to methods used by systems like Google's AlphaGo, which employs heuristic decision-making to play the game of Go.

Despite the encouraging results, many questions remain about whether o3 truly marks progress towards AGI.

There is speculation that the system might still depend on language-based learning instead of genuinely generalized cognitive abilities.

As OpenAI shares more information, the AI community will require further testing to evaluate o3's actual adaptability and whether it can match human intelligence's versatility.

The implications of o3’s performance are significant, especially if it proves to be as adaptable as humans.

It could begin a new era of advanced AI systems capable of addressing a broad range of complex tasks.

However, a complete understanding of its capabilities will necessitate more evaluations, leading to new benchmarks and discussions regarding AGI governance.
Newsletter

Related Articles

0:00
0:00
Close
French Debt Downgrade Piles Pressure on Macron’s New Prime Minister
US and UK Near Tech, Nuclear and Whisky Deals Ahead of Trump Trip
One in Three Europeans Now Uses TikTok, According to the Chinese Tech Giant
Could AI Nursing Robots Help Healthcare Staffing Shortages?
NATO Deploys ‘Eastern Sentry’ After Russian Drones Violate Polish Airspace
Anesthesiologist Left Operation Mid-Surgery to Have Sex with Nurse
Tens of Thousands of Young Chinese Get Up Every Morning and Go to Work Where They Do Nothing
The New Life of Novak Djokovic
The German Owner of Politico Mathias Döpfner Eyes Further U.S. Media Expansion After Axel Springer Restructuring
Suspect Arrested: Utah Man in Custody for Charlie Kirk’s Fatal Shooting
In a politically motivated trial: Bolsonaro Sentenced to 27 Years for Plotting Coup After 2022 Defeat
German police raid AfD lawmaker’s offices in inquiry over Chinese payments
Turkish authorities seize leading broadcaster amid fraud and tax investigation
Volkswagen launches aggressive strategy to fend off Chinese challenge in Europe’s EV market
ChatGPT CEO signals policy to alert authorities over suicidal youth after teen’s death
The British legal mafia hit back: Banksy mural of judge beating protester is scrubbed from London court
Surpassing Musk: Larry Ellison becomes the richest man in the world
Embarrassment for Starmer: He fired the ambassador photographed on Epstein’s 'pedophile island'
Manhunt after 'skilled sniper' shot Charlie Kirk. Footage: Suspect running on rooftop during panic
Effective Protest Results: Nepal’s Prime Minister Resigns as Youth-Led Unrest Shakes the Nation
Qatari prime minister says Netanyahu ‘killed any hope’ for Israeli hostages
King Charles and Prince Harry Share First In-Person Moment in 19 Months
Starmer Establishes Economic ‘Budget Board’ to Centralise Policy and Rebuild Business Trust
France Erupts in Mass ‘Block Everything’ Protests on New PM’s First Day
Poland Shoots Down Russian Drones in Airspace Violation During Ukraine Attack
Brazilian police say ex-President Bolsonaro had planned to flee to Argentina seeking asylum
Trinidad Leader Applauds U.S. Naval Strike and Advocates Forceful Action Against Traffickers
Kim Jong Un Oversees Final Test of New High-Thrust Solid-Fuel Rocket Engine
Apple Introduces Ultra-Thin iPhone Air, Enhanced 17 Series and New Health-Focused Wearables
Macron Appoints Sébastien Lecornu as Prime Minister Amid Budget Crisis and Political Turmoil
Supreme Court temporarily allows Trump to pause billions in foreign aid
Charlie Sheen says his father, Martin Sheen, turned him in to the police: 'The greatest betrayal possible'
Vatican hosts first Catholic LGBTQ pilgrimage
Apple Unveils iPhone 17 Series, iPhone Air, Apple Watch 11 and More at 'Awe Dropping' Event
Pig Heads Left Outside Multiple Paris Mosques in Outrage-Inducing Acts
Nvidia’s ‘Wow’ Factor Is Fading. The AI chip giant used to beat Wall Street expectations for earnings by a substantial margin. That trajectory is coming down to earth.
France joins Eurozone’s ‘periphery’ as turmoil deepens, say investors
On the Anniversary of Queen Elizabeth’s Death: Prince Harry Returns to Britain
France Faces New Political Crisis, again, as Prime Minister Bayrou Pushed Out
Murdoch Family Finalises $3.3 Billion Succession Pact, Ensuring Eldest Son’s Leadership
Big Oil Slashes Jobs and Investments Amid Prolonged Low Crude Prices
Court Staff Cover Up Banksy Image of Judge Beating a Protester
Social Media Access Curtailed in Turkey After CHP Calls for Rallies Following Police Blockade of Istanbul Headquarters
Nayib Bukele Points Out Belgian Hypocrisy as Brussels Considers Sending Army into the Streets
Elon Musk Poised to Become First Trillionaire Under Ambitious Tesla Pay Plan
France, at an Impasse, Heads Toward Another Government Collapse
Burning the Minister’s House Helped Protesters to Win Justice: Prabowo Fires Finance Minister in Wake of Indonesia Protests
Brazil Braces for Fallout from Bolsonaro Trial by corrupted judge
The Country That Got Too Rich? Public Spending Dominates Norway Election
Nearly 40 Years Later: Nike Changes the Legendary Slogan Just Do It
×