London Daily

Focus on the big picture.
Wednesday, Nov 26, 2025

OpenAI's o3 AI model reaches human-level performance on a general intelligence assessment.

OpenAI's o3 AI model hits a significant milestone by achieving human-level performance on the ARC-AGI benchmark, igniting discussions about the potential of artificial general intelligence.
In a major development, OpenAI's o3 system reached human-level performance on a test assessing general intelligence.

On December 20, 2024, o3 achieved an 85% score on the ARC-AGI benchmark, surpassing the previous top AI score of 55% and equaling the average human score.

This is a pivotal moment in the quest for artificial general intelligence (AGI), with the o3 system excelling at tasks that evaluate AI's ability to adapt to new situations with limited data, a crucial measure of intelligence.

The ARC-AGI benchmark assesses AI's "sample efficiency"—its capacity to learn from minimal examples—and is considered a fundamental step toward AGI.

Unlike systems like GPT-4 that depend on large datasets, o3 appears to perform well with minimal training data, a significant challenge in AI development.

Although OpenAI has not fully revealed the technical specifics, o3’s success might derive from its ability to discern "weak rules" or simpler patterns that can be generalized to solve new problems.

The model likely explores various "chains of thought," choosing the most effective strategy based on heuristics or basic rules.

This strategy is similar to methods used by systems like Google's AlphaGo, which employs heuristic decision-making to play the game of Go.

Despite the encouraging results, many questions remain about whether o3 truly marks progress towards AGI.

There is speculation that the system might still depend on language-based learning instead of genuinely generalized cognitive abilities.

As OpenAI shares more information, the AI community will require further testing to evaluate o3's actual adaptability and whether it can match human intelligence's versatility.

The implications of o3’s performance are significant, especially if it proves to be as adaptable as humans.

It could begin a new era of advanced AI systems capable of addressing a broad range of complex tasks.

However, a complete understanding of its capabilities will necessitate more evaluations, leading to new benchmarks and discussions regarding AGI governance.
Newsletter

Related Articles

0:00
0:00
Close
Lamine Yamal? The ‘Heir to Messi’ Lost to Barcelona — and the Kingdom Is in a Frenzy
Warner Music Group Drops Suit Against Suno, Launches Licensed AI-Music Deal
HP to Cut up to 6,000 Jobs Globally as It Ramps Up AI Integration
MediaWorld Sold iPad Air for €15 — Then Asked Customers to Return Them or Pay More
UK Prime Minister Sir Keir Starmer Promises ‘Full-Time’ Education for All Children as School Attendance Slips
UK Extends Sugar Tax to Sweetened Milkshakes and Lattes in 2028 Health Push
UK Government Backs £49 Billion Plan for Heathrow Third Runway and Expansion
UK Gambling Firms Report £1bn Surge in Annual Profits as Pressure Mounts for Higher Betting Taxes
UK Shares Advance Ahead of Budget as Financials and Consumer Staples Lead Gains
Domino’s UK CEO Andrew Rennie Steps Down Amid Strategic Reset
UK Economy Stalls as Reeves Faces First Budget Test
UK Economy’s Weak Start Adds Pressure on Prime Minister Starmer
UK Government Acknowledges Billionaire Exodus Amid Tax Rise Concerns
UK Budget 2025: Markets Brace as Chancellor Faces Fiscal Tightrope
UK Unveils Strategic Plan to Secure Critical Mineral Supply Chains
UK Taskforce Calls for Radical Reset of Nuclear Regulation to Cut Costs and Accelerate Build
UK Government Launches Consultation on Major Overhaul of Settlement Rules
Google Struggles to Meet AI Demand as Infrastructure, Energy and Supply-Chain Gaps Deepen
Car Parts Leader Warns Europe Faces Heavy Job Losses in ‘Darwinian’ Auto Shake-Out
Arsenal Move Six Points Clear After Eze’s Historic Hat-Trick in Derby Rout
Wealthy New Yorkers Weigh Second Homes as the ‘Mamdani Effect’ Ripples Through Luxury Markets
Families Accuse OpenAI of Enabling ‘AI-Driven Delusions’ After Multiple Suicides
UK Unveils Critical-Minerals Strategy to Break China Supply-Chain Grip
Taylor Swift’s “The Fate of Ophelia” Extends U.K. No. 1 Run to Five Weeks
UK VPN Sign-Ups Surge by Over 1,400 % as Age-Verification Law Takes Effect
Former MEP Nathan Gill Jailed for Over Ten Years After Taking Pro-Russia Bribes
Majority of UK Entrepreneurs Regard Government as ‘Anti-Business’, Survey Shows
UK’s Starmer and US President Trump Align as Geneva Talks Probe Ukraine Peace Plan
UK Prime Minister Signals Former Prince Andrew Should Testify to US Epstein Inquiry
Royal Navy Deploys HMS Severn to Shadow Russian Corvette and Tanker Off UK Coast
China’s Wedding Boom: Nightclubs, Mountains and a Demographic Reset
Fugees Founding Member Pras Michel Sentenced to 14 Years in High-Profile US Foreign Influence Case
WhatsApp’s Unexpected Rise Reshapes American Messaging Habits
United States: Judge Dressed Up as Elvis During Hearings – and Was Forced to Resign
Johnson Blasts ‘Incoherent’ Covid Inquiry Findings Amid Report’s Harsh Critique of His Government
Lord Rothermere Secures £500 Million Deal to Acquire Telegraph Titles
Maduro Tightens Security Measures as U.S. Strike Threat Intensifies
U.S. Envoys Deliver Ultimatum to Ukraine: Sign Peace Deal by Thursday or Risk Losing American Support
Zelenskyy Signals Progress Toward Ending the War: ‘One of the Hardest Moments in History’ (end of his business model?)
U.S. Issues Alert Declaring Venezuelan Airspace a Hazard Due to Escalating Security Conditions
The U.S. State Department Announces That Mass Migration Constitutes an Existential Threat to Western Civilization and Undermines the Stability of Key American Allies
Students Challenge AI-Driven Teaching at University of Staffordshire
Pikeville Medical Center Partners with UK’s Golisano Children’s Network to Expand Pediatric Care
Germany, France and UK Confirm Full Support for Ukraine in US-Backed Security Plan
UK Low-Traffic Neighbourhoods Face Rising Backlash as Pandemic Schemes Unravel
UK Records Coldest Night of Autumn as Sub-Zero Conditions Sweep the Country
UK at Risk of Losing International Doctors as Workforce Exodus Grows, Regulator Warns
ASU Launches ASU London, Extending Its Innovation Brand to the UK Education Market
UK Prime Minister Keir Starmer to Visit China in January as Diplomatic Reset Accelerates
Google Launches Voluntary Buyouts for UK Staff Amid AI-Driven Company Realignment
×