London Daily

Focus on the big picture.
Wednesday, Nov 12, 2025

OpenAI's o3 AI model reaches human-level performance on a general intelligence assessment.

OpenAI's o3 AI model hits a significant milestone by achieving human-level performance on the ARC-AGI benchmark, igniting discussions about the potential of artificial general intelligence.
On December 20, 2024, OpenAI's o3 system scored 85% on the ARC-AGI benchmark, a test designed to assess general intelligence. That is well above the previous best AI score of 55% and on par with the average human score.

This is a pivotal moment in the quest for artificial general intelligence (AGI), with the o3 system excelling at tasks that evaluate AI's ability to adapt to new situations with limited data, a crucial measure of intelligence.

The ARC-AGI benchmark assesses an AI system's "sample efficiency", its capacity to learn from minimal examples, a capability considered fundamental to AGI.

Unlike systems such as GPT-4, which need large numbers of examples to learn a task, o3 appears to adapt to new tasks from only a handful of examples, a long-standing challenge in AI development.

Although OpenAI has not fully revealed the technical specifics, o3’s success might derive from its ability to discern "weak rules" or simpler patterns that can be generalized to solve new problems.

The model likely explores various "chains of thought," choosing the most effective strategy based on heuristics or basic rules.

This strategy is similar to methods used by systems like Google's AlphaGo, which employs heuristic decision-making to play the game of Go.

Despite the encouraging results, many questions remain about whether o3 truly marks progress towards AGI.

There is speculation that the system might still depend on language-based learning instead of genuinely generalized cognitive abilities.

As OpenAI shares more information, the AI community will need to test o3 further to gauge its actual adaptability and whether it can match the versatility of human intelligence.

The implications of o3’s performance are significant, especially if it proves to be as adaptable as humans.

It could begin a new era of advanced AI systems capable of addressing a broad range of complex tasks.

However, fully understanding its capabilities will require more evaluations, which is likely to prompt new benchmarks and fresh discussion of AGI governance.