London Daily

Focus on the big picture.
Monday, Dec 01, 2025

Students Beat ChatGPT At This Exam, Score 76%, Compared To Chatbot's 47%

Researchers found that students fared better on accounting exams than ChatGPT, OpenAI's chatbot product.

Despite this, they said that ChatGPT's performance was "impressive" and that it was a "game changer that will change the way everyone teaches and learns - for the better."

The researchers from Brigham Young University (BYU), US, and 186 other universities wanted to know how OpenAI's technology would fare on accounting exams. They have published their findings in the journal Issues in Accounting Education.

In the researchers' accounting exam, students scored an overall average of 76.7 per cent, compared to ChatGPT's score of 47.4 per cent.

ChatGPT scored higher than the student average on 11.3 per cent of the questions, doing particularly well on accounting information systems (AIS) and auditing, but it performed worse on tax, financial, and managerial assessments. The researchers think this could be because ChatGPT struggled with the mathematical processes required for the latter types.

The AI bot, which uses machine learning to generate natural language text, was further found to do better on true/false questions (68.7 per cent correct) and multiple-choice questions (59.5 per cent), but struggled with short-answer questions (between 28.7 and 39.1 per cent).

In general, the researchers said that higher-order questions were harder for ChatGPT to answer. In fact, ChatGPT sometimes provided authoritative written descriptions for incorrect answers, or answered the same question in different ways.

They also found that ChatGPT often provided explanations for its answers, even if they were incorrect. Other times, it went on to select the wrong multiple-choice answer, despite providing accurate descriptions.

Researchers importantly noted that ChatGPT sometimes made up facts. For example, when providing a reference, it generated a real-looking reference that was completely fabricated. The work and sometimes the authors did not even exist.

The bot also made nonsensical mathematical errors, such as adding two numbers in a subtraction problem or dividing numbers incorrectly.

Wanting to add to the intense ongoing debate about how models like ChatGPT should factor into education, lead study author David Wood, a BYU professor of accounting, decided to recruit as many professors as possible to see how the AI fared against actual university accounting students.

His co-author recruiting pitch on social media exploded: 327 co-authors from 186 educational institutions in 14 countries participated in the research, contributing 25,181 classroom accounting exam questions.

They also recruited undergraduate BYU students to feed another 2,268 textbook test bank questions to ChatGPT. The questions covered AIS, auditing, financial accounting, managerial accounting and tax, and varied in difficulty and type (true/false, multiple choice, short answer).