© 2024 americanfocus.online – All Rights Reserved.

People are using Super Mario to benchmark AI now

Last updated: March 3, 2025 5:38 pm

Pitting AI Against Super Mario Bros.: A Tougher Benchmark?

While Pokémon is considered a tough benchmark for AI, recent experiments suggest that Super Mario Bros. may be an even tougher one. Researchers at the Hao AI Lab, based at the University of California San Diego, put AI models through live Super Mario Bros. games. Anthropic’s Claude 3.7 was the top performer, followed closely by Claude 3.5, while Google’s Gemini 1.5 Pro and OpenAI’s GPT-4o struggled.

The version of Super Mario Bros. used in the experiments was not the original 1985 release. Instead, the game ran in an emulator and was integrated with GamingAgent, a framework developed by the Hao AI Lab. The framework gave the models control over Mario, allowing them to navigate the game world and interact with obstacles and enemies.

Image Credits: Hao Lab

GamingAgent provided the models with basic instructions and in-game screenshots, and the models responded by generating Python code to control Mario. Even with this assistance, the game required the models to learn complex maneuvers and develop gameplay strategies. Interestingly, the researchers found that reasoning models, which typically excel at problem-solving tasks, performed worse than non-reasoning models in this real-time environment.
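The loop described above (screenshot and instructions in, control input out) can be sketched roughly as follows. This is a minimal illustration, not GamingAgent's actual code: every name here (`Action`, `parse_actions`, `agent_step`, `stub_model`, the button set, and the instruction string) is hypothetical. It also simplifies one point: instead of executing model-written Python, which would need sandboxing, the sketch has the model return a plain-text action list that is validated before use.

```python
from dataclasses import dataclass
from typing import Callable, List

# Hypothetical button set; not taken from GamingAgent.
ALLOWED_BUTTONS = {"left", "right", "jump", "run"}


@dataclass
class Action:
    button: str  # one of ALLOWED_BUTTONS
    frames: int  # number of frames to hold the button


def parse_actions(model_output: str) -> List[Action]:
    """Parse lines such as 'right 30' into actions, skipping invalid lines."""
    actions = []
    for line in model_output.splitlines():
        parts = line.split()
        if len(parts) != 2:
            continue
        button, frames = parts
        if button in ALLOWED_BUTTONS and frames.isdigit():
            actions.append(Action(button, int(frames)))
    return actions


def agent_step(screenshot: bytes, instructions: str,
               model: Callable[[str, bytes], str]) -> List[Action]:
    """One iteration: show the model a frame, get back validated actions."""
    return parse_actions(model(instructions, screenshot))


def stub_model(instructions: str, screenshot: bytes) -> str:
    # Stand-in for a real vision-language model call (assumption).
    return "right 30\njump 12\nfly 5"


actions = agent_step(b"fake-screenshot-bytes",
                     "If an enemy is near, jump over it.",
                     stub_model)
print(actions)
```

The unsupported `fly 5` line is dropped during parsing, which shows why some validation layer between model output and emulator input is useful in a setup like this.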

According to the researchers, reasoning models struggle in games like Super Mario Bros. because they take longer to decide on each action, which is costly in fast-paced situations where split-second timing determines whether a jump succeeds or fails. This highlights the particular challenges that real-time games pose for AI models.
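A rough back-of-the-envelope calculation shows why decision latency matters. The sketch below assumes the game advances at about 60 frames per second and uses made-up latencies of 0.5 and 3 seconds; neither figure comes from the Hao AI Lab's results, and `frames_elapsed` is a hypothetical helper.

```python
def frames_elapsed(latency_seconds: float, fps: float = 60.0) -> int:
    """Number of whole game frames that pass while a model is deciding."""
    return int(latency_seconds * fps)


# Hypothetical decision latencies, for illustration only.
fast_model_latency = 0.5
slow_model_latency = 3.0

print(frames_elapsed(fast_model_latency))  # frames missed by the faster model
print(frames_elapsed(slow_model_latency))  # frames missed by the slower model
```

Under these assumptions the slower model's decision arrives 180 frames after the screenshot was taken, compared with 30 frames for the faster one, so the situation it reasoned about may no longer exist by the time it acts.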


Games have been used to benchmark AI for decades, but some experts question whether gaming skill says much about overall technological progress. Compared with the real world, games tend to be abstract and relatively simple, so they may not capture the complexity of the scenarios AI systems are ultimately meant to handle.

These recent gaming benchmarks underscore what Andrej Karpathy, a research scientist and founding member of OpenAI, has described as an “evaluation crisis” in the field of AI. With models advancing rapidly and benchmarks shifting along with them, it is hard to assess what AI models can actually do.

Despite these challenges, watching AI navigate the world of Super Mario Bros. offers a fascinating glimpse into the capabilities and limitations of artificial intelligence in real-time gaming scenarios.
