Friday, 19 Sep 2025
  • Contact
  • Privacy Policy
  • Terms & Conditions
  • DMCA
logo logo
  • World
  • Politics
  • Crime
  • Economy
  • Tech & Science
  • Sports
  • Entertainment
  • More
    • Education
    • Celebrities
    • Culture and Arts
    • Environment
    • Health and Wellness
    • Lifestyle
  • 🔥
  • Trump
  • House
  • VIDEO
  • White
  • ScienceAlert
  • Trumps
  • Watch
  • man
  • Health
  • Season
Font ResizerAa
American FocusAmerican Focus
Search
  • World
  • Politics
  • Crime
  • Economy
  • Tech & Science
  • Sports
  • Entertainment
  • More
    • Education
    • Celebrities
    • Culture and Arts
    • Environment
    • Health and Wellness
    • Lifestyle
Follow US
© 2024 americanfocus.online – All Rights Reserved.
American Focus > Blog > Tech and Science > Google’s Gemini 2.5 Flash introduces ‘thinking budgets’ that cut AI costs by 600% when turned down
Tech and Science

Google’s Gemini 2.5 Flash introduces ‘thinking budgets’ that cut AI costs by 600% when turned down

Last updated: April 17, 2025 7:20 pm
Share
Google’s Gemini 2.5 Flash introduces ‘thinking budgets’ that cut AI costs by 600% when turned down
SHARE

Google has unveiled its latest AI model, Gemini 2.5 Flash, which offers businesses and developers unprecedented control over the level of “thinking” their AI performs. This new model, available in preview through Google AI Studio and Vertex AI, aims to enhance reasoning capabilities while keeping pricing competitive in the crowded AI market.

One of the key features of Gemini 2.5 Flash is the introduction of a “thinking budget,” allowing developers to specify the amount of computational power allocated to reasoning through complex problems before generating a response. This approach addresses the trade-off between sophisticated reasoning, latency, and pricing in AI systems.

Tulsee Doshi, Product Director for Gemini Models at Google DeepMind, highlighted the importance of cost and latency for developers in various use cases. The flexibility to adjust the thinking capability according to needs makes Gemini 2.5 Flash Google’s “first fully hybrid reasoning model.”

The pricing structure for Gemini 2.5 Flash emphasizes the cost of reasoning in AI systems. Developers pay $0.15 per million tokens for input, with output costs varying based on reasoning settings. With thinking turned off, the cost is $0.60 per million tokens, while enabling reasoning increases the cost to $3.50 per million tokens.

Google claims that Gemini 2.5 Flash delivers competitive performance across benchmarks while maintaining a smaller model size compared to alternatives. The model outperforms competitors on tests like Humanity’s Last Exam, GPQA diamond, and AIME mathematics exams, showcasing its strength in math, multimodal reasoning, and long-context tasks.

The ability to adjust reasoning levels based on the query represents a significant advancement in AI deployment. Simple queries can benefit from disabling thinking for cost efficiency, while complex tasks requiring multi-step reasoning can leverage the thinking function for optimal results.

See also  The 17 best science fiction TV shows of all time - according to New Scientist writers

In addition to the Gemini 2.5 Flash launch, Google has introduced Veo 2 video generation capabilities and announced free access to Gemini Advanced for U.S. college students until spring 2026. These moves align with Google’s strategy to compete in the AI market and build loyalty among future knowledge workers.

As Gemini 2.5 Flash continues to evolve, businesses can expect more nuanced approaches to AI deployment, allowing for customized reasoning capabilities tailored to specific tasks. The model is available for developers to start building with, with ongoing refinements based on feedback during the preview phase.

Overall, Google’s focus on customizable reasoning in AI reflects a maturing market where cost optimization and performance tuning are essential considerations, signaling a new phase in the commercialization of generative AI technologies.

TAGGED:budgetsCostscutFlashGeminiGooglesIntroducesThinkingturned
Share This Article
Twitter Email Copy Link Print
Previous Article Kettle blowing off steam sparks fire callout Kettle blowing off steam sparks fire callout
Next Article The Hot List: 20 Must-Have Pieces, According to Instagram’s Favorite Fashion Sourcer The Hot List: 20 Must-Have Pieces, According to Instagram’s Favorite Fashion Sourcer
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Popular Posts

Resilience, a Private Japanese Spacecraft, Crash-Landed on the Moon

A Japanese spacecraft has recently crash-landed on the Moon, marking the second unsuccessful landing attempt…

June 6, 2025

President Donald J. Trump Extends the Hiring Freeze – The White House

EXTENDING THE HIRING FREEZE: Today, President Donald J. Trump has taken the bold step of…

April 17, 2025

28 Fascinating and Fun Facts About Earth

Earth is undoubtedly a unique and special planet in our solar system. As the third…

March 11, 2025

Jenna Johnson Slams Hurtful Comment About Son Using a Bottle

Reality TV star Jenna Johnson recently spoke out against online bullying after receiving hurtful messages…

July 6, 2025

Colorado officials blast Republican tax bill as Medicaid cuts loom

Colorado’s Democratic leaders strongly criticized congressional Republicans’ tax bill as a “complete betrayal” on Tuesday…

July 1, 2025

You Might Also Like

Huawei Watch GT6 Series Announced With Huge Battery Life
Tech and Science

Huawei Watch GT6 Series Announced With Huge Battery Life

September 19, 2025
Unforgeable quantum money can be stored in an ultracold ‘debit card’
Tech and Science

Unforgeable quantum money can be stored in an ultracold ‘debit card’

September 19, 2025
Google Pixel 10 Review: The New Normal
Tech and Science

Google Pixel 10 Review: The New Normal

September 19, 2025
Math puzzle: The four islands
Tech and Science

Math puzzle: The four islands

September 19, 2025
logo logo
Facebook Twitter Youtube

About US


Explore global affairs, political insights, and linguistic origins. Stay informed with our comprehensive coverage of world news, politics, and Lifestyle.

Top Categories
  • Crime
  • Environment
  • Sports
  • Tech and Science
Usefull Links
  • Contact
  • Privacy Policy
  • Terms & Conditions
  • DMCA

© 2024 americanfocus.online –  All Rights Reserved.

Welcome Back!

Sign in to your account

Lost your password?