Tech and Science

Google’s Gemini 2.5 Flash introduces ‘thinking budgets’ that cut AI output costs nearly sixfold when turned off

Last updated: April 17, 2025 7:20 pm

Google has unveiled its latest AI model, Gemini 2.5 Flash, which offers businesses and developers unprecedented control over the level of “thinking” their AI performs. This new model, available in preview through Google AI Studio and Vertex AI, aims to enhance reasoning capabilities while keeping pricing competitive in the crowded AI market.

One of the key features of Gemini 2.5 Flash is a “thinking budget”: a cap, measured in tokens, on how much reasoning the model may do before it generates a response. Developers set the budget to manage the trade-off between reasoning quality, latency, and cost.

Tulsee Doshi, Product Director for Gemini Models at Google DeepMind, said cost and latency matter to developers across many use cases. Because developers can adjust how much the model thinks, or turn thinking off entirely, Google describes Gemini 2.5 Flash as its “first fully hybrid reasoning model.”

The pricing structure for Gemini 2.5 Flash makes the cost of reasoning explicit. Developers pay $0.15 per million input tokens, and the output price depends on the reasoning setting: $0.60 per million output tokens with thinking turned off and $3.50 per million with thinking enabled. That is a nearly sixfold difference, or a saving of about 83% on output when thinking is off.
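
To make the arithmetic concrete, here is a minimal Python sketch that estimates the cost of a single request at the preview prices quoted above. The function name `request_cost` and the workload of 2,000 input tokens and 1,000 output tokens are illustrative assumptions, not anything published by Google. The sketch also treats `output_tokens` as the total number of billed output tokens and does not model how reasoning tokens are counted.

```python
# Prices in USD per one million tokens, as quoted in the article.
INPUT_PRICE = 0.15
OUTPUT_PRICE_NO_THINKING = 0.60
OUTPUT_PRICE_THINKING = 3.50


def request_cost(input_tokens: int, output_tokens: int, thinking: bool) -> float:
    """Estimate the USD cost of one request at the quoted preview prices."""
    output_price = OUTPUT_PRICE_THINKING if thinking else OUTPUT_PRICE_NO_THINKING
    return (input_tokens * INPUT_PRICE + output_tokens * output_price) / 1_000_000


# Hypothetical workload: 2,000 input tokens and 1,000 output tokens per request.
cost_off = request_cost(2_000, 1_000, thinking=False)
cost_on = request_cost(2_000, 1_000, thinking=True)

print(f"thinking off: ${cost_off:.6f}")
print(f"thinking on:  ${cost_on:.6f}")
print(f"output price ratio: {OUTPUT_PRICE_THINKING / OUTPUT_PRICE_NO_THINKING:.2f}x")
```

For this assumed workload the request costs about $0.0009 with thinking off and about $0.0038 with thinking on. The gap for the whole request is smaller than the gap in output price because the input price is the same in both cases.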

Google claims that Gemini 2.5 Flash delivers competitive performance across benchmarks while being smaller than alternative models. According to the company, it scores well against competitors on tests such as Humanity’s Last Exam, GPQA Diamond, and the AIME mathematics exams, with particular strength in math, multimodal reasoning, and long-context tasks.

The ability to adjust reasoning levels based on the query represents a significant advancement in AI deployment. Simple queries can benefit from disabling thinking for cost efficiency, while complex tasks requiring multi-step reasoning can leverage the thinking function for optimal results.
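
One way an application might act on this is to choose a budget per query before calling the model. The Python sketch below is a hypothetical heuristic, not Google's method and not part of any Gemini SDK: the names `plan_request`, `RequestPlan`, `REASONING_HINTS`, and the `MAX_BUDGET` value are all assumptions made for illustration. It disables thinking for short queries with no reasoning cues and scales the budget with the number of cues otherwise.

```python
from dataclasses import dataclass

# Assumed ceiling for the budget, in tokens. A placeholder, not a documented limit.
MAX_BUDGET = 8_192

# Hypothetical phrases that hint a query needs multi-step reasoning.
REASONING_HINTS = ("prove", "derive", "step by step", "optimize", "debug", "calculate")


@dataclass
class RequestPlan:
    prompt: str
    thinking_budget: int  # 0 means thinking is disabled


def plan_request(prompt: str) -> RequestPlan:
    """Pick a thinking budget from a crude estimate of query complexity."""
    text = prompt.lower()
    # Count how many hint phrases appear anywhere in the prompt (substring match).
    hints = sum(1 for hint in REASONING_HINTS if hint in text)
    # Short prompts with no reasoning cues skip thinking entirely.
    if hints == 0 and len(text.split()) < 30:
        return RequestPlan(prompt, 0)
    # Otherwise allow 1,024 tokens per cue (at least one), capped at MAX_BUDGET.
    budget = min(MAX_BUDGET, 1_024 * max(hints, 1))
    return RequestPlan(prompt, budget)


simple = plan_request("What is the capital of France?")
complex_query = plan_request("Derive the formula and calculate the result step by step.")
print(simple.thinking_budget, complex_query.thinking_budget)
```

The substring match is deliberately crude (for example, "improve" contains "prove"), so a production system would more likely use a classifier or measured task outcomes. The resulting `thinking_budget` would then be passed to whatever request configuration the chosen SDK exposes.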

In addition to the Gemini 2.5 Flash launch, Google has introduced Veo 2 video generation capabilities and announced free access to Gemini Advanced for U.S. college students until spring 2026. These moves align with Google’s strategy to compete in the AI market and build loyalty among future knowledge workers.

As Gemini 2.5 Flash continues to evolve, businesses can expect more nuanced approaches to AI deployment, allowing for customized reasoning capabilities tailored to specific tasks. The model is available for developers to start building with, with ongoing refinements based on feedback during the preview phase.

Overall, Google’s focus on customizable reasoning in AI reflects a maturing market where cost optimization and performance tuning are essential considerations, signaling a new phase in the commercialization of generative AI technologies.
