Saturday, 12 Jul 2025
  • Contact
  • Privacy Policy
  • Terms & Conditions
  • DMCA
logo logo
  • World
  • Politics
  • Crime
  • Economy
  • Tech & Science
  • Sports
  • Entertainment
  • More
    • Education
    • Celebrities
    • Culture and Arts
    • Environment
    • Health and Wellness
    • Lifestyle
  • 🔥
  • Trump
  • House
  • VIDEO
  • ScienceAlert
  • White
  • Watch
  • Trumps
  • man
  • Health
  • Day
Font ResizerAa
American FocusAmerican Focus
Search
  • World
  • Politics
  • Crime
  • Economy
  • Tech & Science
  • Sports
  • Entertainment
  • More
    • Education
    • Celebrities
    • Culture and Arts
    • Environment
    • Health and Wellness
    • Lifestyle
Follow US
© 2024 americanfocus.online – All Rights Reserved.
American Focus > Blog > Tech and Science > Google’s Gemini 2.5 Flash introduces ‘thinking budgets’ that cut AI costs by 600% when turned down
Tech and Science

Google’s Gemini 2.5 Flash introduces ‘thinking budgets’ that cut AI costs by 600% when turned down

Last updated: April 17, 2025 7:20 pm
Share
Google’s Gemini 2.5 Flash introduces ‘thinking budgets’ that cut AI costs by 600% when turned down
SHARE

Google has unveiled its latest AI model, Gemini 2.5 Flash, which offers businesses and developers unprecedented control over the level of “thinking” their AI performs. This new model, available in preview through Google AI Studio and Vertex AI, aims to enhance reasoning capabilities while keeping pricing competitive in the crowded AI market.

One of the key features of Gemini 2.5 Flash is the introduction of a “thinking budget,” allowing developers to specify the amount of computational power allocated to reasoning through complex problems before generating a response. This approach addresses the trade-off between sophisticated reasoning, latency, and pricing in AI systems.

Tulsee Doshi, Product Director for Gemini Models at Google DeepMind, highlighted the importance of cost and latency for developers in various use cases. The flexibility to adjust the thinking capability according to needs makes Gemini 2.5 Flash Google’s “first fully hybrid reasoning model.”

The pricing structure for Gemini 2.5 Flash emphasizes the cost of reasoning in AI systems. Developers pay $0.15 per million tokens for input, with output costs varying based on reasoning settings. With thinking turned off, the cost is $0.60 per million tokens, while enabling reasoning increases the cost to $3.50 per million tokens.

Google claims that Gemini 2.5 Flash delivers competitive performance across benchmarks while maintaining a smaller model size compared to alternatives. The model outperforms competitors on tests like Humanity’s Last Exam, GPQA diamond, and AIME mathematics exams, showcasing its strength in math, multimodal reasoning, and long-context tasks.

The ability to adjust reasoning levels based on the query represents a significant advancement in AI deployment. Simple queries can benefit from disabling thinking for cost efficiency, while complex tasks requiring multi-step reasoning can leverage the thinking function for optimal results.

See also  This Painting of Lounging Lions Was Hanging in a Family's Living Room. It Turned Out to Be an Original Delacroix

In addition to the Gemini 2.5 Flash launch, Google has introduced Veo 2 video generation capabilities and announced free access to Gemini Advanced for U.S. college students until spring 2026. These moves align with Google’s strategy to compete in the AI market and build loyalty among future knowledge workers.

As Gemini 2.5 Flash continues to evolve, businesses can expect more nuanced approaches to AI deployment, allowing for customized reasoning capabilities tailored to specific tasks. The model is available for developers to start building with, with ongoing refinements based on feedback during the preview phase.

Overall, Google’s focus on customizable reasoning in AI reflects a maturing market where cost optimization and performance tuning are essential considerations, signaling a new phase in the commercialization of generative AI technologies.

TAGGED:budgetsCostscutFlashGeminiGooglesIntroducesThinkingturned
Share This Article
Twitter Email Copy Link Print
Previous Article Kettle blowing off steam sparks fire callout Kettle blowing off steam sparks fire callout
Next Article The Hot List: 20 Must-Have Pieces, According to Instagram’s Favorite Fashion Sourcer The Hot List: 20 Must-Have Pieces, According to Instagram’s Favorite Fashion Sourcer
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Popular Posts

After yield surge, US Treasury expected to keep auction sizes steady

The U.S. Treasury Department is set to announce its refunding plans on Wednesday, with expectations…

April 29, 2025

Lionel Messi returns from injury with a bang, delivers two quick goals for Inter Miami vs. Philadelphia Union

Inter Miami welcomed the Philadelphia Union to Chase Stadium on Saturday, with a familiar face…

September 14, 2024

Pollution-eating microbes are thriving in infamous NYC canal

The Gowanus Canal in Brooklyn, New York, has a notorious reputation for being one of…

April 16, 2025

Brian Crossman Jr., son of victim in Vermont triple homicide, charged with murder

A tragic incident unfolded in a quiet Vermont town as a man was arrested for…

September 20, 2024

Taylor Swift’s Next Album, Engagement: Burning Questions Answered (EXCL)

As Taylor Swift's highly successful Eras Tour comes to a close after 152 dates across…

December 3, 2024

You Might Also Like

Planet Discovery Reveals Out-of-Sync Double Star System : ScienceAlert
Tech and Science

Planet Discovery Reveals Out-of-Sync Double Star System : ScienceAlert

July 12, 2025
Sequoia bets on silence | JS
Tech and Science

Sequoia bets on silence | JS

July 11, 2025
Andrew Schulz Turned On Trump For Breaking Campaign Promises
World News

Andrew Schulz Turned On Trump For Breaking Campaign Promises

July 11, 2025
Marjorie Taylor Greene Plans Hearing on Geoengineering amid Cloud Seeding Conspiracy Theories
Tech and Science

Marjorie Taylor Greene Plans Hearing on Geoengineering amid Cloud Seeding Conspiracy Theories

July 11, 2025
logo logo
Facebook Twitter Youtube

About US


Explore global affairs, political insights, and linguistic origins. Stay informed with our comprehensive coverage of world news, politics, and Lifestyle.

Top Categories
  • Crime
  • Environment
  • Sports
  • Tech and Science
Usefull Links
  • Contact
  • Privacy Policy
  • Terms & Conditions
  • DMCA

© 2024 americanfocus.online –  All Rights Reserved.

Welcome Back!

Sign in to your account

Lost your password?