Close Menu
Newstech24.com
  • Home
  • News
  • Arabic News
  • Technology
  • Economy & Business
  • Sports News
What's Hot

تمثال مسيحي قرب روما كان يذرف دما تبين أنه عائد لمحتالة إيطالية

June 25, 2025

FIFA probing Pachuca’s Cabral after Rüdiger racism allegation

June 25, 2025

How oil merchants referred to as the Center East battle

June 25, 2025
Facebook X (Twitter) Instagram
Wednesday, June 25
Facebook X (Twitter) Instagram
Newstech24.com
  • Home
  • News
  • Arabic News
  • Technology
  • Economy & Business
  • Sports News
Newstech24.com
Home»Technology»Google launches ‘implicit caching’ to make accessing its latest AI models cheaper
Technology

Google launches ‘implicit caching’ to make accessing its latest AI models cheaper

AdminBy AdminMay 8, 2025No Comments3 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr Email
Google's Gemini chatbot gets upgraded image creation tools
Share
Facebook Twitter LinkedIn Pinterest Email

Google is rolling out a feature in its Gemini API that the company claims will make its latest AI models cheaper for third-party developers.

Google calls the feature “implicit caching” and says it can deliver 75% savings on “repetitive context” passed to models via the Gemini API. It supports Google’s Gemini 2.5 Pro and 2.5 Flash models.

That’s likely to be welcome news to developers as the cost of using frontier models continues to grow.

We just shipped implicit caching in the Gemini API, automatically enabling a 75% cost savings with the Gemini 2.5 models when your request hits a cache 🚢

We also lowered the min token required to hit caches to 1K on 2.5 Flash and 2K on 2.5 Pro!

— Logan Kilpatrick (@OfficialLoganK) May 8, 2025

Caching, a widely adopted practice in the AI industry, reuses frequently accessed or pre-computed data from models to cut down on computing requirements and cost. For example, caches can store answers to questions users often ask of a model, eliminating the need for the model to recreate answers to the same request.

Google previously offered model prompt caching, but only explicit prompt caching, meaning devs had to define their highest-frequency prompts. While cost savings were supposed to be guaranteed, explicit prompt caching typically involved a lot of manual work.

Some developers weren’t pleased with how Google’s explicit caching implementation worked for Gemini 2.5 Pro, which they said could cause surprisingly large API bills. Complaints reached a fever pitch in the past week, prompting the Gemini team to apologize and pledge to make changes.

In contrast to explicit caching, implicit caching is automatic. Enabled by default for Gemini 2.5 models, it passes on cost savings if a Gemini API request to a model hits a cache.

Techcrunch event

Berkeley, CA
|
June 5


BOOK NOW

“[W]hen you send a request to one of the Gemini 2.5 models, if the request shares a common prefix as one of previous requests, then it’s eligible for a cache hit,” explained Google in a blog post. “We will dynamically pass cost savings back to you.”

The minimum prompt token count for implicit caching is 1,024 for 2.5 Flash and 2,048 for 2.5 Pro, according to Google’s developer documentation, which is not a terribly big amount, meaning it shouldn’t take much to trigger these automatic savings. Tokens are the raw bits of data models work with, with a thousand tokens equivalent to about 750 words.

Given that Google’s last claims of cost savings from caching ran afoul, there are some buyer-beware areas in these new claims. For one, Google recommends that developers keep repetitive context at the beginning of requests to increase the chances of implicit cache hits. Context that might change from request to request should be appended at the end, the company says.

For another, Google didn’t offer any third-party verification that the new implicit caching system would deliver the promised automatic savings. So we’ll have to see what early adopters say.


{content}

Source: {feed_title}

Share this:

  • Click to share on Facebook (Opens in new window) Facebook
  • Click to share on X (Opens in new window) X
accessing caching cheaper Google implicit Latest launches models
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Admin
  • Website

Related Posts

Ironheart evaluate: a reminder that Marvel’s younger heroes are the long run

June 25, 2025

DJI ‘stays dedicated to the US market’ as cabinets go naked of drones

June 24, 2025

Assessment: Misen Chef’s Knife | WIRED

June 24, 2025
Leave A Reply Cancel Reply

Don't Miss
Arabic News

تمثال مسيحي قرب روما كان يذرف دما تبين أنه عائد لمحتالة إيطالية

By AdminJune 25, 20250

روما: تبيّنَ أن الدم الذي كان يذرفه دمعا تمثال صغير لمريم العذراء في بلدة قرب…

Share this:

  • Click to share on Facebook (Opens in new window) Facebook
  • Click to share on X (Opens in new window) X

FIFA probing Pachuca’s Cabral after Rüdiger racism allegation

June 25, 2025

How oil merchants referred to as the Center East battle

June 25, 2025

Man Metropolis’s Claudio Echeverri misses coaching, in boot

June 25, 2025

Zohran Mamdani stuns Democratic institution in New York mayor race

June 25, 2025

الاحتلال يقتحم يعبد قرب جنين ويحوّل منازل لثكنات عسكرية

June 25, 2025

Membership World Cup Each day: Delap will get first objective, Chelsea qualify

June 25, 2025

Cal-Maine Meals Inventory: Low Beta Serial Acquirer Outperforming The Market (NASDAQ:CALM)

June 25, 2025

إيران تلقي القبض على 700 شخص بتهمة التخابر لصالح إسرائيل خلال 12 يوما

June 25, 2025

جيش الاحتلال يعترف رسميا بمقتل 7 جنود في جنوب غزة

June 25, 2025
Advertisement
About Us
About Us

NewsTech24 is your premier digital news destination, delivering breaking updates, in-depth analysis, and real-time coverage across sports, technology, global economics, and the Arab world. We pride ourselves on accuracy, speed, and unbiased reporting, keeping you informed 24/7. Whether it’s the latest tech innovations, market trends, sports highlights, or key developments in the Middle East—NewsTech24 bridges the gap between news and insight.

Company
  • Home
  • About Us
  • Contact Us
  • Privacy Policy
  • Disclaimer
  • Terms Of Use
Latest Posts

تمثال مسيحي قرب روما كان يذرف دما تبين أنه عائد لمحتالة إيطالية

June 25, 2025

FIFA probing Pachuca’s Cabral after Rüdiger racism allegation

June 25, 2025

How oil merchants referred to as the Center East battle

June 25, 2025

Man Metropolis’s Claudio Echeverri misses coaching, in boot

June 25, 2025

Zohran Mamdani stuns Democratic institution in New York mayor race

June 25, 2025
Newstech24.com
Facebook X (Twitter) Tumblr Threads RSS
  • Home
  • News
  • Arabic News
  • Technology
  • Economy & Business
  • Sports News
© 2025 ThemeSphere. Designed by ThemeSphere.

Type above and press Enter to search. Press Esc to cancel.