In response to user feedback about Gemini’s new compute‑resource quotas draining too quickly, Google has recently adjusted its quota management strategy. At the Google I/O 2026 conference, Gemini switched from a fixed message‑count model to a compute‑resource‑based usage limit, where tasks of different complexity consume varying amounts of quota: simple text prompts use far less than heavy video or code analysis. However, after the update, many users found that free credits were exhausted within minutes when uploading large files or performing complex operations.
To mitigate this issue, Gemini head Josh Woodward said Google has now implemented a per‑prompt quota cap to prevent any single request from hogging too many resources, allowing users to make fuller use of their Pro model quota. For heavy tasks like Deep Research, Google will also introduce more detailed usage breakdowns and notification features to help users track quota consumption in real time. Currently, the gemini.google.com/usage dashboard only shows overview data; future updates will add granular reports.

