Gemini 1.5 Pro and 1.5 Flash GA, 1.5 Flash tuning support, higher rate limits, and more API updates

May 30, 2024

Logan Kilpatrick Senior Product Manager Gemini API and Google AI Studio

Shrestha Basu Mallick Group Product Manager Gemini API

Editor’s note: The post has been updated to reflect that 1.5 Flash tuning support has been delayed a few weeks and will not launch on June 17.

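Building on the momentum from Google I/O, we are announcing important updates to the Gemini API and Google AI Studio, including:

Gemini 1.5 Flash and 1.5 Pro stable releases and billing

Higher rate limits for Gemini 1.5 Flash

Gemini 1.5 Flash tuning

JSON schema mode

Mobile support and light mode in Google AI Studio

We are excited to see what you build with these new models, and we remain committed to building a world-class developer experience. You can start using Gemini 1.5 Flash and 1.5 Pro for free in Google AI Studio.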

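Gemini 1.5 Flash updates

Gemini 1.5 Flash is our fastest, most cost-efficient model to date, built for high-volume tasks where developers need low latency and low cost. Today, we are raising the rate limit for 1.5 Flash to 1000 requests per minute (RPM) and removing the requests-per-day limit. The 1.5 Pro rate limit is not changing at this time, but if you need a higher limit in order to scale, or if you have feedback, please contact us.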
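To make the 1000 RPM figure concrete, here is a minimal client-side throttle, sketched under assumptions: the class name `RpmLimiter`, the 60-second sliding window, and the injectable `clock` and `sleep` parameters are illustrative choices, not part of the Gemini API or any official SDK. The service enforces its own limits in whatever way it chooses; this only shows one way a caller might pace its own requests to stay under a per-minute quota.

```python
import collections
import time

WINDOW_SECONDS = 60.0  # one minute, matching a requests-per-minute quota


class RpmLimiter:
    """Hypothetical sliding-window limiter: at most `rpm` calls per 60 s."""

    def __init__(self, rpm=1000, clock=time.monotonic, sleep=time.sleep):
        self.rpm = rpm
        self.clock = clock    # injectable so the limiter can be tested
        self.sleep = sleep    # injectable so tests do not really wait
        self.sent = collections.deque()  # timestamps of recent requests

    def acquire(self):
        """Block until one more request fits inside the current window."""
        while True:
            now = self.clock()
            # Forget requests that have aged out of the window.
            while self.sent and now - self.sent[0] >= WINDOW_SECONDS:
                self.sent.popleft()
            if len(self.sent) < self.rpm:
                self.sent.append(now)
                return
            # Window is full: wait until the oldest request expires.
            self.sleep(WINDOW_SECONDS - (now - self.sent[0]))
```

A caller would create one `RpmLimiter(rpm=1000)` and call `acquire()` before each API request. A sliding window is used here for simplicity; a token bucket would smooth bursts more evenly and may suit some workloads better.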

Customizing models can help you reach the performance threshold needed to take AI models into production. To support that, we will also be rolling out tuning support for Gemini 1.5 Flash in the coming weeks. Tuning will be supported in both Google AI Studio and the Gemini API directly. Currently, tuning jobs are free of charge, and using a tuned model does not incur any additional per-token costs. You can learn more about tuning in the Gemini API docs.

Gemini API billing

In addition to the free tier, starting today developers can unlock higher API rate limits by enabling a billing account in Google AI Studio.

Set up billing in Google AI Studio

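You can learn more about pricing for the Gemini 1.5 models at ai.google.dev/pricing. If you run into any issues setting up billing, please let us know on the developer forum. For developers who want to scale with enterprise-grade features, the same models are also available on Vertex AI, our enterprise AI platform.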

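JSON schema mode

Earlier this year, we launched JSON mode in the Gemini API and Google AI Studio to give you more control over model output. Starting today, you can specify the JSON schema you want the model to respond with. This unlocks many new use cases in which the model must honor output constraints, such as following a predefined structure or outputting only specific text. You can read more about JSON schema mode in the Gemini API docs.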
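As a rough illustration of what specifying a schema might look like, the sketch below builds a request body in plain Python without sending it. The field names `generationConfig`, `responseMimeType`, and `responseSchema`, and the upper-case type names, reflect our understanding of the REST request format and should be treated as assumptions to verify against the Gemini API docs. The helper `build_request`, the example prompt, and `recipe_schema` are hypothetical.

```python
import json


def build_request(prompt, schema):
    """Assemble a request body that asks for JSON matching `schema`.

    Field names are assumptions based on the REST request format;
    check the Gemini API docs before relying on them.
    """
    return {
        "contents": [{"parts": [{"text": prompt}]}],
        "generationConfig": {
            "responseMimeType": "application/json",
            "responseSchema": schema,
        },
    }


# Hypothetical schema: a list of objects, each with a required name.
recipe_schema = {
    "type": "ARRAY",
    "items": {
        "type": "OBJECT",
        "properties": {"recipe_name": {"type": "STRING"}},
        "required": ["recipe_name"],
    },
}

body = build_request("List three popular cookie recipes.", recipe_schema)
print(json.dumps(body, indent=2))
```

Even with a schema supplied, it is prudent to parse the response with `json.loads` and check the fields you depend on before using them.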