---
id: 20260304-T0-12
title: "Google发布Gemini 3.1 Flash-Lite轻量模型"
title_en: "Google Releases Gemini 3.1 Flash-Lite Lightweight Model"
url: https://ai.daily.yangsir.net/daily/20260304-T0-12
issue_date: 2026-03-04
publish_date: 2026-03-03T16:34:00.000Z
source_name: "Google AI Blog"
source_url: https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-1-flash-lite/
---

# Google发布Gemini 3.1 Flash-Lite轻量模型

Google推出Gemini 3.1 Flash-Lite，成为Gemini 3系列中最快、成本最低的模型。该模型推理速度比Gemini 3.1 Flash快3倍，单位token成本降低60%，同时保持90%以上的性能。Flash-Lite支持128k上下文长度，专为大规模推理任务设计，已集成到Google Cloud Vertex AI中。企业用户可用其处理高并发文本生成、数据分析任务，单月处理量可达千万级。

## English Version

**Google Releases Gemini 3.1 Flash-Lite Lightweight Model**

Google launched Gemini 3.1 Flash-Lite, the fastest and lowest-cost model in the Gemini 3 series. It delivers 3x faster inference speed than Gemini 3.1 Flash and 60% lower cost per token while maintaining over 90% performance. Flash-Lite supports a 128k context length, designed for large-scale inference tasks, and is integrated into Google Cloud Vertex AI. Enterprise users can process high-concurrency text generation and data analysis tasks, handling up to tens of millions monthly.

---

**来源**：[Google AI Blog](https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-1-flash-lite/)

**详情页**：https://ai.daily.yangsir.net/daily/20260304-T0-12

---

*智语观潮 · Daily — https://ai.daily.yangsir.net/llms.txt*