Gemma open models
A family of lightweight, state-of-the-art open models built from the same research and technology used to create the Gemini models.
Responsible by design
Incorporating comprehensive safety measures, these models help ensure responsible and trustworthy AI solutions through curated datasets and rigorous tuning.
Unmatched performance for its size
Gemma models achieve exceptional benchmark results at their 2B and 7B sizes, even outperforming some larger open models.
Framework flexible
With Keras 3.0, enjoy seamless compatibility with JAX, TensorFlow, and PyTorch, empowering you to effortlessly choose and switch frameworks depending on your task.
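Keras 3 selects its backend once, at import time, from the KERAS_BACKEND environment variable, which is what makes switching between JAX, TensorFlow, and PyTorch a one-line change. A minimal sketch of that selection pattern (the helper name `select_backend` is ours, and this assumes a Keras 3 installation with the chosen framework available):

```python
import os

def select_backend(name: str) -> None:
    """Keras 3 reads KERAS_BACKEND once, when `import keras` runs,
    so the variable must be set before that first import."""
    assert name in {"jax", "tensorflow", "torch"}
    os.environ["KERAS_BACKEND"] = name

# Pick the framework; the same model code then runs unchanged on it.
select_backend("jax")
# ... followed by `import keras` and ordinary Keras model code.
print(os.environ["KERAS_BACKEND"])  # jax
```

Because the backend is fixed at import time, changing frameworks for an existing script means changing only this variable, not the model code.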
Gemma model variants
Quick-start guides for developers
Partner quick-start guides
Benchmarks
Gemma sets a new bar for state-of-the-art performance for its size compared to popular models like Llama 2 and Mistral 7B.
5-shot, top-1
MMLU
The MMLU benchmark is a test that measures the breadth of knowledge and problem-solving ability acquired by large language models during pretraining.
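The shot counts listed with each benchmark (5-shot, 0-shot, 7-shot, and so on) refer to how many solved examples are placed in the prompt before the test question. As a hypothetical sketch of what k-shot prompting looks like (the exact formatting varies per benchmark; the helper and example data here are illustrative):

```python
def build_few_shot_prompt(examples, question, k=5):
    """Prepend k solved Q/A pairs so the model can infer the task
    format before answering the real question (0-shot means k=0)."""
    shots = "\n\n".join(f"Q: {q}\nA: {a}" for q, a in examples[:k])
    return f"{shots}\n\nQ: {question}\nA:"

# Illustrative 2-shot prompt.
examples = [("2 + 2 = ?", "4"), ("Capital of France?", "Paris")]
prompt = build_few_shot_prompt(examples, "3 + 5 = ?", k=2)
print(prompt)
```

The model's completion after the final "A:" is then scored, e.g. top-1 means only its single highest-probability answer is checked.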
0-shot
HellaSwag
The HellaSwag benchmark challenges a language model's ability to understand and apply common sense reasoning by selecting the most logical ending to a story.
0-shot
PIQA
The PIQA benchmark tests a language model's ability to understand and apply physical commonsense knowledge by answering questions about everyday physical interactions.
0-shot
SIQA
The SIQA benchmark evaluates a language model's understanding of social interactions and social common sense by asking questions about people’s actions and their social implications.
0-shot
BoolQ
The BoolQ benchmark tests a language model's ability to answer naturally occurring yes/no questions (generated in unprompted, unconstrained settings), testing the model's ability to perform real-world natural language inference tasks.
partial scoring
Winogrande
The Winogrande benchmark tests a language model's ability to resolve ambiguous fill-in-the-blank tasks with binary options, requiring generalized commonsense reasoning.
7-shot
CQA
The CQA benchmark assesses the performance of language models on multiple-choice question-answering, requiring different types of commonsense knowledge.
OBQA
The OBQA benchmark evaluates a language model's ability to perform advanced question-answering with multi-step reasoning, commonsense knowledge, and rich text comprehension, modeled after open book exams.
ARC-e
The ARC-e benchmark tests a language model's advanced question-answering skills with genuine grade-school level, multiple-choice science questions.
ARC-c
The ARC-c benchmark is a more focused subset of the ARC-e dataset, containing only questions answered incorrectly by common (retrieval-based and word co-occurrence) algorithms.
5-shot
TriviaQA
The TriviaQA benchmark tests reading comprehension skills with question-answer-evidence triples.
pass@1
HumanEval
The HumanEval benchmark tests a language model's code generation abilities by evaluating whether its solutions pass functional unit tests for programming problems.
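Functional scoring of this kind runs the generated code against hidden unit tests rather than comparing text: a problem counts as solved (under pass@1, on the first sample) only if every test passes. A minimal sketch of that check, with an illustrative `add` problem standing in for a real benchmark task:

```python
def passes_unit_tests(candidate_src: str, test_src: str) -> bool:
    """Execute a candidate solution, then its unit tests, in a shared
    namespace; any exception (including a failed assert) means failure."""
    namespace: dict = {}
    try:
        exec(candidate_src, namespace)  # define the candidate function
        exec(test_src, namespace)       # run the asserts against it
        return True
    except Exception:
        return False

# Illustrative problem: implement `add`.
good = "def add(a, b):\n    return a + b"
bad = "def add(a, b):\n    return a - b"
tests = "assert add(2, 3) == 5\nassert add(-1, 1) == 0"

print(passes_unit_tests(good, tests))  # True
print(passes_unit_tests(bad, tests))   # False
```

Real harnesses additionally sandbox execution and enforce timeouts, since the model's code is untrusted.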
3-shot
MBPP
The MBPP benchmark tests a language model's ability to solve basic Python programming problems, focusing on fundamental programming concepts and standard library usage.
maj@1
GSM8K
The GSM8K benchmark tests a language model's ability to solve grade-school-level math problems that frequently require multiple steps of reasoning.
4-shot
MATH
The MATH benchmark evaluates a language model's ability to solve complex mathematical word problems, requiring reasoning, multi-step problem-solving, and the understanding of mathematical concepts.
AGIEval
The AGIEval benchmark tests a language model's general intelligence by using questions derived from real-world exams designed to assess human intellectual abilities (college entrance exams, law exams, etc.).
BBH
The BBH (BIG-Bench Hard) benchmark focuses on tasks deemed beyond the abilities of current language models, testing their limits across various reasoning and understanding domains.
[Charts: per-benchmark scores (0–100%) for Gemma 7B, Gemma 2B, Mistral 7B, LLaMA-2 13B, and LLaMA-2 7B]
*See the technical report for details on performance with other methodologies.
Access Gemma today
Gemma models are available in all your favorite model hubs.
Responsible AI development
Responsibility by Design
Pre-trained on carefully curated data and tuned for safety on top, helping to empower safe and responsible AI development with Gemma models.
Robust and Transparent Evaluation
Comprehensive evaluations and transparent reporting reveal model limitations, supporting a responsible approach for each use case.
Powering Responsible Development
The Responsible Generative AI Toolkit helps developers design and implement Responsible AI best practices.
Optimized for Google Cloud
With Gemma models on Google Cloud, you can deeply customize the model to your specific needs with Vertex AI's fully managed tools or GKE's self-managed option, and deploy it to flexible, cost-efficient, AI-optimized infrastructure.
Accelerating academic research with Google Cloud credits
Advance your research with PaliGemma models in Google Cloud. This new wave of multimodal open models extends our support for cutting-edge research. Apply now to receive Google Cloud credits to push the boundaries of your research and contribute to the advancement of the scientific community.
Join the community
Connect, explore, and share your knowledge with others in the ML model community.