Google AI Edge

On-device AI for mobile, web, and embedded applications

Generative AI, running on-device

MediaPipe LLM Inference API

Run LLMs completely on-device and perform a wide range of tasks, such as generating text, retrieving information in natural language form, and summarizing documents. The API provides built-in support for multiple text-to-text large language models, so you can apply the latest on-device generative AI models to your apps and products.
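
As a rough illustration, here is a minimal Android (Kotlin) sketch of the task. The class and method names (`LlmInferenceOptions`, `setModelPath`, `setMaxTokens`, `generateResponse`) follow the MediaPipe Tasks GenAI documentation, but verify them against the current release; the model path is a placeholder for a model you have provisioned yourself.

```kotlin
import android.content.Context
import com.google.mediapipe.tasks.genai.llminference.LlmInference

// Minimal sketch: load an on-device LLM and run a single text-to-text call.
fun summarize(context: Context, document: String): String {
    val options = LlmInference.LlmInferenceOptions.builder()
        .setModelPath("/data/local/tmp/llm/model.bin") // placeholder path
        .setMaxTokens(512) // cap on combined input and output tokens
        .build()

    val llm = LlmInference.createFromOptions(context, options)
    // Synchronous call; generateResponseAsync() streams partial results instead.
    return llm.generateResponse("Summarize the following document:\n$document")
}
```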

Torch Generative API

Author high-performance LLMs in PyTorch, then convert them to run on-device using the TensorFlow Lite (TFLite) runtime.
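
The authoring and conversion steps happen in Python with the AI Edge Torch tooling; what lands on the device is an ordinary .tflite file. To keep one language across these examples, the sketch below covers only the deployment half: loading a converted model with the TFLite Interpreter on Android. The tensor shapes are placeholder assumptions; LLM-style models are usually driven through signature runners rather than a single run() call.

```kotlin
import org.tensorflow.lite.Interpreter
import java.io.File

// Sketch of running a PyTorch-authored, TFLite-converted model on-device.
// Assumes a single [1, N] float input and a single [1, 10] float output.
fun runConvertedModel(modelFile: File, input: FloatArray): FloatArray {
    val interpreter = Interpreter(modelFile)
    val output = Array(1) { FloatArray(10) } // placeholder output shape
    interpreter.run(arrayOf(input), output)  // single-input, single-output call
    interpreter.close()
    return output[0]
}
```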

Gemini Nano

Access our most efficient Gemini model for on-device tasks via Android AICore. Coming soon to Chrome.
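
On Android, Gemini Nano is reached through the experimental Google AI Edge SDK on top of AICore, so treat the following as an assumption-laden sketch: the package name (com.google.ai.edge.aicore), the generationConfig builder, and generateContent are taken from the experimental SDK and may change.

```kotlin
import android.content.Context
import com.google.ai.edge.aicore.GenerativeModel
import com.google.ai.edge.aicore.generationConfig

// Hedged sketch: prompt Gemini Nano via AICore using the experimental SDK.
suspend fun rewriteFormally(appContext: Context, note: String): String? {
    val model = GenerativeModel(
        generationConfig = generationConfig {
            context = appContext   // required by the experimental config DSL
            temperature = 0.2f     // low randomness suits rewriting tasks
            maxOutputTokens = 256
        }
    )
    val response = model.generateContent("Rewrite more formally: $note")
    return response.text
}
```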

Why deploy ML on edge devices?

Latency

Skip the server round trip for easy, fast, real-time media processing.

Privacy

Perform inference locally, without sensitive data leaving the device.

Cost

Use on-device compute resources and save on server costs.

Offline availability

No network connection, no problem.