New
Work in progress: Agents Directory has just launched. Stay tuned, more content is on the way.
Sign InGemma 3 12B
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...
Capabilities:
Input
- Text input
- Image input (vision)
- File input (PDF)
- Audio input
- Video input
Output
- Text output
- Image output
- Audio output
Pricing & availability:
OpenRouter
$0.05 / $0.15 per M
Sources:
Details:
Provider
Google DeepMindContext window
131KInput price
$0.05/MOutput price
$0.15/MOpen weights
YesKnowledge cutoff
Aug 2024Released
Mar 2025