GLM 4.5V
GLM-4.5V is a vision-language foundation model for multimodal agent applications. Built on a Mixture-of-Experts (MoE) architecture with 106B total parameters, of which 12B are activated, it achieves state-of-the-art results in video understanding,...
Capabilities:
Input
- Text input
- Image input (vision)
- File input (PDF)
- Audio input
- Video input
Output
- Text output
- Image output
- Audio output
Pricing & availability:
OpenRouter
$0.6 / $1.8 per M tokens (input / output)
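To make the per-million pricing concrete, here is a minimal sketch that estimates the cost of a single request from its token counts. It assumes the listed prices are US dollars per one million tokens, with the first figure for input and the second for output. The function name `estimate_cost` and the example token counts are hypothetical, chosen only for illustration.

```python
# Prices taken from the listing, assumed to be USD per one million tokens.
INPUT_PRICE_PER_M = 0.6
OUTPUT_PRICE_PER_M = 1.8


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: 20,000 prompt tokens and 1,500 completion tokens.
    print(f"${estimate_cost(20_000, 1_500):.4f}")
```

Because output tokens cost three times as much as input tokens at these rates, long completions tend to dominate the bill sooner than long prompts do.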
Details:
- Provider: Z.AI
- Context window: 66K
- Input price: $0.6/M
- Output price: $1.8/M
- Open weights: Yes
- Knowledge cutoff: Dec 2024
- Released: Aug 2025