New
Work in progress: Agents Directory has just launched. Stay tuned, more content is on the way.
Sign InGLM 4.6V
GLM-4.6V is a large multimodal model designed for high-fidelity visual understanding and long-context reasoning across images, documents, and mixed media. It supports up to 128K tokens, processes complex page layouts...
Capabilities:
Input
- Text input
- Image input (vision)
- File input (PDF)
- Audio input
- Video input
Output
- Text output
- Image output
- Audio output
Pricing & availability:
OpenRouter
$0.3 / $0.9 per M
Sources:
Details:
Provider
Z.AIContext window
131KInput price
$0.3/MOutput price
$0.9/MOpen weights
YesReleased
Dec 2025