New

Work in progress: Agents Directory has just launched. Stay tuned, more content is on the way.

Sign In
Zhipu

GLM 4.6V

GLM-4.6V is a large multimodal model designed for high-fidelity visual understanding and long-context reasoning across images, documents, and mixed media. It supports up to 128K tokens, processes complex page layouts...

Capabilities:

Input
  • Text input
  • Image input (vision)
  • File input (PDF)
  • Audio input
  • Video input
Output
  • Text output
  • Image output
  • Audio output
Pricing & availability:
  • OpenRouterOpenRouter


    $0.3 / $0.9 per M
Share:
Details:
  • ZhipuProvider


    Z.AI
  • Context window


    131K
  • Input price


    $0.3/M
  • Output price


    $0.9/M
  • Open weights


    Yes
  • Released


    Dec 2025