New

Work in progress: Agents Directory has just launched. Stay tuned, more content is on the way.

Sign In
Qwen

Qwen3 VL 32B Instruct

Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual perception with advanced text...

Capabilities:

Input
  • Text input
  • Image input (vision)
  • File input (PDF)
  • Audio input
  • Video input
Output
  • Text output
  • Image output
  • Audio output
Pricing & availability:
  • OpenRouterOpenRouter


    $0.104 / $0.416 per M
Share:
Details:
  • QwenProvider


    Qwen
  • Context window


    262K
  • Input price


    $0.104/M
  • Output price


    $0.416/M
  • Open weights


    Yes
  • Released


    Oct 2025