Qwen3 VL 8B Instruct
Qwen3-VL-8B-Instruct is a multimodal vision-language model from the Qwen3-VL series, built for high-fidelity understanding and reasoning across text, images, and video. It features improved multimodal fusion with Interleaved-MRoPE for long-horizon...
Capabilities:
Input
- Text input
- Image input (vision)
- File input (PDF)
- Audio input
- Video input
Output
- Text output
- Image output
- Audio output
Pricing & availability:
OpenRouter
$0.08 input / $0.5 output per M tokens
Details:
- Provider: Qwen
- Context window: 256K
- Input price: $0.08/M
- Output price: $0.5/M
- Open weights: Yes
- Released: Oct 2025
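
As a rough illustration of how the listed per-million-token prices translate into the cost of a single request, here is a minimal sketch. The function name `estimate_cost` and the token counts are hypothetical and chosen only for the example. It assumes billing is strictly linear in token count, which the listing does not confirm.

```python
from decimal import Decimal

# Prices copied from the listing above, in US dollars per million tokens.
INPUT_PRICE_PER_M = Decimal("0.08")
OUTPUT_PRICE_PER_M = Decimal("0.5")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes cost scales linearly with token count; actual billing may
    differ, for example in how image or video inputs are tokenized.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = Decimal(input_tokens) * INPUT_PRICE_PER_M
    output_cost = Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    return (input_cost + output_cost) / TOKENS_PER_M


if __name__ == "__main__":
    # Illustrative request: 200,000 input tokens and 4,000 output tokens.
    print(f"${estimate_cost(200_000, 4_000)}")
```

Decimal arithmetic is used so that small per-token amounts do not pick up binary floating-point rounding noise when many requests are summed.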