Step 3.7 Flash
Step 3.7 Flash is StepFun's latest high-efficiency multimodal Mixture-of-Experts model. It pairs a 196B-parameter language backbone with a vision encoder for native image and video understanding, activating roughly 11B parameters...
Capabilities:
Input
- Text input
- Image input (vision)
- File input (PDF)
- Audio input
- Video input
Output
- Text output
- Image output
- Audio output
Pricing & availability:
OpenRouter
$0.20 input / $1.15 output per M tokens
Details:
Provider: StepFun
Context window: 256K
Input price: $0.2/M
Output price: $1.15/M
Open weights: No
Released: May 2026
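
As a rough illustration of how the listed prices translate into per-request cost, the sketch below multiplies token counts by the input and output rates. It assumes "/M" means per one million tokens; the constant names and the `estimate_cost` helper are hypothetical and not part of any StepFun or OpenRouter API.

```python
# Listed prices in USD, assuming "/M" means per one million tokens.
INPUT_PRICE_PER_M = 0.20
OUTPUT_PRICE_PER_M = 1.15


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper)."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 100,000-token prompt with a 2,000-token reply.
    print(f"${estimate_cost(100_000, 2_000):.4f}")
```

Because output tokens cost several times more than input tokens at these rates, long prompts with short replies stay comparatively cheap, while long generations dominate the bill.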