Back to all models
qwen
Qwen: Qwen3 VL 8B Instruct
Qwen3-VL-8B-Instruct is a multimodal vision-language model from the Qwen3-VL series, built for high-fidelity understanding and reasoning across text, images, and video. It features improved multimodal fusion with Interleaved-MRoPE for long-horizon...
qwen/qwen3-vl-8b-instruct
Context Size
131.072K
Input Price
492 Ks/M
Output Price
3,075 Ks/M
Architecture
Image
Text
Supported Parameters
frequency_penaltylogit_biasmax_tokensmin_ppresence_penaltyrepetition_penaltyresponse_formatseedstopstructured_outputstemperaturetool_choicetoolstop_ktop_p
Details
TokenizerQwen3
Max Completion32,768 tokens
Provider Context131.072K tokens
ModeratedNo