Category: provider
Model Studio QVQ Visual Reasoning
Validation
mkdir -p output/alicloud-ai-multimodal-qvq python -m py_compile skills/ai/multimodal/alicloud-ai-multimodal-qvq/scripts/prepare_qvq_request.py && echo "py_compile_ok" > output/alicloud-ai-multimodal-qvq/validate.txt
Pass criteria: command exits 0 and output/alicloud-ai-multimodal-qwen-vqv/validate.txt is generated.
Critical model names
Use one of these exact model strings:
-
qvq-plus
-
qvq-max
Typical use
-
Mathematical reasoning from screenshots
-
Diagram and chart reasoning
-
Visually grounded multi-step problem solving
Quick start
python skills/ai/multimodal/alicloud-ai-multimodal-qvq/scripts/prepare_qvq_request.py
--output output/alicloud-ai-multimodal-qvq/request.json
Notes
-
Use skills/ai/multimodal/alicloud-ai-multimodal-qwen-vl/ for standard image understanding.
-
Use QVQ when the task explicitly needs stronger reasoning over visual evidence.
References
- references/sources.md