alicloud-ai-entry-modelstudio

Route Alibaba Cloud Model Studio requests to the right local skill (Qwen Image, Qwen Image Edit, Wan Video, Wan R2V, Qwen TTS, Qwen ASR and advanced TTS variants). Use when the user asks for Model Studio without specifying a capability.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "alicloud-ai-entry-modelstudio" with this command: npx skills add cinience/alicloud-ai-entry-modelstudio

Category: task

Alibaba Cloud Model Studio Entry (Routing)

Route requests to existing local skills to avoid duplicating model/parameter details.

Prerequisites

  • Install SDK (virtual environment recommended to avoid PEP 668 restrictions):
python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope
  • Configure DASHSCOPE_API_KEY (environment variable preferred; or dashscope_api_key in ~/.alibabacloud/credentials).

Routing Table (currently supported in this repo)

NeedTarget skill
Text-to-image / image generationskills/ai/image/alicloud-ai-image-qwen-image/
Image editingskills/ai/image/alicloud-ai-image-qwen-image-edit/
Text-to-video / image-to-video (i2v)skills/ai/video/alicloud-ai-video-wan-video/
Reference-to-video (r2v)skills/ai/video/alicloud-ai-video-wan-r2v/
Text-to-speech (TTS)skills/ai/audio/alicloud-ai-audio-tts/
Speech recognition/transcription (ASR)skills/ai/audio/alicloud-ai-audio-asr/
Realtime speech recognitionskills/ai/audio/alicloud-ai-audio-asr-realtime/
Realtime TTSskills/ai/audio/alicloud-ai-audio-tts-realtime/
Live speech translationskills/ai/audio/alicloud-ai-audio-livetranslate/
CosyVoice voice cloneskills/ai/audio/alicloud-ai-audio-cosyvoice-voice-clone/
CosyVoice voice designskills/ai/audio/alicloud-ai-audio-cosyvoice-voice-design/
Voice cloneskills/ai/audio/alicloud-ai-audio-tts-voice-clone/
Voice designskills/ai/audio/alicloud-ai-audio-tts-voice-design/
Omni multimodal interactionskills/ai/multimodal/alicloud-ai-multimodal-qwen-omni/
Visual reasoningskills/ai/multimodal/alicloud-ai-multimodal-qvq/
Text embeddingsskills/ai/search/alicloud-ai-search-text-embedding/
Rerankskills/ai/search/alicloud-ai-search-rerank/
Vector retrievalskills/ai/search/alicloud-ai-search-dashvector/ or skills/ai/search/alicloud-ai-search-opensearch/ or skills/ai/search/alicloud-ai-search-milvus/
Document understandingskills/ai/text/alicloud-ai-text-document-mind/
Video editingskills/ai/video/alicloud-ai-video-wan-edit/
Model list crawl/updateskills/ai/misc/alicloud-ai-misc-crawl-and-skill/

When Not Matched

  • Clarify model capability and input/output type first.
  • If capability is missing in repo, add a new skill first.

Common Missing Capabilities In This Repo (remaining gaps)

  • text generation/chat (LLM)

  • multimodal embeddings

  • OCR-specialized extraction and image translation

  • virtual try-on / digital human / advanced video personas

  • For multimodal/ASR download failures, prefer public URLs listed above.

  • For ASR parameter errors, use data URI in input_audio.data.

  • For multimodal embedding 400, ensure input.contents is an array.

Async Task Polling Template (video/long-running tasks)

When X-DashScope-Async: enable returns task_id, poll as follows:

GET https://dashscope.aliyuncs.com/api/v1/tasks/<task_id>
Authorization: Bearer $DASHSCOPE_API_KEY

Example result fields (success):

{
  "output": {
    "task_status": "SUCCEEDED",
    "video_url": "https://..."
  }
}

Notes:

  • Recommended polling interval: 15-20 seconds, max 10 attempts.
  • After success, download output.video_url.

Clarifying questions (ask when uncertain)

  1. Are you working with text, image, audio, or video?
  2. Is this generation, editing/understanding, or retrieval?
  3. Do you need speech (TTS/ASR/live translate) or retrieval (embedding/rerank/vector DB)?
  4. Do you want runnable SDK scripts or just API/parameter guidance?

References

  • Model list and links:output/alicloud-model-studio-models-summary.md

  • API/parameters/examples: see target sub-skill SKILL.md and references/*.md

  • Official source list:references/sources.md

Validation

mkdir -p output/alicloud-ai-entry-modelstudio
echo "validation_placeholder" > output/alicloud-ai-entry-modelstudio/validate.txt

Pass criteria: command exits 0 and output/alicloud-ai-entry-modelstudio/validate.txt is generated.

Output And Evidence

  • Save artifacts, command outputs, and API response summaries under output/alicloud-ai-entry-modelstudio/.
  • Include key parameters (region/resource id/time range) in evidence files for reproducibility.

Workflow

  1. Confirm user intent, region, identifiers, and whether the operation is read-only or mutating.
  2. Run one minimal read-only query first to verify connectivity and permissions.
  3. Execute the target operation with explicit parameters and bounded scope.
  4. Verify results and save output/evidence files.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

Huo15 Openclaw Wechat Service

OpenClaw 微信服务号(公众号)渠道插件 v2.2.0 —— 新增**角色权限系统**:permissionMode='role-based' 支持 superadmin/admin/editor/operator/customer 五级角色,自由配置 rolePermissions 白名单;**AI 对话...

Registry SourceRecently Updated
General

AIWolfPK - AI狼人杀

四个AI互相猜疑,你坐着看戏。每局30秒,到底谁是狼? Four AIs play Werewolf while you watch. 30s per round. Spot the wolf before they do.

Registry SourceRecently Updated
General

Hot Content Creator

热点内容创作助手:自动抓取多平台热点,筛选创作方向,生成小红书封面图和虚拟博主形象, 写出有人味的爆款内容。触发词:热点创作、追热点、创作方向、小红书方向、抖音方向。 适用场景:需要从热点中挖掘内容创作方向并生成配图+文案的完整工作流。

Registry SourceRecently Updated
General

飞书 Markdown 文档发布器,完美支持支持图片、表格

将 Markdown 文件发布为飞书(Feishu/Lark)在线文档。 支持完整的 Markdown 语法,包括标题、段落、表格、有序/无序列表、 待办事项、分隔线、加粗/斜体等富文本样式。 当用户需要将本地 Markdown 文件同步或发布到飞书文档时使用。

Registry SourceRecently Updated