aliyun-qwen-multimodal-embedding

Use when multimodal embeddings are needed from Alibaba Cloud Model Studio models such as `qwen3-vl-embedding` for image, video, and text retrieval, cross-modal search, clustering, or offline vectorization pipelines.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "aliyun-qwen-multimodal-embedding" with this command: npx skills add cinience/aliyun-qwen-multimodal-embedding

Category: provider

Model Studio Multimodal Embedding

Validation

mkdir -p output/aliyun-qwen-multimodal-embedding
python -m py_compile skills/ai/search/aliyun-qwen-multimodal-embedding/scripts/prepare_multimodal_embedding_request.py && echo "py_compile_ok" > output/aliyun-qwen-multimodal-embedding/validate.txt

Pass criteria: command exits 0 and output/aliyun-qwen-multimodal-embedding/validate.txt is generated.

Output And Evidence

  • Save normalized request payloads, selected dimensions, and sample input references under output/aliyun-qwen-multimodal-embedding/.
  • Record the exact model, modality mix, and output vector dimension for reproducibility.

Use this skill when the task needs text, image, or video embeddings from Model Studio for retrieval or similarity workflows.

Critical model names

Use one of these exact model strings as needed:

  • qwen3-vl-embedding
  • qwen2.5-vl-embedding
  • tongyi-embedding-vision-plus-2026-03-06

Selection guidance:

  • Prefer qwen3-vl-embedding for the newest multimodal embedding path.
  • Use qwen2.5-vl-embedding when you need compatibility with an older deployed pipeline.

Prerequisites

  • Set DASHSCOPE_API_KEY in your environment, or add dashscope_api_key to ~/.alibabacloud/credentials.
  • Pair this skill with a vector store such as DashVector, OpenSearch, or Milvus when building retrieval systems.

Normalized interface (embedding.multimodal)

Request

  • model (string, optional): default qwen3-vl-embedding
  • texts (array<string>, optional)
  • images (array<string>, optional): public URLs or local paths uploaded by your client layer
  • videos (array<string>, optional): public URLs where supported
  • dimension (int, optional): e.g. 2560, 2048, 1536, 1024, 768, 512, 256 for qwen3-vl-embedding

Response

  • embeddings (array<object>)
  • dimension (int)
  • usage (object, optional)

Quick start

python skills/ai/search/aliyun-qwen-multimodal-embedding/scripts/prepare_multimodal_embedding_request.py \
  --text "A cat sitting on a red chair" \
  --image "https://example.com/cat.jpg" \
  --dimension 1024

Operational guidance

  • Keep input.contents as an array; malformed shapes are a common 400 cause.
  • Pin the output dimension to match your index schema before writing vectors.
  • Use the same model and dimension across one vector index to avoid mixed-vector incompatibility.
  • For large image or video batches, stage files in object storage and reference stable URLs.

Output location

  • Default output: output/aliyun-qwen-multimodal-embedding/request.json
  • Override base dir with OUTPUT_DIR.

References

  • references/sources.md

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

通义晓蜜 - 智能外呼

触发阿里云晓蜜外呼机器人任务,自动批量拨打电话。适用于批量外呼、客户回访、满意度调查、简历筛查约面试等场景。可从前置工具或节点获取外呼名单。

Registry SourceRecently Updated
General

Letterboxd Watchlist

Scrape a public Letterboxd user's watchlist into a CSV/JSONL list of titles and film URLs without logging in. Use when a user asks to export, scrape, or mirror a Letterboxd watchlist, or to build watch-next queues.

Registry SourceRecently Updated
General

Seedance Video Generation

Generate AI videos using ByteDance Seedance. Use when the user wants to: (1) generate videos from text prompts, (2) generate videos from images (first frame, first+last frame, reference images), or (3) query/manage video generation tasks. Supports Seedance 1.5 Pro (with audio), 1.0 Pro, 1.0 Pro Fast, and 1.0 Lite models.

Registry SourceRecently Updated
4.2K17jackycser
General

Universal Skills Manager

The master coordinator for AI skills. Discovers skills from multiple sources (SkillsMP.com, SkillHub, and ClawHub), manages installation, and synchronization...

Registry SourceRecently Updated