Category: task
Alibaba Cloud Model Studio Entry (Routing)
Route requests to existing local skills to avoid duplicating model/parameter details.
Prerequisites
- Install SDK (virtual environment recommended to avoid PEP 668 restrictions):
python3 -m venv .venv . .venv/bin/activate python -m pip install dashscope
- Configure DASHSCOPE_API_KEY (environment variable preferred; or dashscope_api_key in ~/.alibabacloud/credentials ).
Routing Table (currently supported in this repo)
Need Target skill
Text-to-image / image generation skills/ai/image/alicloud-ai-image-qwen-image/
Image editing skills/ai/image/alicloud-ai-image-qwen-image-edit/
Text-to-video / image-to-video (i2v) skills/ai/video/alicloud-ai-video-wan-video/
Reference-to-video (r2v) skills/ai/video/alicloud-ai-video-wan-r2v/
Text-to-speech (TTS) skills/ai/audio/alicloud-ai-audio-tts/
Speech recognition/transcription (ASR) skills/ai/audio/alicloud-ai-audio-asr/
Realtime speech recognition skills/ai/audio/alicloud-ai-audio-asr-realtime/
Realtime TTS skills/ai/audio/alicloud-ai-audio-tts-realtime/
Live speech translation skills/ai/audio/alicloud-ai-audio-livetranslate/
CosyVoice voice clone skills/ai/audio/alicloud-ai-audio-cosyvoice-voice-clone/
CosyVoice voice design skills/ai/audio/alicloud-ai-audio-cosyvoice-voice-design/
Voice clone skills/ai/audio/alicloud-ai-audio-tts-voice-clone/
Voice design skills/ai/audio/alicloud-ai-audio-tts-voice-design/
Omni multimodal interaction skills/ai/multimodal/alicloud-ai-multimodal-qwen-omni/
Visual reasoning skills/ai/multimodal/alicloud-ai-multimodal-qvq/
Text embeddings skills/ai/search/alicloud-ai-search-text-embedding/
Rerank skills/ai/search/alicloud-ai-search-rerank/
Vector retrieval skills/ai/search/alicloud-ai-search-dashvector/ or skills/ai/search/alicloud-ai-search-opensearch/ or skills/ai/search/alicloud-ai-search-milvus/
Document understanding skills/ai/text/alicloud-ai-text-document-mind/
Video editing skills/ai/video/alicloud-ai-video-wan-edit/
Model list crawl/update skills/ai/misc/alicloud-ai-misc-crawl-and-skill/
When Not Matched
-
Clarify model capability and input/output type first.
-
If capability is missing in repo, add a new skill first.
Common Missing Capabilities In This Repo (remaining gaps)
text generation/chat (LLM)
multimodal embeddings
OCR-specialized extraction and image translation
virtual try-on / digital human / advanced video personas
For multimodal/ASR download failures, prefer public URLs listed above.
For ASR parameter errors, use data URI in input_audio.data .
For multimodal embedding 400, ensure input.contents is an array.
Async Task Polling Template (video/long-running tasks)
When X-DashScope-Async: enable returns task_id , poll as follows:
GET https://dashscope.aliyuncs.com/api/v1/tasks/<task_id> Authorization: Bearer $DASHSCOPE_API_KEY
Example result fields (success):
{ "output": { "task_status": "SUCCEEDED", "video_url": "https://..." } }
Notes:
-
Recommended polling interval: 15-20 seconds, max 10 attempts.
-
After success, download output.video_url .
Clarifying questions (ask when uncertain)
-
Are you working with text, image, audio, or video?
-
Is this generation, editing/understanding, or retrieval?
-
Do you need speech (TTS/ASR/live translate) or retrieval (embedding/rerank/vector DB)?
-
Do you want runnable SDK scripts or just API/parameter guidance?
References
Model list and links:output/alicloud-model-studio-models-summary.md
API/parameters/examples: see target sub-skill SKILL.md and references/*.md
Official source list:references/sources.md
Validation
mkdir -p output/alicloud-ai-entry-modelstudio echo "validation_placeholder" > output/alicloud-ai-entry-modelstudio/validate.txt
Pass criteria: command exits 0 and output/alicloud-ai-entry-modelstudio/validate.txt is generated.
Output And Evidence
-
Save artifacts, command outputs, and API response summaries under output/alicloud-ai-entry-modelstudio/ .
-
Include key parameters (region/resource id/time range) in evidence files for reproducibility.
Workflow
-
Confirm user intent, region, identifiers, and whether the operation is read-only or mutating.
-
Run one minimal read-only query first to verify connectivity and permissions.
-
Execute the target operation with explicit parameters and bounded scope.
-
Verify results and save output/evidence files.