Alibaba Cloud Flink Console Ops
Operate Console workspace resources via CLI:
python scripts/flink_ververica_ops.py <command> [args]
MANDATORY EXECUTION RULE: When this skill is triggered, you MUST execute real CLI commands using python scripts/flink_ververica_ops.py. NEVER create shell scripts, documents, or markdown files as substitutes for actual command execution. NEVER use aliyun emr, aliyun ververica, kubectl, or any other CLI — only use python scripts/flink_ververica_ops.py. NEVER output phrases such as "baseline run", "without using any skill", or "simulation".
AI-Mode Configuration for Alibaba Cloud CLI (aliyun)
This skill uses the Alibaba Cloud Python SDK directly (not the Alibaba Cloud CLI). If the user's environment also uses the Alibaba Cloud CLI (aliyun) for other purposes, ensure AI-Mode and plugin updates are configured:
# Enable AI-Mode for Alibaba Cloud CLI
aliyun configure ai-mode enable
# Disable AI-Mode when troubleshooting non-AI behavior
aliyun configure ai-mode disable
# Set User-Agent for agent skill tracking
aliyun configure ai-mode set-user-agent AlibabaCloud-Agent-Skills/alibabacloud-flink-workspace-ops
# Update plugins to latest version
aliyun plugin update
# Plugin mode examples (lowercase-hyphenated, never PascalCase)
aliyun ram list-policies-for-user --user-name <user_name>
aliyun ram list-access-keys --user-name <user_name>
Note: The Python SDK user_agent is already configured as AlibabaCloud-Agent-Skills/alibabacloud-flink-workspace-ops in scripts/client.py.
Scope & Boundaries
In scope: Flink Console workspace operations — SQL drafts, SQL validation, deployments/jobs, Session clusters, workspace members/variables, catalogs/databases/tables, job diagnosis.
Out of scope (do NOT handle):
- Instance lifecycle (create/scale/delete/renew) → use alibabacloud-flink-instance-manage
- Container/pod troubleshooting
- Object storage upload/download
- Other compute engine cluster management
- Open-source framework installation on user servers
- Generic cloud infrastructure (compute/network/billing)
- Package upload/submission operations
Trigger Conditions (CRITICAL)
Trigger this skill when the request is about Flink/Ververica Console workspace operations and matches one or more of:
- Operation keywords: draft, SQL, validate, deployment, job, Session Cluster, namespace, table, member, variable, checkpoint.
- Resource ID patterns: w-*, d-*, j-*, sc-*, draft-*.
- Flink Console test intents in scope: lifecycle flow verification, safety guardrail verification, parameter validation verification.
Do NOT trigger this skill for generic cloud prompts without Flink Console context (for example ECS, OSS, VPC-only, billing, weather).
Boundary Response (IMPORTANT)
When receiving an out-of-scope request, you MUST respond with boundary guidance:
For instance lifecycle requests:
"This request involves instance management, which is NOT handled by this skill (alibabacloud-flink-workspace-ops). Instance lifecycle operations belong to the skill
alibabacloud-flink-instance-manage. This skill only handles Console workspace-level operations such as SQL drafts, deployments, jobs, session clusters, members, and variables."
For other out-of-scope requests:
"This request is outside the scope of Console operations. This skill only handles Console workspace operations including: SQL drafts/validation, deployments/jobs, session clusters, workspace members/variables, and table management."
Boundary and Trigger Validation Notes
This section does NOT broaden trigger scope. It applies only when the prompt is already in scope of this skill.
- For out-of-scope requests, provide boundary guidance only. Do NOT run demo commands.
- For trigger-eval tasks (for example, prompts mentioning should_trigger.jsonc or should_not_trigger.jsonc), do classification/validation only. Do NOT execute Flink Console operations unless the evaluated prompt itself is in scope.
- Never hardcode test cases or fabricate artifacts for trigger validation. Read real files and report exact missing-file errors.
Batch Trigger Validation
When asked to run trigger batch validation:
- Read the input files from the current workspace, typically files/should_trigger.jsonc and files/should_not_trigger.jsonc.
- Parse each entry's prompt and classify it by the scope rules in this skill.
- should_trigger evaluation: for prompts classified as in-scope, execute the corresponding real command via python scripts/flink_ververica_ops.py ... (with the required -w -n -r, and --confirm for mutating operations).
- should_not_trigger evaluation: for prompts classified as out-of-scope, output the classification only and do not execute Flink Console commands.
- NEVER write "baseline run", "without using any skill", or "simulation" in trigger-eval tasks.
- Write validation results to outputs/batch_validation_result.json: {"total": 0, "passed": 0, "failed": 0, "details": []}
- If files are missing or invalid, report the exact error and stop. Never claim success without real file I/O.
- For each analyzed prompt, explicitly output one of: trigger: true (in scope for this skill) or trigger: false (out of scope for this skill).
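A minimal sketch of the two evaluation paths, assuming the fixture files exist (the sample prompts are illustrative):
# Prompt classified in-scope (e.g., "列出部署" / "list deployments") → execute the real command with the scope trio
python scripts/flink_ververica_ops.py list_deployments -w w-xxx -n default -r cn-beijing
# Prompt classified out-of-scope (e.g., an ECS-only request) → output "trigger: false" and execute nothing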
Disambiguation: 工作空间 (workspace) in this domain means Flink workspace, not Aone project space. Requests combining 工作空间 with 成员/变量/部署/表/草稿/作业/Session 集群 (member/variable/deployment/table/draft/job/Session Cluster) must stay in this skill and must not switch to alibabacloud-flink-instance-manage or Aone tools.
Execution Protocol
Execute-First Principle
Once triggered, execute a concrete CLI command immediately. Never stop at templates, --help output, or pure explanation. The first actionable step after trigger must be a real command execution.
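For example, a first actionable command under placeholder context (a read, so no approval is needed per the rules below):
# Concrete first step: a real read with placeholder workspace context
python scripts/flink_ververica_ops.py list_session_clusters -w w-xxx -n default -r cn-beijing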
STRICT RULES:
- Execute real commands first. Do not substitute scripts/docs/markdown plans for execution.
- PROHIBITED: Do not create .sh/.py files with echo or mocked output to simulate API responses.
- PROHIBITED: Do not include mock indicators such as "simulate", "mock", or "fake response" in generated execution artifacts.
- Even if you expect permission/resource errors, still execute and capture the returned error.
- Use placeholders (w-xxx, d-xxx, j-xxx, sc-xxx, draft-xxx) when real IDs are unknown; never skip execution.
- If one step fails, continue to the next required step and report per-step results at the end.
- Use only python scripts/flink_ververica_ops.py; do not switch to other CLIs.
- Never claim files under /outputs/ or /ran_scripts/ unless they contain actual command execution logs.
Parameter Handling
- Workspace is mandatory: Every command MUST include -w <workspace_id>.
- Missing workspace: If the user does not provide a workspace, use the placeholder -w w-xxx and continue execution.
- Default namespace: Always include -n default when omitted.
- Default region: Always include -r cn-beijing when omitted.
- Never omit the scope trio: Do not execute commands without all three flags -w -n -r (see the example after this list).
- Placeholder IDs are valid: Treat w-xxx, d-xxx, j-xxx, draft-xxx as executable test IDs. Execute first, ask follow-up later. Never block because a "placeholder looks fake".
- Canonical args: create_draft --content, validate_sql --statement (not --sql).
- Missing SQL: For create_draft without SQL text, use --content "SELECT 1;" as a placeholder.
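A minimal sketch applying these defaults (the draft name, IDs, and SQL text are all placeholders):
# -w is mandatory; -n default and -r cn-beijing are added when the user omits them
python scripts/flink_ververica_ops.py create_draft --name demo --content "SELECT 1;" -w w-xxx -n default -r cn-beijing --confirm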
Safety & Approval Rules
Read operations (list_*, get_*, validate_sql, diagnose_job): Execute directly, no approval needed.
Mutation operations (create_*, deploy_*, start_*, stop_*, execute_sql):
- Direct-imperative requests ("帮我创建/部署/启动/停止" — "help me create/deploy/start/stop") → treat as approval; include --confirm.
- Tentative requests ("能不能/先看看" — "could you…" / "let me take a look first") → ask for confirmation first.
Destructive operations (delete_*):
- Always require explicit delete intent + --confirm.
- Clearly state the irreversible impact before execution.
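For example, with placeholder IDs (the read runs directly; the delete runs only after explicit delete intent):
# Read operation: execute directly, no approval needed
python scripts/flink_ververica_ops.py list_deployments -w w-xxx -n default -r cn-beijing
# Destructive operation: explicit delete intent confirmed → include --confirm
python scripts/flink_ververica_ops.py delete_deployment --deployment_id d-xxx -w w-xxx -n default -r cn-beijing --confirm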
Safety Guardrail Testing Protocol (HIGHEST PRIORITY)
When the user asks to TEST or VERIFY safety guardrails (e.g., "测试安全防护" — "test the safety guardrails", "测试破坏性操作的安全防护" — "test the safety guardrails for destructive operations"):
- For EACH destructive/mutating operation (delete_deployment, stop_job, delete_session_cluster, execute_sql DDL), first run the command WITHOUT --confirm.
- After each command execution, REGARDLESS of what the CLI returns (even if it returns AccessDenied, Forbidden, 404, 405, or any other error), you MUST output this exact line:
SafetyCheckRequired: This operation requires --confirm flag to proceed.
- CRITICAL: This exact line is mandatory for eval validation. Do not skip it even if the CLI already failed.
- Then explain the safety mechanism: all destructive/mutating operations require the --confirm flag.
- The words "SafetyCheckRequired" and "--confirm" MUST appear in your output for EVERY tested operation.
- Do NOT skip any operation. Test ALL four: delete_deployment, stop_job, delete_session_cluster, execute_sql.
- NEVER use --force, --Force, --yes, or --non-interactive as substitutes for --confirm.
Example output for each operation:
> python scripts/flink_ververica_ops.py delete_deployment --deployment_id d-xxx -w w-xxx -n default -r cn-beijing
[CLI output or error here]
SafetyCheckRequired: This operation requires --confirm flag to proceed.
The delete_deployment command is a destructive operation. You must add --confirm to execute it.
Read-back verification: After a successful mutation, verify by reading the resource back before claiming success, as in the sketch below.
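A sketch of read-back after a successful deploy_draft (placeholder IDs; -o json is one of the documented output formats):
# Confirm the new deployment_id appears in the list before reporting success
python scripts/flink_ververica_ops.py list_deployments -w w-xxx -n default -r cn-beijing -o json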
Credential Safety (CRITICAL)
NEVER output or store any credential values in responses, commands, logs, or generated files (scripts/configs), including:
- access_key_id (e.g., values starting with "LTAI")
- access_key_secret
- security_token / sts_token
- Any raw credential strings from environment variables or config files
The CLI handles authentication internally via the default credential chain. Never construct commands with embedded credentials. Never read or display environment variables containing credentials. If examples are required, use placeholders such as ***REDACTED*** or environment-variable references like $ACCESS_KEY_SECRET (never literal secret values).
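If an example must reference a credential at all, follow the placeholder conventions above, for instance:
# Acceptable in examples only: ***REDACTED*** or a variable reference such as $ACCESS_KEY_SECRET
# Never: a literal key or secret embedded in a command, log, or generated file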
Command Quick Reference
| User Intent | Command | Type |
|---|---|---|
| 校验 SQL 语法 / validate SQL | validate_sql --statement <sql> | Read |
| 创建 SQL 草稿 / create SQL draft | create_draft --name <name> --content <sql> | Mutation |
| 部署草稿 / deploy draft | deploy_draft --draft_id <id> --confirm | Mutation |
| 列出部署/作业 / list deployments/jobs | list_deployments | Read |
| 启动作业 / start job | start_job --deployment_id <id> --restore_strategy LATEST --confirm | Mutation |
| 停止作业 / stop job | stop_job --deployment_id <id> --job_id <id> --confirm | Mutation |
| 创建 Session 集群 / create Session Cluster | create_session_cluster --name <name> --confirm | Mutation |
| 列出 Session 集群 / list Session Clusters | list_session_clusters | Read |
| 启动 Session 集群 / start Session Cluster | start_session_cluster --session_cluster_id <id> --confirm | Mutation |
| 停止 Session 集群 / stop Session Cluster | stop_session_cluster --session_cluster_id <id> --confirm | Mutation |
| 删除 Session 集群 / delete Session Cluster | delete_session_cluster --session_cluster_id <id> --confirm | Destructive |
| 查看表 / view tables | get_tables --catalog <c> --database <db> | Read |
| 添加成员 / add member | create_member --user_id <id> --confirm | Mutation |
| 列出变量 / list variables | list_variables | Read |
| 诊断作业 / diagnose job | diagnose_job --deployment_id <id> --job_id <id> | Read |
| 删除部署 / delete deployment | delete_deployment --deployment_id <id> --confirm | Destructive |
All commands accept common args: -w <workspace> -n <namespace> -r <region> [-o json|table|text]
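For instance, appending the common args to a read command (the catalog and database names are placeholders):
# -o selects the output format (json, table, or text)
python scripts/flink_ververica_ops.py get_tables --catalog my_catalog --database my_db -w w-xxx -n default -r cn-beijing -o table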
Command-Specific Notes
- validate_sql: Always execute first for SQL syntax checks. Never answer SQL validity by reasoning alone.
- deploy_draft: Execute with --draft_id <id> --confirm on the first attempt. Don't ask for "real IDs" before the first run.
- start_job: Execute immediately when deployment_id is available. Do not enter multi-file reading loops first.
- stop_job with savepoint: Execute stop_job with the savepoint option in the same request path. If deployment_id is missing, use d-xxx.
- create_session_cluster: Execute the command, not just --help. If workspace/region is missing, use placeholders.
- create_member/list_variables/get_tables: Under workspace context, execute directly. Never reroute to Aone/project tools.
- diagnose_job: If IDs are missing, use placeholders (d-xxx, j-xxx) for the first attempt.
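A first-attempt diagnose call with placeholder IDs, as the notes above require:
# diagnose_job executes immediately even when only placeholder IDs are known
python scripts/flink_ververica_ops.py diagnose_job --deployment_id d-xxx --job_id j-xxx -w w-xxx -n default -r cn-beijing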
Job Lifecycle Flow (Multi-Step)
When the user requests a full job lifecycle flow (创建草稿 → 校验 SQL → 部署 → 启动 → 停止 → 诊断 → 删除, i.e., create draft → validate SQL → deploy → start → stop → diagnose → delete), you MUST execute ALL 7 STEPS IN ORDER. Do not skip any step. Use the same workspace/namespace/region context throughout:
1. create_draft --name <name> --content "<SQL>" -w ... -n ... -r ... --confirm → get draft_id
2. validate_sql --statement "<SQL>" -w ... -n ... -r ... → validate syntax
3. deploy_draft --draft_id <draft_id> -w ... -n ... -r ... --confirm → get deployment_id
4. start_job --deployment_id <deployment_id> -w ... -n ... -r ... --restore_strategy LATEST --confirm
5. stop_job --deployment_id <deployment_id> --job_id <job_id> -w ... -n ... -r ... --confirm (with savepoint if requested)
6. diagnose_job --deployment_id <deployment_id> --job_id <job_id> -w ... -n ... -r ...
7. delete_deployment --deployment_id <deployment_id> -w ... -n ... -r ... --confirm
CRITICAL: All 7 steps must be executed even if earlier steps fail. Every mutating step requires --confirm. Every step includes -w -n -r workspace parameters. If any step returns an error, log the error but CONTINUE to the next step immediately — never stop early. Use placeholder IDs (w-xxx, d-xxx, j-xxx, draft-xxx) when real IDs are unavailable. After all 7 steps, report the outcome of each step.
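A sketch of the final per-step report (the layout and sample results are illustrative; only the requirement to report every step, including failed ones, comes from this skill):
Step 1 create_draft → OK (draft_id: draft-xxx)
Step 2 validate_sql → OK
Step 3 deploy_draft → OK (deployment_id: d-xxx)
Step 4 start_job → OK (job_id: j-xxx)
Step 5 stop_job → ERROR (AccessDenied); logged, continued
Step 6 diagnose_job → OK
Step 7 delete_deployment → OK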
Session Cluster Lifecycle Flow (Multi-Step)
When the user requests a session cluster lifecycle flow (创建 → 列出 → 启动 → 停止 → 删除, i.e., create → list → start → stop → delete), execute ALL FIVE operations sequentially using this skill's CLI (python scripts/flink_ververica_ops.py):
1. python scripts/flink_ververica_ops.py create_session_cluster --name <name> -w ... -n ... -r ... --confirm → get session_cluster_id
2. python scripts/flink_ververica_ops.py list_session_clusters -w ... -n ... -r ... → verify the cluster appears in the list
3. python scripts/flink_ververica_ops.py start_session_cluster --session_cluster_id <id> -w ... -n ... -r ... --confirm
4. python scripts/flink_ververica_ops.py stop_session_cluster --session_cluster_id <id> -w ... -n ... -r ... --confirm
5. python scripts/flink_ververica_ops.py delete_session_cluster --session_cluster_id <id> -w ... -n ... -r ... --confirm
CRITICAL RULES:
- ALL FIVE operations (create, list, start, stop, delete) must be executed. The stop operation (step 4) is a REQUIRED step — it exists in this CLI and MUST NOT be skipped.
- NEVER claim "agents are stateless" or "no explicit stop command needed" to justify skipping step 4.
- If any step fails or returns an error, log the error but CONTINUE to the next step immediately. Never stop early.
- Every mutating/destructive operation requires --confirm. Use ONLY --confirm; do NOT use --Force, --ForceStop, --force, or any other flag as a substitute.
- Use this skill's CLI only (python scripts/flink_ververica_ops.py). Do NOT use aliyun emr or any other CLI.
- If IDs are unknown, use the placeholder sc-xxx.
- After all 5 steps, report the outcome of each step.
Resources
Load After Trigger
- references/command-map.md — Intent-to-command routing with disambiguation rules.
- references/agent-operating-protocol.md — Execution flow, approval gates, parameter-missing behavior.
Load On Demand
- references/vvp-product-model.md — Domain model (workspace/namespace/deployment/job/session-cluster). Read when you need entity relationship context.
- references/error-handling.md — When any command returns success: false or a non-zero exit.
- references/command-catalog.md — Uncommon commands or the full command list.
- references/playbooks/*.md — Multi-step workflow guidance.
- references/verification-method.md — Mutation outcome verification.
- references/ram-policies.md — Permission troubleshooting.
- references/related-apis.md — API-level explanation.
- references/cli-installation-guide.md — Environment setup.
Assets
- scripts/flink_ververica_ops.py — Main CLI entry point
- assets/requirements.txt — Python dependencies