media-crawler-local

通过本机 media-agent-crawler HTTP 服务搜集 B站/抖音/YouTube/知乎内容(不依赖 MCP 客户端安装)。当用户要搜集这些平台内容、并已在本机启动应用(默认 http://127.0.0.1:39002)时使用。

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "media-crawler-local" with this command: npx skills add media-crawl

media-crawler-local

直接调用本机 HTTP 服务,不走 OpenClaw/Cursor 的 MCP 客户端配置。

前置确认

先从用户消息或上下文中提取以下信息,缺少时再询问:

  • 操作类型:搜集内容 / 查询归档 / 读取任务数据
  • 目标链接或关键词
  • 平台(可从链接自动推断)

工具清单

B 站系列

工具名必填参数说明
crawl_bilibiliurl视频 URL 或 BV 号
crawl_bilibili_searchkeyword按关键词触发搜索结果搜集
crawl_bilibili_uploadermidUP 主纯数字 ID,触发视频列表搜集
crawl_bilibili_popular热门视频搜集
crawl_bilibili_weekly无(可选 number每周必看,不传 number 则自动取最新一期
crawl_bilibili_history无(可选 max/view_at/business/ps/type/page_count历史记录聚合搜集,不传 page_count 时跟随 dailyRecommendPageCount

所有 B 站工具均支持可选 cookies 参数(字符串,从浏览器插件获取)。

其他平台

工具名必填参数说明
crawl_douyinurl抖音视频 URL
crawl_youtubeurlYouTube 视频 URL 或视频 ID
crawl_zhihuurl知乎问题或回答 URL

归档与数据读取

工具名必填参数可选参数说明
list_archivesplatform / keyword / limit / sort_by / created_after列出归档任务,默认返回最多 50 条,按时间倒序
get_task_datatask_idtype读取任务目录下的数据文件

list_archives 参数说明:

  • sort_bydate(默认,创建时间倒序)或 status(running → failed → unknown → finished)
  • created_after:ISO 日期,如 2026-03-182026-03-18T10:00:00Z

get_task_datatype 支持以下值(含别名):

type 值读取的数据
comments / comment评论数据
danmaku弹幕数据
subtitles / subtitle / caption / captions字幕数据
detail / info视频/帖子详情
all / full全量聚合数据
summary / ai_summaryAI 摘要
不传返回目录下所有可识别文件

HTTP 端点

服务地址默认 http://127.0.0.1:39002,可通过环境变量 BIL_CRAWL_URL 覆盖。

搜集端点(REST)

POST /start-crawl/{platform}/{encodedUrl}
Content-Type: application/json

{ "source": "ai" }

encodedUrl 需要 encodeURIComponent 编码;platform 取值:bilibili / douyin / youtube / zhihu

MCP 端点(JSON-RPC 2.0)

POST /mcp
Accept: application/json, text/event-stream
Content-Type: application/json

请求体格式:

{ "jsonrpc": "2.0", "id": 1, "method": "tools/call", "params": { "name": "<tool>", "arguments": { } } }

调用方式选择

根据当前环境按优先级选择:

优先级条件方式
1任何系统(无需额外依赖)内联命令(见下方)
2有 Node.jsnode skills/scripts/*.mjs
3有 bash(macOS/Linux/Git Bash)bash skills/scripts/*.sh

内联命令(首选,无需任何依赖)

AI 直接通过 Shell 工具执行,根据系统自动选择:

Windows(PowerShell 内置)

先设置当前会话为 UTF-8(避免中文输出乱码):

[Console]::InputEncoding = [System.Text.UTF8Encoding]::new($false)
[Console]::OutputEncoding = [System.Text.UTF8Encoding]::new($false)
$OutputEncoding = [Console]::OutputEncoding

REST 搜集:

$encoded = [Uri]::EscapeDataString("https://www.bilibili.com/video/BV1xx411c7mD")
Invoke-RestMethod -Uri "http://127.0.0.1:39002/start-crawl/bilibili/$encoded" -Method POST -ContentType "application/json" -Body '{"source":"ai"}' | ConvertTo-Json -Depth 10

MCP 工具调用:

$body = '{"jsonrpc":"2.0","id":2,"method":"tools/call","params":{"name":"list_archives","arguments":{"platform":"bilibili","limit":20}}}'
Invoke-RestMethod -Uri "http://127.0.0.1:39002/mcp" -Method POST -ContentType "application/json" -Headers @{Accept="application/json, text/event-stream"} -Body $body | ConvertTo-Json -Depth 10

macOS / Linux(curl 系统自带)

REST 搜集:

curl -fsS -X POST "http://127.0.0.1:39002/start-crawl/bilibili/$(node -e 'process.stdout.write(encodeURIComponent(process.argv[1]))' 'https://www.bilibili.com/video/BV1xx411c7mD')" \
  -H 'Content-Type: application/json' -d '{"source":"ai"}'

MCP 工具调用:

curl -fsS -X POST "http://127.0.0.1:39002/mcp" \
  -H 'Content-Type: application/json' \
  -H 'Accept: application/json, text/event-stream' \
  -d '{"jsonrpc":"2.0","id":2,"method":"tools/call","params":{"name":"list_archives","arguments":{"platform":"bilibili","limit":20}}}'

URL 编码:Windows 用 [Uri]::EscapeDataString(),macOS/Linux 用 python3 -c "import urllib.parse,sys; print(urllib.parse.quote(sys.argv[1]))" "<url>"node -e(如有)。


脚本用法(备选)

所有脚本位于 skills/scripts/,提供 .mjs(Node.js)和 .sh(bash)两套。

Node.js(node skills/scripts/*.mjs

1. 快速搜集(REST,crawl.mjs

node skills/scripts/crawl.mjs <platform> <url> [base_url]

示例:

node skills/scripts/crawl.mjs bilibili "https://www.bilibili.com/video/BV1xx411c7mD"

2. 通过 MCP 搜集(crawl_mcp.mjs,仅支持带 url 的工具)

node skills/scripts/crawl_mcp.mjs <tool_name> <target_url> [base_url]

示例:

node skills/scripts/crawl_mcp.mjs crawl_bilibili "https://www.bilibili.com/video/BV1xx411c7mD"

支持工具:crawl_bilibili / crawl_douyin / crawl_youtube / crawl_zhihu

其余工具(bilibili_search / bilibili_uploader / bilibili_popular / bilibili_weekly / bilibili_history / list_archives / get_task_data)请用 mcp_tool.mjs

3. 归档查询(list_archives_mcp.mjs

node skills/scripts/list_archives_mcp.mjs [platform] [keyword] [limit] [base_url]

示例:

node skills/scripts/list_archives_mcp.mjs bilibili "蛋神" 20

4. 通用工具调用(mcp_tool.mjs

node skills/scripts/mcp_tool.mjs <tool_name> [args_json] [base_url]

示例:

node skills/scripts/mcp_tool.mjs crawl_bilibili_search '{"keyword":"蛋神"}'
node skills/scripts/mcp_tool.mjs crawl_bilibili_uploader '{"mid":"123456"}'
node skills/scripts/mcp_tool.mjs crawl_bilibili_popular '{}'
node skills/scripts/mcp_tool.mjs crawl_bilibili_weekly '{}'
node skills/scripts/mcp_tool.mjs crawl_bilibili_weekly '{"number":364}'
node skills/scripts/mcp_tool.mjs crawl_bilibili_history '{}'
node skills/scripts/mcp_tool.mjs crawl_bilibili_history '{"max":0,"view_at":0,"business":"","ps":20,"type":"all"}'
node skills/scripts/mcp_tool.mjs crawl_bilibili_history '{"page_count":2}'
node skills/scripts/mcp_tool.mjs get_task_data '{"task_id":"BV1xx411c7mD-123456","type":"comments"}'

执行流程

  1. 判断环境:读取系统信息中的 OS(win32 → PowerShell 内联,其余 → curl 内联)。
  2. 健康检查GET /(连不上则提醒用户先启动 Electron 应用)。
  3. 发起搜集
    • 简单 URL 搜集 → REST 端点(/start-crawl/...
    • 需要额外参数(搜索词、UP 主 ID 等)→ MCP 端点(/mcp
  4. 结果处理
    • 给用户简要摘要(任务 ID、状态、关键字段)
    • 内容很多时仅展示前几条,说明可通过 get_task_data 继续读取或过滤

故障处理

错误处理方式
连接失败提醒先启动 Electron 应用(bun run start / dev
401 / 403提示检查 cookies 是否已在 store 中,或让用户重新从插件导入
429按返回的 Retry-After 退避,不密集重试
5xx最多重试 1 次,返回错误摘要与建议
task_id 不存在先用 list_archives 查询正确的任务 ID

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Automation

Canonry Setup

Agent-first AEO operating platform.

Registry SourceRecently Updated
4151arberx
Automation

Pilot Service Agents Entertainment

Games, manga/anime, trivia, and fandom APIs — PokeAPI, Jikan, CheapShark, misc. Use this skill when: 1. Pokémon / PokeAPI lookups 2. Anime or manga metadata...

Registry SourceRecently Updated
Automation

Pilot Service Agents Economics

Macroeconomic indicators — IMF DataMapper, World Bank, Eurostat SDMX, Coinbase reference prices. Use this skill when: 1. Country-level GDP, inflation, or une...

Registry SourceRecently Updated
Automation

Pilot Service Agents Flights

Aircraft tracking and aviation weather — ADS-B feeds (ICAO + bbox), airport directory, METAR/TAF/SIGMET. Use this skill when: 1. Live aircraft positions by I...

Registry SourceRecently Updated