web-fetch

使用带 Stealth 插件的无头浏览器抓取网页内容并转换为 Markdown。用于当需要获取特定网页的正文、新闻详情、公司财报或其他长篇网页内容时。支持绕过大多数基础反爬虫检测。

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "web-fetch" with this command: npx skills add dlutwuwei/web-anti-crawl-fetch

Web Fetch Skill

使用无头浏览器(Playwright + Stealth Plugin)抓取指定 URL 的网页内容,并自动转换为 Markdown 格式以便于阅读和进一步处理。

主要特性

  • 反爬虫绕过: 集成了 playwright-extrapuppeteer-extra-plugin-stealth,自动处理各种浏览器指纹和自动化特征检测。
  • 内容转换: 使用 turndown 库将复杂的 HTML 页面转换为简洁的 Markdown 格式。
  • 环境模拟: 模拟真实用户视口大小和无头浏览器配置。

使用方法

运行抓取脚本:

cd /Users/wuwei/.openclaw/workspace/skills/web-fetch/scripts
node fetch.js <url>

参数说明

  • url: 需要抓取的完整网页 URL(包括 http/https)。

示例

# 抓取新浪财经
node fetch.js "https://finance.sina.com.cn/stock/"

# 抓取特定新闻页面
node fetch.js "https://finance.eastmoney.com/a/202403143012345678.html"

输出

脚本将会在控制台输出以下内容:

  1. 抓取进度说明。
  2. 页面标题。
  3. 转换后的 Markdown 正文内容(较长内容会截断)。

依赖

  • playwright-extra: 插件化 Playwright 核心。
  • puppeteer-extra-plugin-stealth: 提供各种 evasion 策略。
  • turndown: HTML 到 Markdown 转换服务。

安装依赖:

cd /Users/wuwei/.openclaw/workspace/skills/web-fetch/scripts
npm install

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

通义晓蜜 - 智能外呼

触发阿里云晓蜜外呼机器人任务,自动批量拨打电话。适用于批量外呼、客户回访、满意度调查、简历筛查约面试等场景。可从前置工具或节点获取外呼名单。

Registry SourceRecently Updated
General

Letterboxd Watchlist

Scrape a public Letterboxd user's watchlist into a CSV/JSONL list of titles and film URLs without logging in. Use when a user asks to export, scrape, or mirror a Letterboxd watchlist, or to build watch-next queues.

Registry SourceRecently Updated
General

Seedance Video Generation

Generate AI videos using ByteDance Seedance. Use when the user wants to: (1) generate videos from text prompts, (2) generate videos from images (first frame, first+last frame, reference images), or (3) query/manage video generation tasks. Supports Seedance 1.5 Pro (with audio), 1.0 Pro, 1.0 Pro Fast, and 1.0 Lite models.

Registry SourceRecently Updated
4.2K17jackycser
General

Universal Skills Manager

The master coordinator for AI skills. Discovers skills from multiple sources (SkillsMP.com, SkillHub, and ClawHub), manages installation, and synchronization...

Registry SourceRecently Updated
web-fetch | V50.AI