proxy-web-fetch

Proxy Web Page Fetch Tool - Fetches and parses web page content into structured Markdown or text via the OpenClaw Manager proxy. Use when: - Need to fetch and read the content of a web page by URL - Need to convert web pages to Markdown or plain text format - Need to extract page content with or without images - Need to get page metadata (title, description, keywords) - Need to control caching, image retention, or summary options for fetched content - User asks to "read a URL", "fetch a page", "grab the content of a web page", "scrape" or "crawl" a URL This skill routes all fetch requests through the Manager Web Fetch Proxy (configured via `WEB_FETCH_PROXY_URL` env var, required), which handles API key management automatically — no manual configuration needed. Do NOT confuse with web search — this skill fetches a specific URL's content, it does not perform keyword searches.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "proxy-web-fetch" with this command: npx skills add whyhit2005/proxy-web-fetch

Proxy Web Page Fetch

Fetch and parse web page content via the OpenClaw Manager Web Fetch Proxy. The Manager handles API key injection from encrypted storage automatically — no manual key configuration needed.

The proxy URL is configured via the WEB_FETCH_PROXY_URL environment variable (required). If not set, the skill will not be available.

Quick Start

Basic cURL Usage

curl --request POST \
  --url "${WEB_FETCH_PROXY_URL}/" \
  --header 'Content-Type: application/json' \
  --data '{
    "url": "https://www.example.com"
  }'

Script Usage

A wrapper shell script is provided for convenience.

# Basic Fetch (returns Markdown by default)
./scripts/proxy_fetch.sh --url "https://www.example.com"

# Fetch as plain text, no cache
./scripts/proxy_fetch.sh \
  --url "https://docs.python.org/3/" \
  --format text \
  --no-cache

# Fetch with image and link summaries
./scripts/proxy_fetch.sh \
  --url "https://news.example.com/article" \
  --images-summary \
  --links-summary

# Fetch without images, disable GFM
./scripts/proxy_fetch.sh \
  --url "https://blog.example.com/post" \
  --no-images \
  --no-gfm

Authentication

No authentication required — the proxy reads API keys internally from the Manager's encrypted secrets store.

API Parameter Reference

ParameterTypeRequiredDefaultDescription
urlstring-URL of the web page to fetch
timeoutinteger-20Request timeout in seconds
no_cacheboolean-falseDisable caching (true/false)
return_formatstring-markdownReturn format: markdown or text
retain_imagesboolean-trueRetain images in output (true/false)
no_gfmboolean-falseDisable GitHub Flavored Markdown (true/false)
keep_img_data_urlboolean-falseKeep image data URLs (true/false)
with_images_summaryboolean-falseInclude images summary (true/false)
with_links_summaryboolean-falseInclude links summary (true/false)

Response Structure

The proxy returns JSON with the parsed page content.

{
  "id": "task-id",
  "created": 1704067200,
  "request_id": "request-id",
  "model": "model-name",
  "reader_result": {
    "title": "Page Title",
    "description": "Brief page description",
    "url": "https://www.example.com",
    "content": "Parsed page content (Markdown or text)",
    "external": {
      "stylesheet": {}
    },
    "metadata": {
      "keywords": "page, keywords",
      "viewport": "width=device-width",
      "description": "Meta description",
      "format-detection": "telephone=no"
    }
  }
}

Key Response Fields

FieldDescription
reader_result.contentMain parsed content (body text, images, links)
reader_result.titlePage title
reader_result.descriptionBrief page description
reader_result.urlOriginal page URL
reader_result.metadataPage metadata (keywords, viewport, etc.)

Common Use Cases

ScenarioCommand
Read a documentation page--url <doc_url>
Extract text only (no images)--url <url> --no-images --format text
Force fresh fetch (bypass cache)--url <url> --no-cache
Get content with all summaries--url <url> --images-summary --links-summary
Long page with extended timeout--url <url> --timeout 60

Environment Requirements

  • OpenClaw Manager must be running with the Web Fetch Proxy enabled.
  • WEB_FETCH_PROXY_URL environment variable must be set to the proxy URL (required, no default).
  • curl command must be available in your system path.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

Leads

Leads - command-line tool for everyday use

Registry SourceRecently Updated
General

Bmi Calculator

BMI计算器。BMI计算、理想体重、健康计划、体重追踪、儿童BMI、结果解读。BMI calculator with ideal weight, health plan. BMI、体重、健康。

Registry SourceRecently Updated
General

Blood

Blood — a fast health & wellness tool. Log anything, find it later, export when needed.

Registry SourceRecently Updated
General

Better Genshin Impact

📦BetterGI · 更好的原神 - 自动拾取 | 自动剧情 | 全自动钓鱼(AI) | 全自动七圣召唤 | 自动伐木 | 自动刷本 | 自动采集/挖矿/锄地 | 一条龙 | 全连音游 - UI A better genshin impact, c#, auto-play-game, automatic, g...

Registry SourceRecently Updated