zhipu-web-fetch

Zhipu AI Web Page Reader Tool - Fetches and parses web page content into structured Markdown or text via cURL. Use when: - Need to fetch and read the content of a web page by URL - Need to convert web pages to Markdown or plain text format - Need to extract page content with or without images - Need to get page metadata (title, description, keywords) - Need to control caching, image retention, or summary options for fetched content - User asks to "read a URL", "fetch a page", "grab the content of a web page", "scrape" or "crawl" a URL Do NOT confuse with web search — this skill fetches a specific URL's content, it does not perform keyword searches.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "zhipu-web-fetch" with this command: npx skills add whyhit2005/zhipu-web-fetch

Zhipu Web Page Reader

Fetch and parse web page content via Zhipu AI's Reader API (/paas/v4/reader), using lightweight cURL. Returns parsed page content in Markdown or plain text format, along with metadata like title and description.

Quick Start

Basic cURL Usage

curl --request POST \
  --url https://open.bigmodel.cn/api/paas/v4/reader \
  --header "Authorization: Bearer $ZHIPU_API_KEY" \
  --header 'Content-Type: application/json' \
  --data '{
    "url": "https://www.example.com"
  }'

Script Usage

A wrapper shell script is provided for convenience.

# Basic Fetch (returns Markdown by default)
bash scripts/zhipu_fetch.sh --url "https://www.example.com"

# Fetch as plain text, no cache
bash scripts/zhipu_fetch.sh \
  --url "https://docs.python.org/3/" \
  --format text \
  --no-cache

# Fetch with image and link summaries
bash scripts/zhipu_fetch.sh \
  --url "https://news.example.com/article" \
  --images-summary \
  --links-summary

# Fetch without images, disable GFM
bash scripts/zhipu_fetch.sh \
  --url "https://blog.example.com/post" \
  --no-images \
  --no-gfm

API Parameter Reference

ParameterTypeRequiredDefaultDescription
urlstring-URL of the web page to fetch
timeoutinteger-20Request timeout in seconds
no_cacheboolean-falseDisable caching (true/false)
return_formatstring-markdownReturn format: markdown or text
retain_imagesboolean-trueRetain images in output (true/false)
no_gfmboolean-falseDisable GitHub Flavored Markdown (true/false)
keep_img_data_urlboolean-falseKeep image data URLs (true/false)
with_images_summaryboolean-falseInclude images summary (true/false)
with_links_summaryboolean-falseInclude links summary (true/false)

Response Structure

The API returns JSON with the parsed page content.

{
  "id": "task-id",
  "created": 1704067200,
  "request_id": "request-id",
  "model": "model-name",
  "reader_result": {
    "title": "Page Title",
    "description": "Brief page description",
    "url": "https://www.example.com",
    "content": "Parsed page content (Markdown or text)",
    "external": {
      "stylesheet": {}
    },
    "metadata": {
      "keywords": "page, keywords",
      "viewport": "width=device-width",
      "description": "Meta description",
      "format-detection": "telephone=no"
    }
  }
}

Key Response Fields

FieldDescription
reader_result.contentMain parsed content (body text, images, links)
reader_result.titlePage title
reader_result.descriptionBrief page description
reader_result.urlOriginal page URL
reader_result.metadataPage metadata (keywords, viewport, etc.)

Common Use Cases

ScenarioCommand
Read a documentation page--url <doc_url>
Extract text only (no images)--url <url> --no-images --format text
Force fresh fetch (bypass cache)--url <url> --no-cache
Get content with all summaries--url <url> --images-summary --links-summary
Long page with extended timeout--url <url> --timeout 60

Environment Requirements

  • Environment variable ZHIPU_API_KEY must be configured.
  • curl command must be available in your system path.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

Leads

Leads - command-line tool for everyday use

Registry SourceRecently Updated
General

Bmi Calculator

BMI计算器。BMI计算、理想体重、健康计划、体重追踪、儿童BMI、结果解读。BMI calculator with ideal weight, health plan. BMI、体重、健康。

Registry SourceRecently Updated
General

Blood

Blood — a fast health & wellness tool. Log anything, find it later, export when needed.

Registry SourceRecently Updated
General

Better Genshin Impact

📦BetterGI · 更好的原神 - 自动拾取 | 自动剧情 | 全自动钓鱼(AI) | 全自动七圣召唤 | 自动伐木 | 自动刷本 | 自动采集/挖矿/锄地 | 一条龙 | 全连音游 - UI A better genshin impact, c#, auto-play-game, automatic, g...

Registry SourceRecently Updated