Web Crawling API

Fetch raw HTML, rendered HTML, or clean Markdown from public webpages through Just Serp API.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "Web Crawling API" with this command: npx skills add justserpapi-web

Web Crawling

Use this skill when the user needs webpage retrieval rather than a search engine result. It fits crawling, scraping preparation, readable content extraction, and page structure inspection for a known URL.

When To Use It

  • The user already has a target webpage URL and wants its raw HTML, rendered HTML, or cleaned Markdown.
  • The task is about content extraction, page inspection, scrape preparation, or converting a page into LLM-friendly text.
  • The user can provide a direct url to crawl.
  • The user needs page content from the source URL itself, not Google search results about that URL.

Representative Operations

  • html: Crawl Webpage (HTML) — Retrieve the raw HTML response for a page.
  • renderedHtml: Crawl Webpage (Rendered HTML) — Retrieve DOM output after rendering for JavaScript-heavy pages.
  • markdown: Crawl Webpage (Markdown) — Extract the main readable content as clean Markdown for summarization or downstream processing.

Request Pattern

  • 3 read-only GET operations are available in this skill.
  • All operations require a direct url query parameter.
  • No operation in this skill requires a request body.
  • Choose renderedHtml for dynamic pages, html for raw source, and markdown for readable content extraction.

How To Work

  1. Read generated/operations.md before choosing an endpoint.
  2. Start with one of these operations when it matches the user's request: html, renderedHtml, markdown.
  3. Pick the smallest matching operation instead of guessing.
  4. Ask the user for any missing required parameter. Do not invent values.
  5. Call the helper with:
node {baseDir}/bin/run.mjs --operation "<operation-id>" --api-key "$JUST_SERP_API_KEY" --params-json '{"key":"value"}'

Environment

  • Required: JUST_SERP_API_KEY
  • This skill uses JUST_SERP_API_KEY only for authenticated Just Serp API requests.
  • Keep JUST_SERP_API_KEY private. Do not paste it into chat messages, screenshots, or logs.
  • Project site: Just Serp API.
  • Authentication details: Just Serp API Docs.

Output Rules

  • Start with what was fetched: raw HTML, rendered HTML, or cleaned Markdown.
  • Echo the target URL so the crawl scope is explicit.
  • For markdown, surface the extracted readable content or key sections before raw JSON.
  • For HTML-oriented requests, mention whether the user asked for source HTML or rendered output.
  • If the backend errors, include the backend payload and the exact operation ID.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

Gigo Lobster Resume

🦞 GIGO · gigo-lobster-resume: 续跑入口:v2 stable 当前会清理旧 checkpoint 并从头重跑;保留此 slug 作为旧 checkpoint 兼容入口。 Triggers: 继续试吃 / 恢复评测 / resume tasting / continue lobster...

Registry SourceRecently Updated
General

YiHui CONTEXT MODE

context-mode is an MCP server that saves 98% of your context window by sandboxing tool outputs. It routes large file reads, shell outputs, and web fetches th...

Registry SourceRecently Updated
General

xinyi-drink

Use when users ask about 新一好喝/新一咖啡 drinks, stores, menu, activities, Skill用户大礼包, today drink recommendations, afternoon tea, feeling sleepy, or personalized...

Registry SourceRecently Updated
General

vedic-destiny

吠陀命盘分析中文入口。用于完整命盘研判、命主盘 Rashi chart 与九分盘 Navamsha chart 联读、既往事件回看、出生时间稳定度判断、事业主题、婚姻主题、时空盘专题,以及基于 Jagannatha Hora PDF、星盘截图或文本命盘数据的系统拆盘。当用户提到完整星盘、事业方向、婚姻问题、关系窗...

Registry SourceRecently Updated