smart-scraper

AI-powered web scraper with intelligent structure recognition. Extracts lists, articles, and tables from any website with automatic type detection.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "smart-scraper" with this command: npx skills add yadanzheng68-cmyk/smart-scraper

Smart Scraper

Intelligent web scraping that understands page structure.

Features

  • Auto-detection: Automatically identifies list, article, or table layouts
  • Smart extraction: Parses prices, dates, URLs from unstructured text
  • Multiple formats: Output as JSON, CSV, or Markdown
  • Scroll support: Handles infinite scroll pages

Usage

# Extract product listings
smart-scraper --url "https://example.com/products" --type list

# Extract article content
smart-scraper --url "https://example.com/article" --type article --format markdown

# Extract table data
smart-scraper --url "https://example.com/data" --type table --format csv

Options

OptionDescriptionDefault
--url, -uTarget URL (required)-
--type, -tExtraction type: list, article, table, autoauto
--format, -fOutput format: json, csv, markdownjson
--max, -mMaximum items to extract100
--scrollEnable auto-scroll for lazy-loaded contentfalse

Examples

Extract Hacker News

smart-scraper -u https://news.ycombinator.com -t list -m 10

Save article as Markdown

smart-scraper -u https://blog.example.com/post -t article -f markdown > article.md

Export table to CSV

smart-scraper -u https://example.com/prices -t table -f csv > prices.csv

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

Baoyu Danger Gemini Web

Generates images and text via reverse-engineered Gemini Web API. Supports text generation, image generation from prompts, reference images for vision input,...

Registry SourceRecently Updated
8310wjctim
General

Easypost

EasyPost — shipping labels, rate comparison, package tracking, address verification, and insurance.

Registry SourceRecently Updated
General

Update Advisor

OpenClaw update check and upgrade assistant. Triggers on phrases like "check for updates", "any new version", "is openclaw updated", "run the update", "confi...

Registry SourceRecently Updated
General

Memory Management

Manage and standardize trading decision records, extract lessons, and support history retrieval and comparison within the PAI trading system.

Registry SourceRecently Updated