smart-scraper

AI-powered web scraper with intelligent structure recognition. Extracts lists, articles, and tables from any website with automatic type detection.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "smart-scraper" with this command: npx skills add yadanzheng68-cmyk/smart-scraper

Smart Scraper

Intelligent web scraping that understands page structure.

Features

Auto-detection: Automatically identifies list, article, or table layouts
Smart extraction: Parses prices, dates, URLs from unstructured text
Multiple formats: Output as JSON, CSV, or Markdown
Scroll support: Handles infinite scroll pages

Usage

# Extract product listings
smart-scraper --url "https://example.com/products" --type list

# Extract article content
smart-scraper --url "https://example.com/article" --type article --format markdown

# Extract table data
smart-scraper --url "https://example.com/data" --type table --format csv

Options

Option	Description	Default
`--url, -u`	Target URL (required)	-
`--type, -t`	Extraction type: `list`, `article`, `table`, `auto`	`auto`
`--format, -f`	Output format: `json`, `csv`, `markdown`	`json`
`--max, -m`	Maximum items to extract	100
`--scroll`	Enable auto-scroll for lazy-loaded content	false

Examples

Extract Hacker News

smart-scraper -u https://news.ycombinator.com -t list -m 10

Save article as Markdown

smart-scraper -u https://blog.example.com/post -t article -f markdown > article.md

Export table to CSV

smart-scraper -u https://example.com/prices -t table -f csv > prices.csv

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Open Registry Record Open in ClawHub

Related Skills

Related by shared tags or category signals.

General

Baoyu Danger Gemini Web

Generates images and text via reverse-engineered Gemini Web API. Supports text generation, image generation from prompts, reference images for vision input,...

Registry SourceRecently Updated

8310wjctim

General

Easypost

EasyPost — shipping labels, rate comparison, package tracking, address verification, and insurance.

Registry SourceRecently Updated

3680aiwithabidi

General

Update Advisor

OpenClaw update check and upgrade assistant. Triggers on phrases like "check for updates", "any new version", "is openclaw updated", "run the update", "confi...

Registry SourceRecently Updated

1200lzyling

General

Memory Management

Manage and standardize trading decision records, extract lessons, and support history retrieval and comparison within the PAI trading system.

Registry SourceRecently Updated

3860wuzimaki