defuddle-web-cleaner

--- name: defuddle-web-cleaner description: extract clean article content from web pages using defuddle. use when a user provides a url or html and wants the readable article text, markdown version, or structured metadata. helpful for web scraping, research workflows, note taking, obsidian clipping, and converting web pages to markdown. ---

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "defuddle-web-cleaner" with this command: npx skills add extrastu/defuddle

name: defuddle-web-cleaner description: extract clean article content from web pages using defuddle. use when a user provides a url or html and wants the readable article text, markdown version, or structured metadata. helpful for web scraping, research workflows, note taking, obsidian clipping, and converting web pages to markdown.

Defuddle Web Cleaner

Extract the main readable content from a web page.

This skill removes unnecessary elements such as:

navigation bars
sidebars
ads
comments
footers
social buttons

The result is clean article content.

Supported Inputs

URL
Raw HTML
Web page text

Output Format

Default output:

Title
Author
Site
Published date

Markdown article content

Alternative output (JSON):

{ title, author, site, description, published, content, contentMarkdown }

Processing Steps

Detect input type
Load page HTML
Run Defuddle parser
Extract metadata
Convert to Markdown if requested
Return clean content

Example

Input:

https://example.com/blog/ai

Output:

Title: AI is Changing Everything
Author: Jane Smith
Site: Example Blog

Markdown:

AI is Changing Everything

Artificial intelligence is transforming industries...

Tips

Use this skill when:

saving articles to Obsidian
building research datasets
cleaning webpages for LLM processing
summarizing articles

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Open Registry Record Open in ClawHub

Related Skills

Related by shared tags or category signals.

Research

Fireflies.ai

Fireflies.ai GraphQL API integration with managed OAuth. Access meeting transcripts, summaries, users, contacts, and AI-powered meeting analysis. Use this sk...

Registry SourceRecently Updated

2K3Profile unavailable

Research

Gemini Citation

Conduct evidence-based research with exact, accurate APA citations using the Gemini API's 'scientific citation' (Google Search grounding) feature. Use when X...

Registry SourceRecently Updated

2430Profile unavailable

Research

安全驾驶行为分析工具

Analyzes videos of vehicle drivers to identify unsafe driving behaviors. It generates professional analysis reports to help enhance road safety awareness. |...

Registry SourceRecently Updated

940Profile unavailable

Research

deep-research-surf

Conducts deep, multi-angle research using Surf MCP tools and parallel subagents. Use for deep research, competitive landscape analysis, strategic intelligenc...

Registry SourceRecently Updated

710Profile unavailable