web-scraper

Fetch any URL and extract clean readable content as text or markdown. Use this skill when the user asks to scrape a webpage, extract text from a URL, fetch website content, read an article from a link, or download webpage content.

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "web-scraper" with this command: npx skills add dalehurley/phpbot/dalehurley-phpbot-web-scraper

Skill: web-scraper

When to Use

Use this skill when the user asks to:

  • Scrape or fetch content from a URL
  • Extract text from a webpage
  • Read an article from a link
  • Get the content of a website
  • Download and convert a webpage to text/markdown
  • Summarize a web page

Input Parameters

ParameterRequiredDescriptionExample
urlYesURL to scrapehttps://example.com/article
formatNoOutput format: markdown (default), text, htmlmarkdown
outputNoSave output to file patharticle.md

Procedure

  1. Get the URL from the user's request

  2. Run the bundled script:

    # Fetch and extract as markdown (default)
    python3 skills/web-scraper/scripts/scrape.py "https://example.com/article"
    
    # Extract as plain text
    python3 skills/web-scraper/scripts/scrape.py "https://example.com" --format text
    
    # Save to file
    python3 skills/web-scraper/scripts/scrape.py "https://example.com" --output article.md
    
  3. The script auto-installs trafilatura and requests if needed

  4. Present the extracted content to the user

Bundled Scripts

ScriptTypeDescription
scripts/scrape.pyPythonFetch URL and extract clean content

Script Usage

# Extract article content as markdown
python3 scripts/scrape.py "https://example.com/blog-post"

# Extract as plain text
python3 scripts/scrape.py "https://example.com" --format text

# Keep raw HTML
python3 scripts/scrape.py "https://example.com" --format html

# Save to file
python3 scripts/scrape.py "https://example.com" --output page.md

# Include metadata (title, author, date)
python3 scripts/scrape.py "https://example.com/article" --metadata

Example

scrape this URL: https://example.com/article
extract the text from this webpage
fetch the content of this link and save it
read this article: https://blog.example.com/post-1
get the content from this website

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Automation

desktop-control

No summary provided by upstream source.

Repository SourceNeeds Review
Automation

summarize-unread-emails

No summary provided by upstream source.

Repository SourceNeeds Review
Automation

csv-tools

No summary provided by upstream source.

Repository SourceNeeds Review