agent-fetch

Fetch and extract full article content from URLs. Returns complete text with structure (headings, links, lists) instead of summaries. Supports multiple extraction strategies, browser impersonation, cookies, crawling, and custom selectors. Typical fetch time: 200-700ms.

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Install skill "agent-fetch" with this command: npx skills add teng-lin/agent-fetch/teng-lin-agent-fetch-agent-fetch

agent-fetch Skill

A better web fetch for text content. Your built-in web fetch summarizes or truncates pages. agent-fetch extracts the complete article (every paragraph, heading, and link) using 7 extraction strategies and browser impersonation. No server is required; it runs as a local CLI tool.

When to Use This Skill

Use agent-fetch whenever you need to read a URL. It returns full article text with structure preserved — better than your built-in web fetch for any task involving page content.

  • User asks to read, fetch, or analyze a URL
  • User types /agent-fetch <url>
  • You need the full text, not a summary or truncation
  • Your built-in web fetch returned incomplete or garbled content

Prerequisites

agent-fetch runs via npx (no install needed):

npx agent-fetch --help

Commands

/agent-fetch <url> - Fetch and Extract Article

Default usage. Fetches the URL with browser impersonation and extracts the complete article content as markdown.

npx agent-fetch "<url>" --json

Parse the JSON output and present it to the user in this format:

---
title: {title}
author: {byline || "Unknown"}
source: {siteName}
url: {url}
date: {publishedTime || "Unknown"}
fetched_in: {latencyMs}ms
---

## {title}

{markdown || textContent}
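The parse-and-present step above can be sketched in Python. This is a minimal illustration, not part of agent-fetch itself: `fetch_json` and `render_article` are hypothetical helper names, and the sketch assumes the CLI writes its JSON to stdout. The field names come from the template above; the heading line is rendered from `title`, which is an assumption about the template's intent.

```python
import json
import subprocess


def fetch_json(url: str) -> dict:
    """Run `npx agent-fetch <url> --json` and parse the result.

    Assumption: the CLI prints a single JSON document to stdout.
    """
    completed = subprocess.run(
        ["npx", "agent-fetch", url, "--json"],
        capture_output=True,
        text=True,
        check=False,  # the caller inspects the JSON rather than the exit code
    )
    return json.loads(completed.stdout)


def render_article(result: dict) -> str:
    """Format a parsed result using the front-matter template above."""
    # Python's `or` plays the role of `||` in the template fallbacks.
    body = result.get("markdown") or result.get("textContent") or ""
    lines = [
        "---",
        f"title: {result.get('title', '')}",
        f"author: {result.get('byline') or 'Unknown'}",
        f"source: {result.get('siteName', '')}",
        f"url: {result.get('url', '')}",
        f"date: {result.get('publishedTime') or 'Unknown'}",
        f"fetched_in: {result.get('latencyMs', '')}ms",
        "---",
        "",
        f"## {result.get('title', '')}",  # assumed heading source
        "",
        body,
    ]
    return "\n".join(lines)
```

A caller would typically chain the two: `print(render_article(fetch_json(url)))`.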

If the fetch fails, check suggestedAction in the JSON:

| suggestedAction | What it means | Next action |
|---|---|---|
| retry_with_extract | Needs full browser | Inform user; agent-fetch is HTTP-only |
| wait_and_retry | Rate limited | Wait 60s and retry |
| skip | Cannot access this site | Inform user |
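The table above can be turned into a small decision helper. The sketch below is illustrative only: `fetch_with_retry` and `NEXT_STEPS` are hypothetical names, `fetch` stands for any callable that returns the parsed `--json` result as a dict, and the sketch assumes a successful result carries no `suggestedAction` value.

```python
import time
from typing import Callable, Optional, Tuple

# Next steps per suggestedAction, copied from the table above.
NEXT_STEPS = {
    "retry_with_extract": "Needs a full browser; agent-fetch is HTTP-only, so inform the user.",
    "wait_and_retry": "Rate limited; wait 60s and retry.",
    "skip": "Cannot access this site; inform the user.",
}


def fetch_with_retry(
    fetch: Callable[[str], dict],
    url: str,
    wait_seconds: float = 60.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Tuple[dict, Optional[str]]:
    """Fetch once, and once more after a pause if the result says wait_and_retry.

    Returns (result, message). message is None when no suggestedAction is
    present, which this sketch treats as success (an assumption).
    """
    result = fetch(url)
    action = result.get("suggestedAction")
    if action == "wait_and_retry":
        sleep(wait_seconds)  # the table asks for a 60s wait
        result = fetch(url)
        action = result.get("suggestedAction")
    if action is None:
        return result, None
    return result, NEXT_STEPS.get(action, f"Unrecognized suggestedAction: {action}")
```

Passing `sleep` as a parameter keeps the helper easy to exercise without actually waiting a minute.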

/agent-fetch raw <url> - Raw HTML

Fetch raw HTML without extraction.

npx agent-fetch "<url>" --raw

/agent-fetch quiet <url> - Markdown Only

Just the article markdown, no metadata.

npx agent-fetch "<url>" -q

/agent-fetch text <url> - Plain Text Only

Plain text content without formatting or metadata.

npx agent-fetch "<url>" --text

/agent-fetch cookies - Use Persistent Cookies

Load cookies from a Netscape-format file or pass them inline:

# From Netscape cookie file (export from browser)
npx agent-fetch "<url>" --cookie-file ~/.cookies.txt

# Inline cookies (repeatable)
npx agent-fetch "<url>" --cookie "sessionId=abc123; theme=dark"
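If no browser export is at hand, a Netscape-format cookie file can be produced with Python's standard `http.cookiejar` module. This is a sketch under assumptions: `write_cookie_file` is a hypothetical helper, the domain and cookie values are placeholders, and it has not been confirmed here that agent-fetch accepts every variant of the format this module writes.

```python
import http.cookiejar


def write_cookie_file(path: str, domain: str, cookies: dict, expires: int = 4102444800) -> None:
    """Write name/value pairs for one domain as a Netscape cookies.txt file.

    The default expiry is a far-future timestamp (year 2100) so that the
    cookies are not dropped as expired when the jar is saved.
    """
    jar = http.cookiejar.MozillaCookieJar(path)
    for name, value in cookies.items():
        jar.set_cookie(
            http.cookiejar.Cookie(
                version=0,
                name=name,
                value=value,
                port=None,
                port_specified=False,
                domain=domain,
                domain_specified=True,
                domain_initial_dot=domain.startswith("."),
                path="/",
                path_specified=True,
                secure=False,
                expires=expires,
                discard=False,
                comment=None,
                comment_url=None,
                rest={},
            )
        )
    jar.save()  # writes to the path given to the constructor
```

The resulting file can then be passed with `--cookie-file`, as in the command above.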

/agent-fetch selectors <url> - Custom CSS Selectors

Extract specific elements or remove unwanted ones:

# Extract only the article, remove navigation and ads
npx agent-fetch "<url>" --select "article" --remove "nav, .sidebar, [class*='ad']"

# Extract every element with class "post-content" (the selector is not limited to divs)
npx agent-fetch "<url>" --select ".post-content"

/agent-fetch crawl <url> - Crawl Multiple Pages

Follow links and extract content from multiple pages:

# Crawl with defaults (depth: 3, max 100 pages)
npx agent-fetch crawl "<url>"

# Deeper crawl with concurrency control
npx agent-fetch crawl "<url>" --depth 5 --limit 50 --concurrency 3

# Include/exclude specific URL patterns
npx agent-fetch crawl "<url>" --include "*/blog/*" --exclude "**/archive/**"

# Add rate limiting delay between requests
npx agent-fetch crawl "<url>" --delay 1000

# Follow cross-origin links (by default the crawl stays on the starting origin)
npx agent-fetch crawl "<url>" --no-same-origin

# Output as JSONL for processing
npx agent-fetch crawl "<url>" --json
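The JSONL output from a crawl can be consumed one line at a time. The sketch below assumes each non-empty line is a standalone JSON object, which is what JSONL means in general; `parse_jsonl` is a hypothetical helper, and the `url` and `title` fields in the usage note are assumed by analogy with the single-page JSON, not confirmed for crawl records.

```python
import json
from typing import List


def parse_jsonl(text: str) -> List[dict]:
    """Parse JSONL text into a list of records, skipping blank lines."""
    records = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # tolerate blank lines between records
        records.append(json.loads(line))
    return records
```

For example, after capturing the crawl's stdout into a string `output`, `[r.get("url") for r in parse_jsonl(output)]` would list the crawled URLs if the records carry a `url` field.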

/agent-fetch pdf <file> - Extract from PDF

Extract text content from local PDF files:

# Extract PDF as markdown with metadata
npx agent-fetch document.pdf

# JSON output for programmatic access
npx agent-fetch document.pdf --json

# Just the text content
npx agent-fetch document.pdf --text

/agent-fetch preset - Custom TLS Fingerprint

Impersonate different browsers to bypass fingerprinting checks:

# Chrome 143 (default)
npx agent-fetch "<url>" --preset "chrome-143"

# iOS Safari 18
npx agent-fetch "<url>" --preset "ios-safari-18"

# Android Chrome 143
npx agent-fetch "<url>" --preset "android-chrome-143"
