jina-reader

Web content extraction via Jina AI Reader API. Three modes: read (URL to markdown), search (web search + full content), ground (fact-checking). Extracts clean content without exposing server IP.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Install the skill with:

npx skills add ericsantos/jina-reader

Jina Reader

Extract clean web content via Jina AI — without exposing your server IP.

Read a URL

{baseDir}/scripts/reader.sh "https://example.com/article"

Search the web (top 5 results with full content)

{baseDir}/scripts/reader.sh --mode search "latest AI news 2025"

Fact-check a statement

{baseDir}/scripts/reader.sh --mode ground "OpenAI was founded in 2015"
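
The three modes above plausibly correspond to Jina's three public endpoints. The internals of reader.sh are not shown here, so the following is only a sketch under that assumption: `endpoint_for_mode` is a hypothetical helper, and the mode-to-host mapping is a guess at what the script does.

```shell
# Hypothetical helper (not part of reader.sh): map a --mode value to the
# Jina endpoint it most likely targets. The mapping is an assumption.
endpoint_for_mode() {
  case "$1" in
    read)   echo "https://r.jina.ai/" ;;
    search) echo "https://s.jina.ai/" ;;
    ground) echo "https://g.jina.ai/" ;;
    *)      echo "unknown mode: $1" >&2; return 1 ;;
  esac
}
```

An unknown mode returns a non-zero status, so a caller can stop before any request is made.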

Options

| Flag       | Description                                | Default  |
|------------|--------------------------------------------|----------|
| --mode     | read, search, ground                       | read     |
| --selector | CSS selector to extract specific region    |          |
| --wait     | CSS selector to wait for before extraction |          |
| --remove   | CSS selectors to remove (comma-separated)  |          |
| --proxy    | Country code for geo-proxy (br, us, etc.)  |          |
| --nocache  | Force fresh content (skip cache)           | off      |
| --format   | markdown, html, text, screenshot           | markdown |
| --json     | Raw JSON output                            | off      |
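
Most of these flags look like thin wrappers over request headers of the Jina Reader API. A minimal sketch of that translation follows. It is an assumption, not the script's actual code: `flags_to_headers` is a hypothetical function, the header names are taken from Jina's public Reader documentation as best recalled, and `--proxy` and `--mode` are left out because their mapping is less certain.

```shell
# Hypothetical sketch: print one HTTP header line per recognised flag.
# Header names are assumptions; reader.sh may build its request differently.
flags_to_headers() {
  while [ "$#" -gt 0 ]; do
    case "$1" in
      --selector|--wait|--remove|--format)
        # These flags take a value; refuse to continue if it is missing.
        [ "$#" -ge 2 ] || { echo "missing value for $1" >&2; return 1; }
        case "$1" in
          --selector) echo "X-Target-Selector: $2" ;;
          --wait)     echo "X-Wait-For-Selector: $2" ;;
          --remove)   echo "X-Remove-Selector: $2" ;;
          --format)   echo "X-Return-Format: $2" ;;
        esac
        shift 2 ;;
      --nocache) echo "X-No-Cache: true"; shift ;;
      --json)    echo "Accept: application/json"; shift ;;
      *)         shift ;;  # anything else is ignored by this sketch
    esac
  done
}
```

Each printed line could then be passed to an HTTP client as a separate header argument.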

Examples

# Extract article content
{baseDir}/scripts/reader.sh "https://blog.example.com/post"

# Extract specific section via CSS selector
{baseDir}/scripts/reader.sh --selector "article.main" "https://example.com"

# Remove nav and ads before extraction
{baseDir}/scripts/reader.sh --remove "nav,footer,.ads" "https://example.com"

# Search with JSON output
{baseDir}/scripts/reader.sh --mode search --json "AI enterprise trends"

# Read via Brazil proxy
{baseDir}/scripts/reader.sh --proxy br "https://example.com.br"

# Fact-check a claim
{baseDir}/scripts/reader.sh --mode ground "Tesla is the most valuable car company"
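
For several pages at once, the single-URL form can be wrapped in a loop. This is a sketch only: `read_all` is a hypothetical helper, and it assumes the reader command takes the URL as its last argument, as in the examples above.

```shell
# Hypothetical helper: run a reader command once per URL read from stdin.
# $1 is the command to run (for example the path to reader.sh).
read_all() {
  while IFS= read -r _url; do
    [ -n "$_url" ] || continue                  # skip blank lines
    # Detach stdin so the command cannot consume the remaining URL list.
    "$1" "$_url" < /dev/null || echo "failed: $_url" >&2
  done
}

# Usage (path is illustrative):
#   read_all "{baseDir}/scripts/reader.sh" < urls.txt
```

A failed page is reported on stderr and the loop carries on with the next URL.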

API Key

export JINA_API_KEY="jina_..."

The free tier includes 10M tokens and requires no signup. Get a key at https://jina.ai/reader/
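
Jina's HTTP API takes the key as a bearer token. Whether reader.sh runs without a key is not stated here, so the sketch below simply emits the header when `JINA_API_KEY` is set and nothing otherwise. `auth_header` is a hypothetical helper, not part of the script.

```shell
# Hypothetical helper: print the Authorization header only when a key is set.
auth_header() {
  if [ -n "${JINA_API_KEY:-}" ]; then
    echo "Authorization: Bearer ${JINA_API_KEY}"
  fi
}
```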

Pricing

  • Read: ~$0.005/page (standard); 3x that with ReaderLM-v2
  • Search: 10K tokens fixed, plus a variable amount per result
  • Ground: ~300K tokens/request (~30s latency)
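
To put the ground figure in perspective, a rough budget check helps: at ~300K tokens per request, the 10M-token free tier covers about 33 fact-checks. The arithmetic, using a hypothetical helper and the approximate figures quoted above:

```shell
# Integer division: how many requests of a given token cost fit in a budget.
# $1 = token budget, $2 = tokens per request (both approximate figures).
requests_in_budget() {
  echo $(( $1 / $2 ))
}

requests_in_budget 10000000 300000   # prints 33
```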

Why Jina Reader?

  • IP protection: requests originate from Jina's infrastructure, so target sites do not see your server's IP
  • Clean markdown: readability extraction, with optional ReaderLM-v2
  • Dynamic content: headless Chrome renders JavaScript
  • Structured extraction: JSON schema support for data extraction
