arxiv-reader

Read and analyze arXiv papers by fetching LaTeX source, listing sections, or extracting abstracts

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "arxiv-reader" with this command: npx skills add willamhou/arxiv-source

arxiv-reader

Read and analyze arXiv papers by fetching their public LaTeX source. Converts LaTeX into clean text suitable for LLM analysis.

Description

This skill fetches arXiv papers from the public arXiv API (arxiv.org), flattens LaTeX includes, and returns clean text. No local file access is required — all content is fetched over HTTPS from arXiv's public endpoints and cached in memory for the session.

Network access: Only connects to arxiv.org and export.arxiv.org to download publicly available paper source tarballs and metadata. No other network connections are made. No data is sent to external services — this is read-only.

Caching: Results are cached in memory (process-scoped) for fast repeat access within the same session. No files are written to disk.

Usage Examples

  • "Read the paper 2301.00001 from arXiv"
  • "What sections does paper 2405.12345 have?"
  • "Get the abstract of 2312.09876"
  • "Fetch paper 2301.00001 without the appendix"

Process

  1. Quick look — Use arxiv_abstract to get a paper's abstract before committing to a full read
  2. Survey structure — Use arxiv_sections to understand the paper's outline
  3. Deep read — Use arxiv_fetch to get the full flattened LaTeX for analysis

Tools

arxiv_fetch

Fetch the full flattened LaTeX source of an arXiv paper.

Parameters:

  • arxiv_id (string, required): arXiv paper ID (e.g. 2301.00001 or 2301.00001v2)
  • remove_comments (boolean, optional): Strip LaTeX comments (default: true)
  • remove_appendix (boolean, optional): Remove appendix sections (default: false)
  • figure_paths (boolean, optional): Replace figures with file paths only (default: false)

Returns: { content: string, arxiv_id: string, cached: boolean }

Example:

{ "arxiv_id": "2301.00001", "remove_appendix": true }

arxiv_sections

List all sections and subsections of an arXiv paper.

Parameters:

  • arxiv_id (string, required): arXiv paper ID

Returns: { arxiv_id: string, sections: string[] }

Example:

{ "arxiv_id": "2301.00001" }

arxiv_abstract

Extract just the abstract from an arXiv paper.

Parameters:

  • arxiv_id (string, required): arXiv paper ID

Returns: { arxiv_id: string, abstract: string }

Example:

{ "arxiv_id": "2301.00001" }

Notes

  • Results are cached in memory — repeat requests within the same session are instant
  • Paper IDs support version suffixes (e.g. 2301.00001v2)
  • Very large papers may take 10-30 seconds on first fetch
  • arxiv_abstract uses the public arXiv Atom API for fast metadata retrieval
  • No filesystem writes — all caching is in-memory only
  • Only connects to arxiv.org (read-only, public data)

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Research

Fireflies.ai

Fireflies.ai GraphQL API integration with managed OAuth. Access meeting transcripts, summaries, users, contacts, and AI-powered meeting analysis. Use this sk...

Registry SourceRecently Updated
2K3Profile unavailable
Research

Gemini Citation

Conduct evidence-based research with exact, accurate APA citations using the Gemini API's 'scientific citation' (Google Search grounding) feature. Use when X...

Registry SourceRecently Updated
2430Profile unavailable
Research

安全驾驶行为分析工具

Analyzes videos of vehicle drivers to identify unsafe driving behaviors. It generates professional analysis reports to help enhance road safety awareness. |...

Registry SourceRecently Updated
940Profile unavailable
Research

deep-research-surf

Conducts deep, multi-angle research using Surf MCP tools and parallel subagents. Use for deep research, competitive landscape analysis, strategic intelligenc...

Registry SourceRecently Updated
710Profile unavailable