RSS / Atom Feed Generator (v1.0.0)

Generates valid RSS 2.0 or Atom 1.0 XML feeds from any web page that lists posts, articles, or news items but does not publish its own feed.

Defaults & Settings

Parameter	Default value
Preferred format	RSS 2.0 (Atom 1.0 on request)
Max items	20 (most-recent first)
Date fallback	Page scrape date (ISO 8601, UTC)
Description	First 280 chars of post excerpt/lede
Encoding	UTF-8
Output modes	Raw XML · HTML page · Both
Interaction style	Friendly and professional

Trigger Phrases

#	User says …	Action
T1	"Generate an RSS/Atom feed for {url}"	Fetch page → detect posts → emit XML
T2	"This page has no feed: {url}"	Same as T1
T3	"Create a feed from {url}"	Same as T1
T4	"Convert {url} posts to RSS"	Same as T1
T5	"Make a self-hostable feed for {url}"	Emit XML + HTML wrapper
T6	"Update / refresh the feed for {url}"	Re-scrape and re-emit

Order of Operations

Page Fetch — Retrieve the target URL by invoking the WEB_FETCH function (defined in https://linkeddata.uriburner.com/chat/functions/openapi.yaml, endpoint /WEB_FETCH). Use whichever available protocol applies — REST, MCP, OPAL, or curl. Required parameter: url. Optional: headers, max_redirects, timeout_seconds. WEB_FETCH retrieves the page just like a web browser and returns the full page content for subsequent processing. See protocol-routing.md for exact invocation patterns per protocol.
Feed Discovery — Check <link rel="alternate"> tags. If a feed already exists, report it and stop (offer to proxy/mirror instead).
Post Extraction — Use the heuristics in references/extraction-rules.md to identify post entries (title, URL, date, author, summary).
Feed Assembly — Build XML using the templates in references/feed-templates.md. Validate structure against the checklist in references/validation-checklist.md.
Output — Present the feed XML in a code block; optionally wrap in the HTML discovery page template from references/html-wrapper-template.md.
Download Link — Save the .xml file to /mnt/user-data/outputs/ and present it with present_files.

Extraction Heuristics (summary — full rules in references/)

Structural signals to look for

Repeated <article>, <li>, or <div> blocks with consistent class names (e.g., post, entry, article, blog-post, card)
<h1>/<h2>/<h3> headings inside each block → item title
<a href> on or near the heading → item link (resolve relative URLs)
<time datetime="…"> or text matching date patterns → pubDate
First <p> inside the block (≤280 chars) → description / summary
<span class="author"> or byline text → author

Date handling

Situation	Strategy
`<time datetime="…">`	Use ISO value directly
Human-readable date text	Parse with locale awareness; emit RFC 822
No date found	Use today's date (UTC) with a `<!-- estimated -->` comment
Relative ("3 days ago")	Calculate from scrape time

URL normalisation

All item <link> values must be absolute. Resolve against the page's <base href> if present, otherwise against the origin of the page URL.

Feed Format Specs

RSS 2.0 required elements

<rss version="2.0">
  <channel>
    <title>, <link>, <description>   ← required channel fields
    <item>
      <title>, <link>, <guid>        ← required per item
      <pubDate>                      ← RFC 822 (e.g. Mon, 01 Jan 2024 00:00:00 +0000)
      <description>                  ← plain text or CDATA-wrapped HTML

Atom 1.0 required elements

<feed xmlns="http://www.w3.org/2005/Atom">
  <title>, <id>, <updated>           ← required feed fields
  <entry>
    <title>, <id>, <updated>         ← required per entry
    <link href="…" rel="alternate"/> ← post URL
    <summary> or <content>           ← excerpt or full body

Full templates → references/feed-templates.md

Output Modes

Mode	Description
`xml`	Raw feed XML in a fenced code block
`file`	Save to `/mnt/user-data/outputs/<slug>-feed.xml` and present link
`html`	HTML discovery page with `<link rel="alternate">` + embedded XML
`both`	`file` + `html` wrapper saved as `<slug>-feed-page.html`

Default: file (saves XML and presents download link).

Error Handling

Problem	Response
Page returns non-200	Report HTTP status; suggest checking URL or auth
No repeating post structure found	Show raw HTML skeleton; ask user to identify the pattern
Feed already exists	Report the existing feed URL; offer to mirror/augment
Dates unparseable	Use today's date; flag items with `<!-- date-estimated -->`
Relative URLs unresolvable	Ask user for the site's base URL

Commands

Command	Description
`/help`	Usage guidance for this skill
`/format [rss	atom]`
`/limit [n]`	Set maximum number of feed items
`/fulltext`	Attempt to embed full post body (fetches each post URL)
`/validate`	Run checklist from `references/validation-checklist.md`
`/preview`	Show first 3 items as formatted Markdown before XML

Operational Rules

Always check for an existing feed before generating a synthetic one.
Never fabricate post content — only use text found on the page.
Escape &, <, > in text nodes; use CDATA for HTML description bodies.
All <guid> / Atom <id> values must be the canonical post URL.
Sort items newest-first by default (pubDate DESC).
Do not request or store credentials; only scrape publicly accessible pages.
Clearly label generated feeds as synthetic (add <generator> tag).
Respect robots.txt — do not scrape pages that disallow crawlers.

rss-feed-generator

Safety Notice

Copy this and send it to your AI assistant to learn