X News Crawler
Use this skill to run one command and get structured X news JSON.
Prerequisites (Required)
Before any crawl command, run:
pnpm add -g agent-browser-stealth
pnpm approve-builds -g
Run Chrome with CDP on the regular profile before crawling:
/Applications/Google\ Chrome.app/Contents/MacOS/Google\ Chrome --remote-debugging-port=9333
Do not use --user-data-dir in that Chrome launch command.
Workflow
- Run the CLI, not raw
abscommands. - Use
--mode hybridby default: fetchtopfirst, thenlatest, then sort by time and dedupe. - Read JSON output and summarize signal/noise for the user.
Commands
From repository root, run:
./bin/x-news-crawler --query "openclaw" --mode hybrid --since-hours 72 --limit 30 --output .tmp/openclaw-news.json
Quick probes:
./bin/x-news-crawler --query "openclaw" --mode top --limit 10
./bin/x-news-crawler --query "openclaw" --mode latest --since-hours 24 --limit 20
./bin/x-news-crawler --query "openclaw" --cdp 9333 --limit 20
Output Contract
The CLI returns JSON with:
fetched_atquerymodecountwarnings[]failed_sources[]rows[]with:source,datetime,status_url,user,text,replies,reposts,likes
Guardrails
- Prefer
hybridunless the user explicitly asks onlytoporlatest. - Keep
since-hourssmall (24-168) to avoid stale content. - Treat unverified rumor posts as low confidence even with high engagement.
Reference
See cli.md for full flags.