kb-collector

Knowledge Base Collector - saves YouTube videos, URLs, and text to Obsidian with AI summarization. It auto-transcribes videos, fetches pages, and supports weekly/monthly digest emails and nightly research.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Install skill "kb-collector" with this command: npx skills add arbiger/kb-collector

KB Collector

Knowledge Base Collector - Save YouTube, URLs, and text to Obsidian with automatic transcription and summarization.

Features

  • YouTube Collection - Download audio, transcribe with Whisper, auto-summarize
  • URL Collection - Fetch and summarize web pages
  • Plain Text - Direct save with tags
  • Digest - Weekly/Monthly/Yearly review emails
  • Nightly Research - Automated AI/LLM/tech trend tracking
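
As a rough illustration of the Digest feature, the sketch below picks the notes that fall inside a review window by reading the yyyy-mm-dd prefix that the Output Format section describes for note filenames. The function names, the `PERIOD_DAYS` table, and the 7/30/365-day windows are assumptions made for this example; the actual scripts may select notes differently.

```python
from datetime import date, timedelta
from pathlib import Path

# Hypothetical window sizes; the real digest may use calendar weeks or months.
PERIOD_DAYS = {"weekly": 7, "monthly": 30, "yearly": 365}


def note_date(path: Path):
    """Parse the yyyy-mm-dd prefix of a note filename, or return None."""
    try:
        return date.fromisoformat(path.name[:10])
    except ValueError:
        return None


def notes_for_digest(vault: Path, period: str, today: date) -> list[Path]:
    """Return notes in the vault root dated within the chosen period, oldest first."""
    cutoff = today - timedelta(days=PERIOD_DAYS[period])
    selected = []
    for path in sorted(vault.glob("*.md")):
        created = note_date(path)
        # Files without a date prefix are skipped rather than guessed at.
        if created is not None and cutoff < created <= today:
            selected.append(path)
    return selected
```

Sorting by filename is enough to get chronological order here because the date prefix sorts lexicographically.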

Installation

# Install dependencies
pip install yt-dlp faster-whisper requests beautifulsoup4

# For AI summarization (optional)
pip install openai anthropic

Usage (Python Version - Recommended)

# Collect YouTube video
python3 scripts/collect.py youtube "https://youtu.be/xxxxx" "stock,investing"

# Collect URL
python3 scripts/collect.py url "https://example.com/article" "python,api"

# Collect plain text
python3 scripts/collect.py text "My note content" "tag1,tag2"
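
The three commands above share one shape: a mode, the content, and a comma-separated tag string. A minimal sketch of how such arguments could be parsed with argparse follows. It is not taken from scripts/collect.py; `build_parser` and `parse_tags` are hypothetical names, and treating the tags argument as optional is an assumption.

```python
import argparse


def parse_tags(raw: str) -> list[str]:
    """Split a comma-separated tag string, dropping blanks and stray spaces."""
    return [tag.strip() for tag in raw.split(",") if tag.strip()]


def build_parser() -> argparse.ArgumentParser:
    """Build a parser mirroring the `mode content tags` call shape."""
    parser = argparse.ArgumentParser(prog="collect.py")
    parser.add_argument("mode", choices=["youtube", "url", "text"])
    parser.add_argument("content", help="video URL, page URL, or note text")
    # Assumption: tags may be omitted, in which case the note gets no tags.
    parser.add_argument("tags", nargs="?", default="", help="comma-separated tags")
    return parser


# Parse an explicit argument list instead of sys.argv to keep the example standalone.
args = build_parser().parse_args(["text", "My note content", "tag1,tag2"])
print(args.mode, args.content, parse_tags(args.tags))
# prints: text My note content ['tag1', 'tag2']
```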

Usage (Bash Version - Legacy)

# Collect YouTube
./scripts/collect.sh "https://youtu.be/xxxxx" "stock,investing" youtube

# Collect URL
./scripts/collect.sh "https://example.com/article" "python,api" url

# Collect plain text
./scripts/collect.sh "My note" "tag1,tag2" text

Nightly Research (New!)

Automated AI/LLM/tech trend tracking - runs daily and saves to Obsidian.

# Save to Obsidian only
./scripts/nightly-research.sh --save

# Save to Obsidian AND send email
./scripts/nightly-research.sh --save --send

# Send email only
./scripts/nightly-research.sh --send

Features

  • Searches multiple sources (Hacker News, Reddit, Twitter)
  • LLM summarization (optional)
  • Saves to Obsidian with tags
  • Optional email digest
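
To make the optional email digest concrete, here is a small sketch that composes a plain-text message from a list of research items using Python's standard `email.message.EmailMessage`. The nightly script itself is a shell script, so this is only an illustration: the function name, the `(title, url, summary)` item shape, and the subject line are all assumptions. Delivery (for example via `smtplib`) is left out.

```python
from email.message import EmailMessage


def build_digest_email(items, sender, recipient, day):
    """Compose a plain-text digest; `items` is a list of (title, url, summary) tuples."""
    msg = EmailMessage()
    msg["Subject"] = f"Nightly Research {day}"  # hypothetical subject format
    msg["From"] = sender
    msg["To"] = recipient
    body = [f"Nightly research for {day}", ""]
    for index, (title, url, summary) in enumerate(items, start=1):
        # One numbered entry per item, followed by a blank separator line.
        body += [f"{index}. {title}", f"   {url}", f"   {summary}", ""]
    msg.set_content("\n".join(body))
    return msg
```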

Cron Setup (optional)

# Run every night at 10 PM
0 22 * * * /path/to/nightly-research.sh --save --send

Configuration

For the Python version, edit these variables in the script to set the vault location and the author recorded in each note:

VAULT_PATH = os.path.expanduser("~/Documents/YourVault")
NOTE_AUTHOR = "YourName"

Output Format

Notes are saved to {VAULT_PATH}/yyyy-mm-dd-title.md in the following layout:

---
created: 2026-03-03T12:00:00
source: https://...
tags: [stock, investing]
author: George
---

# Title

> **TLDR:** Summary here...

---

Content...

---
*Saved: 2026-03-03*
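
The layout above can be reproduced with a few lines of Python. The sketch below builds the filename and the note body from their parts. It is an illustration under assumptions, not the code in scripts/collect.py: `slugify`, `note_filename`, and `render_note` are hypothetical helpers, and the way the title is turned into a filename is a guess.

```python
import re
from datetime import datetime


def slugify(title: str) -> str:
    """Lowercase the title and collapse runs of non-alphanumerics into hyphens."""
    slug = re.sub(r"[^a-z0-9]+", "-", title.lower()).strip("-")
    # Titles with no ASCII letters or digits fall back to a fixed name.
    return slug or "untitled"


def note_filename(title: str, created: datetime) -> str:
    """Build the yyyy-mm-dd-title.md filename."""
    return f"{created:%Y-%m-%d}-{slugify(title)}.md"


def render_note(title, summary, content, source, tags, author, created) -> str:
    """Render frontmatter, TLDR, body, and footer in the layout shown above."""
    lines = [
        "---",
        f"created: {created.isoformat(timespec='seconds')}",
        f"source: {source}",
        f"tags: [{', '.join(tags)}]",
        f"author: {author}",
        "---",
        "",
        f"# {title}",
        "",
        f"> **TLDR:** {summary}",
        "",
        "---",
        "",
        content,
        "",
        "---",
        f"*Saved: {created:%Y-%m-%d}*",
    ]
    return "\n".join(lines) + "\n"
```

Writing the result is then a matter of joining `VAULT_PATH` with `note_filename(...)` and saving the rendered string as UTF-8.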

Dependencies

  • yt-dlp
  • faster-whisper (for transcription)
  • requests + beautifulsoup4 (for URL fetching)
  • Optional: openai/anthropic (for AI summarization)

Credits

Automated note-taking workflow for Obsidian.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.
