data-pods

Create and manage modular portable database pods (SQLite + metadata + embeddings). Includes document ingestion with embeddings for semantic search. Full automation - just ask.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "data-pods" with this command: npx skills add init-v/initv-data-pods

Data Pods

Overview

Create and manage portable, consent-scoped database pods. Handles document ingestion with embeddings and semantic search.

Architecture

┌─────────────┐     ┌─────────────┐     ┌─────────────┐
│  Ingestion  │ ──► │   DB Pods   │ ──► │  Generation │
│  (ingest)   │     │  (storage)  │     │   (query)   │
└─────────────┘     └─────────────┘     └─────────────┘

Triggers

  • "create a pod" / "new pod"
  • "list my pods" / "what pods do I have"
  • "add to pod" / "add note" / "add content"
  • "query pod" / "search pod"
  • "ingest documents" / "add files"
  • "semantic search" / "find相关内容"
  • "export pod" / "pack pod"

Core Features

1. Create Pod

When user asks to create a pod:

  1. Ask for pod name and type (scholar/health/shared/projects)
  2. Run: python3 .../scripts/pod.py create <name> --type <type>
  3. Confirm creation

2. Add Content (Manual)

When user asks to add content:

  1. Ask for pod name, title, content, tags
  2. Run: python3 .../scripts/pod.py add <pod> --title "<title>" --content "<content>" --tags "<tags>"
  3. Confirm

3. Ingest Documents (Automated)

When user wants to ingest files:

  1. Ask for pod name and folder path
  2. Run: python3 .../scripts/ingest.py ingest <pod> <folder>
  3. Supports: PDF, TXT, MD, DOCX, PNG, JPG
  4. Auto-embeds text (if sentence-transformers installed)

4. Semantic Search

When user wants to search:

  1. Ask for pod name and query
  2. Run: python3 .../scripts/ingest.py search <pod> "<query>"
  3. Returns ranked results with citations

5. Query (Basic)

When user asks to search notes:

  1. Run: python3 .../scripts/pod.py query <pod> --text "<query>"

6. Export

When user asks to export:

  1. Run: python3 .../scripts/podsync.py pack <pod>

Dependencies

pip install PyPDF2 python-docx pillow pytesseract sentence-transformers

Storage Location

~/.openclaw/data-pods/

Key Commands

# Create pod
python3 .../scripts/pod.py create research --type scholar

# Add note
python3 .../scripts/pod.py add research --title "..." --content "..." --tags "..."

# Ingest folder
python3 .../scripts/ingest.py ingest research ./documents/

# Semantic search
python3 .../scripts/ingest.py search research "transformers"

# List documents
python3 .../scripts/ingest.py list research

# Query notes
python3 .../scripts/pod.py query research --text "..."

Notes

  • Ingestion auto-chunks large documents
  • Embeddings enable semantic search
  • File hash prevents duplicate ingestion
  • All data stored locally in SQLite

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

SQL Data Analyst

Natural language to SQL. Ask questions about your data in plain English, get queries, results, and explanations. Supports SQLite, PostgreSQL, and MySQL. Impo...

Registry SourceRecently Updated
4401Profile unavailable
General

SQL Guard Copilot

Simplify SQL querying and troubleshooting for MySQL, PostgreSQL, and SQLite. Use when users ask to inspect schema, convert natural language to SQL, debug SQL...

Registry SourceRecently Updated
3100Profile unavailable
General

Workspace Database Manager

Workspace Documentation and Tree Database Manager. SQLite-based indexing for all documentation, files, and directory structures with CSV/JSON export capabili...

Registry SourceRecently Updated
860Profile unavailable
General

Database Engineering Mastery

Database Engineering Mastery covers schema design, indexing, query optimization, and migration for PostgreSQL, MySQL, SQLite, supporting OLTP/OLAP workloads.

Registry SourceRecently Updated
7211Profile unavailable