daily-scan

Scan photographed documents into searchable PDFs with OCR and stable file naming. Use when the user sends document photos and asks to scan, save, archive, OCR, or retrieve prior scans. Trigger words: 스캔 or scan for a single page, 스캔연속 or scan multi for a multi-page combined PDF, and 스캔찾아 or scan find for retrieval.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Install skill "daily-scan" with this command: npx skills add kimjoohyeon-wq/daily-scan

Daily Scan

Overview

Turn phone photos of documents into searchable PDFs with OCR and stable filenames based on capture date and headline text. Preserve the original photo, generate a readable scan-like PDF, and support later retrieval of saved scan files.

Runtime Requirements

  • Default OCR path: local Tesseract CLI
  • Required local dependencies for the stable path:
    • tesseract
    • Python packages used by bundled scripts: opencv-python (imported as cv2), Pillow, and either reportlab or ocrmypdf, depending on the active PDF path
  • Optional/experimental OCR path:
    • PaddleOCR-based script exists but is not the default stable engine
  • No cloud upload is required for core operation
  • The skill assumes bundled helper scripts under scripts/ are present and callable by the host agent
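A host agent may want to verify these requirements before the first scan. The following is a minimal sketch, not part of the bundled scripts: the function name check_dependencies and the module lists are assumptions drawn from the package names above (opencv-python imports as cv2, Pillow imports as PIL).

```python
import importlib.util
import shutil

# Assumed import names for the packages listed above.
REQUIRED_MODULES = ["cv2", "PIL"]
PDF_MODULES = ["reportlab", "ocrmypdf"]  # at least one is needed


def check_dependencies(which=shutil.which, find_spec=importlib.util.find_spec):
    """Return a list of human-readable problems; an empty list means ready.

    `which` and `find_spec` are injectable so the check can be tested
    without touching the real environment.
    """
    problems = []
    if which("tesseract") is None:
        problems.append("tesseract CLI not found on PATH")
    for name in REQUIRED_MODULES:
        if find_spec(name) is None:
            problems.append(f"Python module missing: {name}")
    if not any(find_spec(name) is not None for name in PDF_MODULES):
        problems.append("need one of: reportlab, ocrmypdf")
    return problems


if __name__ == "__main__":
    for problem in check_dependencies():
        print(problem)
```

The check only confirms that the tools can be found; it does not confirm that the Korean language data for Tesseract is installed.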

Workflow

  1. Confirm the trigger.
    • 스캔 / scan: one-page processing
    • 스캔연속 / scan multi: combine multiple photos into one searchable PDF
    • 스캔찾아 / scan find: search previously saved scan files
  2. Collect attached image files or search keywords.
  3. Apply document-style cleanup when possible.
    • straighten or rotate when needed
    • improve contrast for readability
    • keep output practical rather than over-processed
  4. Run OCR in Korean and English.
  5. Build the filename as:
    • YYYY-MM-DD + headline text
    • derive headline text from the top 2 to 3 OCR lines
  6. Create a searchable PDF.
  7. Save output to the local storage destination.
  8. Keep the original image with the processed result.
  9. For retrieval requests, search by date, headline text, or OCR keyword in the configured scan storage path.
  10. Return:
    • filename
    • save location
    • OCR title line
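The filename step can be sketched as follows. This is an illustration under stated assumptions, not the bundled implementation: the function name build_scan_filename, the space separator between date and headline, the .pdf suffix, the two-line default, and the set of stripped characters are all hypothetical choices.

```python
import re
from datetime import date


def build_scan_filename(capture_date, ocr_lines, max_lines=2, max_len=60):
    """Build 'YYYY-MM-DD headline.pdf' from the first non-empty OCR lines.

    Returns None when no usable headline text is found, so the caller
    can ask the user for a title instead.
    """
    # Take the first non-empty lines as the headline candidate.
    top = [line.strip() for line in ocr_lines if line.strip()][:max_lines]
    headline = " ".join(top)
    # Drop characters that are unsafe in filenames on common filesystems.
    headline = re.sub(r'[\\/:*?"<>|\x00-\x1f]', "", headline)
    # Collapse whitespace runs and cap the length.
    headline = re.sub(r"\s+", " ", headline).strip()[:max_len].strip()
    if not headline:
        return None
    return f"{capture_date.isoformat()} {headline}.pdf"


print(build_scan_filename(date(2024, 3, 5), ["Tax Invoice", "No: 12/34", "body text"]))
# prints: 2024-03-05 Tax Invoice No 1234.pdf
```

Korean headline text passes through unchanged, since only path separators, reserved punctuation, and control characters are removed.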

Storage Rules

  • Default local staging/search path: daily-scan-storage/YYYY-MM
  • This skill is designed for local scan creation and retrieval only
  • Use year/month folder structure
  • Do not auto-classify document types
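The year/month layout and a filename-based lookup could look roughly like this. The helper names month_folder and find_scans are hypothetical, and this sketch matches only on filenames (date and headline); searching OCR text inside saved PDFs would need additional tooling that is not shown here.

```python
from datetime import date
from pathlib import Path

STORAGE_ROOT = Path("daily-scan-storage")  # default path from the rules above


def month_folder(capture_date, root=STORAGE_ROOT):
    """Return root/YYYY-MM for the given capture date."""
    return Path(root) / f"{capture_date:%Y-%m}"


def find_scans(keyword, root=STORAGE_ROOT, month=None):
    """Return saved PDFs whose filename contains keyword (case-insensitive).

    `month` optionally restricts the search to one 'YYYY-MM' folder.
    """
    root = Path(root)
    base = root / month if month else root
    if not base.is_dir():
        return []
    needle = keyword.casefold()
    return sorted(p for p in base.rglob("*.pdf") if needle in p.name.casefold())


if __name__ == "__main__":
    print(month_folder(date(2024, 3, 5)))
    for hit in find_scans("invoice"):
        print(hit)
```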

Operating Rules

  • For multi-page capture, combine pages into one PDF only when the trigger is 스캔연속 or scan multi
  • OCR language defaults to Korean plus English
  • Retrieval requests should search existing saved scan outputs before asking follow-up questions
  • Keep replies concise
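Because the multi-page rule depends on recognizing the exact trigger, one possible way to map a message to a mode is shown below. The parse_trigger name and the mode labels "single", "multi", and "find" are assumptions for illustration; the skill itself does not specify how triggers are parsed.

```python
TRIGGERS = {
    "스캔연속": "multi",
    "scan multi": "multi",
    "스캔찾아": "find",
    "scan find": "find",
    "스캔": "single",
    "scan": "single",
}


def parse_trigger(message):
    """Return (mode, rest) for a leading trigger word, else (None, message)."""
    text = message.strip()
    # Try the longest triggers first so 'scan multi' is not read as 'scan'.
    for trigger in sorted(TRIGGERS, key=len, reverse=True):
        head, rest = text[:len(trigger)], text[len(trigger):]
        # Require a word boundary so 'scanner' does not count as 'scan'.
        if head.casefold() == trigger and (not rest or rest[0].isspace()):
            return TRIGGERS[trigger], rest.strip()
    return None, text


print(parse_trigger("scan find lease"))
# prints: ('find', 'lease')
```

Only the "multi" mode should lead to combining pages into one PDF; any other mode processes a single page or runs a search.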

Failure Handling

  • If OCR fails, still save the PDF when possible and explicitly report that OCR failed
  • If headline extraction fails, ask the user what title to use
  • Preserve the original image unless the user later asks otherwise
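These rules can be expressed as a small control-flow sketch. Everything here is hypothetical scaffolding: run_ocr and save_pdf stand in for whatever the bundled scripts expose, and ScanResult is an invented container for the outcome.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class ScanResult:
    pdf_saved: bool
    ocr_ok: bool
    title: Optional[str]
    needs_title_from_user: bool
    messages: List[str]


def process_scan(run_ocr: Callable[[], List[str]],
                 save_pdf: Callable[[bool], None]) -> ScanResult:
    """Apply the failure rules; run_ocr and save_pdf are hypothetical hooks."""
    messages: List[str] = []
    try:
        lines = run_ocr()
        ocr_ok = True
    except Exception as exc:  # an OCR error is reported, never swallowed
        lines, ocr_ok = [], False
        messages.append(f"OCR failed: {exc}")
    # Still attempt to save the PDF, without a text layer if OCR failed.
    try:
        save_pdf(ocr_ok)
        pdf_saved = True
    except Exception as exc:
        pdf_saved = False
        messages.append(f"PDF save failed: {exc}")
    # First non-empty OCR line is the title; otherwise ask the user.
    title = next((line.strip() for line in lines if line.strip()), None)
    if title is None:
        messages.append("No headline found; please provide a title.")
    return ScanResult(pdf_saved, ocr_ok, title, title is None, messages)
```

The original image is never touched in this flow, which matches the rule that it is preserved unless the user asks otherwise.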

Current Limits

  • Korean searchable PDF quality depends on OCR engine quality and PDF text-layer handling
  • The Tesseract path is the current stable default
  • The PaddleOCR path is experimental and should not be treated as the default engine
  • This skill does not require external upload tools or cloud credentials
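For the stable Tesseract path, a searchable PDF can be produced directly by the Tesseract CLI using its pdf output config and the kor+eng language pair. Whether the bundled scripts invoke it this way is not stated, so treat the wrapper below as a sketch; the function names are hypothetical, and the Korean language data must be installed for kor to work.

```python
import subprocess
from pathlib import Path


def tesseract_pdf_command(image_path, output_base, languages=("kor", "eng")):
    """Build the argv for: tesseract IMAGE OUTPUTBASE -l kor+eng pdf"""
    return ["tesseract", str(image_path), str(output_base),
            "-l", "+".join(languages), "pdf"]


def run_tesseract_pdf(image_path, output_base):
    """Run Tesseract, which writes OUTPUTBASE.pdf.

    Raises subprocess.CalledProcessError if Tesseract exits with an error.
    """
    cmd = tesseract_pdf_command(image_path, output_base)
    subprocess.run(cmd, check=True, capture_output=True)
    return Path(f"{output_base}.pdf")


print(tesseract_pdf_command("page.png", "out/2024-03-05 Tax Invoice"))
```

Passing the arguments as a list avoids shell quoting problems with spaces or Korean characters in filenames.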

Output Contract

Return only the practical result:

  • saved filename
  • save location
  • extracted title line when available

Resources

scripts/

Bundled scripts are used for:

  • image cleanup
  • OCR execution
  • searchable PDF generation
  • saved scan retrieval

references/

Store implementation notes for OCR engine choice and filename normalization if the skill grows more complex.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.
