auto-captcha-solver

Detect and solve simple image captchas during browser automation. Use when flows encounter 4-6 character text, distorted alphanumeric, numeric, rotated, or arithmetic captchas and need capture, OCR, optional calculation, input fill, and submit handling in Playwright, Puppeteer, or Selenium. Do not use for reCAPTCHA, hCaptcha, slider, or click-object challenges.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "auto-captcha-solver" with this command: npx skills add cx6226301/auto-captcha-solver

Auto Captcha Solver

Use this skill to solve simple captcha images in browser automation.

Supported Captcha Types

  • 4 to 6 character text captchas
  • Distorted alphanumeric captchas
  • Numeric captchas
  • Simple rotated characters
  • Arithmetic captchas (example: 3+8)

Do not use this skill for reCAPTCHA, hCaptcha, sliders, or click-object challenges.

Workflow

  1. Detect a captcha image element from the page.
  2. Capture a screenshot buffer of the captcha.
  3. Run preprocessing (grayscale, contrast normalization, resize, noise reduction).
  4. Run OCR and clean output.
  5. Detect arithmetic patterns and evaluate if needed.
  6. Fill the captcha input and optionally submit.

Capture Guidance

  • Prefer screenshotting only the captcha element, not the full page.
  • Accept only trusted http or https image URLs when reading captcha image source.
  • Reject suspicious schemes like javascript: or file:.
  • Enforce image size and pixel limits before OCR.

Return Format

Return a result object with:

  • solved: boolean
  • value: solved captcha text
  • type: alphanumeric, numeric, arithmetic, or unknown
  • confidence: OCR confidence score
  • hash: SHA1 image hash (cache key)
  • fromCache: optional boolean when a cached answer is used

Module Map

  • solve.js: main entry for solving an image buffer
  • preprocess.js: image normalization pipeline
  • ocr.js: OCR and text cleanup with multiple passes
  • cache.js: SHA1 captcha cache
  • browser.js: automation helpers for Playwright, Puppeteer, and Selenium

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Automation

Usercentrics

Usercentrics integration. Manage data, records, and automate workflows. Use when the user wants to interact with Usercentrics data.

Registry SourceRecently Updated
Automation

Session Cost

Analyze OpenClaw session logs to report token usage, costs, and performance metrics grouped by agent and model. Use when the user asks about API spending, to...

Registry SourceRecently Updated
Automation

Agented

Stateful, persistent text editor for LLM agents. Undo tree, marks, annotations, transactions. Backed by SQLite.

Registry SourceRecently Updated
Automation

Agent Memory System v8

Agent 记忆系统 — 6维坐标编码 + RRF双路检索 + sqlite-vec统一存储 + 写入时因果检测 + 多Agent共享 + 记忆蒸馏 + 时间旅行 + 情感编码 + 元认知 + 内在动机 + 叙事自我 + 数字孪生 + 角色模板

Registry SourceRecently Updated