auto-captcha-solver

Detect and solve simple image captchas during browser automation. Use when flows encounter 4-6 character text, distorted alphanumeric, numeric, rotated, or arithmetic captchas and need capture, OCR, optional calculation, input fill, and submit handling in Playwright, Puppeteer, or Selenium. Do not use for reCAPTCHA, hCaptcha, slider, or click-object challenges.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "auto-captcha-solver" with this command: npx skills add cx6226301/auto-captcha-solver

Auto Captcha Solver

Use this skill to solve simple captcha images in browser automation.

Supported Captcha Types

  • 4 to 6 character text captchas
  • Distorted alphanumeric captchas
  • Numeric captchas
  • Simple rotated characters
  • Arithmetic captchas (example: 3+8)

Do not use this skill for reCAPTCHA, hCaptcha, sliders, or click-object challenges.

Workflow

  1. Detect a captcha image element from the page.
  2. Capture a screenshot buffer of the captcha.
  3. Run preprocessing (grayscale, contrast normalization, resize, noise reduction).
  4. Run OCR and clean output.
  5. Detect arithmetic patterns and evaluate if needed.
  6. Fill the captcha input and optionally submit.

Capture Guidance

  • Prefer screenshotting only the captcha element, not the full page.
  • Accept only trusted http or https image URLs when reading captcha image source.
  • Reject suspicious schemes like javascript: or file:.
  • Enforce image size and pixel limits before OCR.

Return Format

Return a result object with:

  • solved: boolean
  • value: solved captcha text
  • type: alphanumeric, numeric, arithmetic, or unknown
  • confidence: OCR confidence score
  • hash: SHA1 image hash (cache key)
  • fromCache: optional boolean when a cached answer is used

Module Map

  • solve.js: main entry for solving an image buffer
  • preprocess.js: image normalization pipeline
  • ocr.js: OCR and text cleanup with multiple passes
  • cache.js: SHA1 captcha cache
  • browser.js: automation helpers for Playwright, Puppeteer, and Selenium

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Automation

Email Excel Transfer

Automatyzuje workflow pobierania danych z email i wstawiania ich do arkuszy kalkulacyjnych. Użyj gdy użytkownik chce przenieść informacje z poczty do Excela.

Registry SourceRecently Updated
Automation

Memori

Long-term memory for OpenClaw agents using the Memori SDK. Automatically captures conversations and equips the agent with explicit tools to recall context ac...

Registry SourceRecently Updated
Automation

Paired \u2014 Bluetooth Phone Bridge

Bridge an OpenClaw agent to the user's own phone via Bluetooth and ADB-over-USB. Provides SMS receive (MAP/MNS), SMS send (ADB autosend), outgoing calls (HFP...

Registry SourceRecently Updated
Automation

Billons Ai

Provides AI agent verification and secure identification within the Billons Network to assist users in unlocking system rewards.

Registry SourceRecently Updated