Auto Captcha Solver
Use this skill to solve simple captcha images in browser automation.
Supported Captcha Types
- 4 to 6 character text captchas
- Distorted alphanumeric captchas
- Numeric captchas
- Simple rotated characters
- Arithmetic captchas (example: 3+8)
Do not use this skill for reCAPTCHA, hCaptcha, sliders, or click-object challenges.
Workflow
- Detect a captcha image element from the page.
- Capture a screenshot buffer of the captcha.
- Run preprocessing (grayscale, contrast normalization, resize, noise reduction).
- Run OCR and clean output.
- Detect arithmetic patterns and evaluate if needed.
- Fill the captcha input and optionally submit.
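
The OCR cleanup and arithmetic steps above can be sketched roughly as follows. This is a minimal illustration, not the skill's actual code: the helper names cleanOcrText and solveArithmetic are hypothetical, and the sketch assumes arithmetic captchas use only two non-negative integers joined by +, - or *, optionally followed by "=" or "=?".

```javascript
// Hypothetical helpers for illustration; names are not taken from the skill's modules.

// Drop whitespace, then anything outside the characters a simple captcha can contain.
function cleanOcrText(raw) {
  return raw.replace(/\s+/g, "").replace(/[^0-9A-Za-z+\-*=?]/g, "");
}

// Matches "<int><op><int>" with an optional trailing "=" or "=?" (assumed format).
const ARITHMETIC = /^(\d+)([+\-*])(\d+)(?:=\??)?$/;

// Returns the evaluated answer as a string, or null when the text is not arithmetic.
function solveArithmetic(text) {
  const match = ARITHMETIC.exec(text);
  if (!match) return null;
  const left = Number(match[1]);
  const right = Number(match[3]);
  switch (match[2]) {
    case "+": return String(left + right);
    case "-": return String(left - right);
    default: return String(left * right); // only "*" remains
  }
}

const cleaned = cleanOcrText(" 3 + 8 = ?\n");
console.log(cleaned, solveArithmetic(cleaned)); // prints: 3+8=? 11
```

Parsing the operands with a regular expression instead of calling eval keeps OCR output from ever being executed as code.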
Capture Guidance
- Prefer screenshotting only the captcha element, not the full page.
- Accept only trusted http or https image URLs when reading captcha image source.
- Reject suspicious schemes like javascript: or file:.
- Enforce image size and pixel limits before OCR.
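
A rough sketch of the URL and size checks above, using only the standard URL class. The function names and both limits (MAX_BYTES, MAX_PIXELS) are assumptions made for illustration, not values defined by this skill. The sketch checks the scheme only; deciding which hosts count as trusted is left to the caller, as is obtaining the image width and height from whatever image library is in use.

```javascript
// Assumed limits for illustration; tune them to the captchas you expect.
const MAX_BYTES = 512 * 1024;
const MAX_PIXELS = 1000000;
const ALLOWED_PROTOCOLS = new Set(["http:", "https:"]);

// True only when the (optionally base-resolved) URL uses http or https.
// javascript:, file:, data: and unparsable values are all rejected.
function isAllowedImageUrl(src, baseUrl) {
  try {
    const url = new URL(src, baseUrl);
    return ALLOWED_PROTOCOLS.has(url.protocol);
  } catch {
    return false; // not a valid URL
  }
}

// Check byte size and pixel count before handing the image to OCR.
function withinLimits(byteLength, width, height) {
  return (
    byteLength <= MAX_BYTES &&
    width > 0 &&
    height > 0 &&
    width * height <= MAX_PIXELS
  );
}

console.log(isAllowedImageUrl("/captcha.png", "https://example.com/login")); // true
console.log(isAllowedImageUrl("javascript:alert(1)")); // false
```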
Return Format
Return a result object with:
- solved: boolean
- value: solved captcha text
- type: alphanumeric, numeric, arithmetic, or unknown
- confidence: OCR confidence score
- hash: SHA1 image hash (cache key)
- fromCache: optional boolean when a cached answer is used
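
As an illustration of the shape described above, the following sketch assembles a result object from an image buffer and an OCR answer. The helper names (sha1Hex, classify, buildResult) are hypothetical, and the confidence value is assumed to be passed through unchanged from whichever OCR engine is used.

```javascript
const { createHash } = require("node:crypto");

// SHA1 of the raw image bytes, hex encoded, used as the cache key.
function sha1Hex(buffer) {
  return createHash("sha1").update(buffer).digest("hex");
}

// Pick the result type from the solved text (hypothetical classification rules).
function classify(value, wasArithmetic) {
  if (wasArithmetic) return "arithmetic";
  if (/^\d+$/.test(value)) return "numeric";
  if (/^[0-9A-Za-z]+$/.test(value)) return "alphanumeric";
  return "unknown";
}

// Assemble the result object; an empty value counts as unsolved.
function buildResult(imageBuffer, value, confidence, wasArithmetic = false) {
  return {
    solved: value.length > 0,
    value,
    type: classify(value, wasArithmetic),
    confidence,
    hash: sha1Hex(imageBuffer),
  };
}

console.log(buildResult(Buffer.from("abc"), "A7kQ2", 0.91));
```

The fromCache field is omitted here because it is optional and only set when an answer comes from the cache.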
Module Map
- solve.js: main entry for solving an image buffer
- preprocess.js: image normalization pipeline
- ocr.js: OCR and text cleanup with multiple passes
- cache.js: SHA1 captcha cache
- browser.js: automation helpers for Playwright, Puppeteer, and Selenium
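
To make the role of a SHA1-keyed cache concrete, here is a minimal in-memory sketch. It is not the contents of cache.js: the CaptchaCache class, its method names, the default of 500 entries, and the oldest-first eviction are all assumptions chosen for illustration.

```javascript
const { createHash } = require("node:crypto");

// Hypothetical in-memory cache keyed by the SHA1 of the image bytes.
class CaptchaCache {
  constructor(maxEntries = 500) {
    this.maxEntries = maxEntries;
    this.entries = new Map(); // Map keeps insertion order, so the first key is the oldest
  }

  static keyFor(imageBuffer) {
    return createHash("sha1").update(imageBuffer).digest("hex");
  }

  // Returns a copy of the stored result marked fromCache, or null on a miss.
  get(imageBuffer) {
    const hit = this.entries.get(CaptchaCache.keyFor(imageBuffer));
    return hit ? { ...hit, fromCache: true } : null;
  }

  // Stores only solved results and evicts the oldest entry when full.
  set(imageBuffer, result) {
    if (!result.solved) return;
    const key = CaptchaCache.keyFor(imageBuffer);
    this.entries.delete(key);
    if (this.entries.size >= this.maxEntries) {
      this.entries.delete(this.entries.keys().next().value);
    }
    this.entries.set(key, { ...result, hash: key });
  }
}

const cache = new CaptchaCache();
const image = Buffer.from("example image bytes");
cache.set(image, { solved: true, value: "A7kQ2", type: "alphanumeric", confidence: 0.91 });
console.log(cache.get(image).fromCache); // true
```

Hashing the image bytes means an identical captcha image served twice can be answered without a second OCR pass.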