indexing

When the user wants to fix indexing issues from Search Console, use noindex, or implement Google Indexing API. Also use when the user mentions "fix indexing," "not indexed," "Crawled - currently not indexed," "discovered - currently not indexed," "index coverage," "noindex," "noindex tag," "pages not indexed," "why not indexed," "request indexing," or "Google Indexing API." For sitemap, use xml-sitemap.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Install skill "indexing" with this command: npx skills add kostja94/indexing

SEO Technical: Indexing

Guides indexing troubleshooting and fix actions. For how to find and diagnose issues in GSC, see google-search-console.

When invoking: On first use, if helpful, open with 1–2 sentences on what this skill covers and why it matters, then provide the main output. On subsequent use or when the user asks to skip, go directly to the main output.

Scope (Technical SEO)

  • Fix actions: noindex, canonical, content quality, URL Inspection; verify robots.txt does not block (see robots-txt)
  • Noindex: Page-level index control; which pages to exclude and how. Complements robots-txt (path-level crawl control) and google-search-console (Coverage diagnosis)

Initial Assessment

Check for project context first: If .claude/project-context.md or .cursor/project-context.md exists, read it for site URL and indexing goals.

Identify issue from GSC (see google-search-console for Coverage report, issue types, diagnosis workflow). Then apply fix below.

Crawled - Currently Not Indexed

Cause | Action
Low quality, duplicate, off-topic | Improve content, fix duplicates, set correct canonical
Static assets (CSS/JS) | See below
Feed, share URLs with params | Usually OK to ignore; or noindex, canonical to main URL
Important content pages | Use URL Inspection, verify canonical/internal links/sitemap, Request indexing

Static Assets (Next.js / Vercel)

Vercel adds unique dpl= params to static assets per deploy, creating many "Crawled - currently not indexed" URLs.

Do | Don't
Keep robots.txt allowing /_next/ | Do not block /_next/ (breaks CSS/JS loading). See robots-txt
Accept static assets in GSC as expected | Do not block /_next/static/css/ or ?dpl=
Use X-Robots-Tag for static assets | CSS/JS should not be indexed; no SEO impact

Static assets listed under "Crawled - currently not indexed" are normal and expected.
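As a rough sketch of the X-Robots-Tag row above, the rule below is shaped like an entry returned from the `headers()` function in a Next.js config. The helper name `staticAssetHeaderRules` and the local types are illustrative assumptions, not part of the skill. Whether a given host applies custom headers to `/_next/static/` responses should be confirmed by checking the response headers of a deployed asset.

```typescript
// Local stand-ins for the shapes that a Next.js `headers()` config function returns.
type HeaderEntry = { key: string; value: string };
type HeaderRule = { source: string; headers: HeaderEntry[] };

// Hypothetical helper: builds the rule list for static assets.
function staticAssetHeaderRules(): HeaderRule[] {
  return [
    {
      // Path pattern only; query strings such as ?dpl=... are not part of the match.
      source: "/_next/static/:path*",
      // Ask crawlers not to index the asset while still letting them fetch it.
      headers: [{ key: "X-Robots-Tag", value: "noindex" }],
    },
  ];
}

// In next.config.ts this would be wired in roughly as:
//   const nextConfig = { async headers() { return staticAssetHeaderRules(); } };
//   export default nextConfig;
const staticRules = staticAssetHeaderRules();
console.log(JSON.stringify(staticRules, null, 2));
```

This leaves robots.txt untouched, so /_next/ stays crawlable and pages keep rendering correctly for Googlebot.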

Other Issue Types (from GSC Coverage)

Issue | Fix
Excluded by "noindex" tag | Remove noindex if accidental; keep if intentional
Blocked by robots.txt | See robots-txt; remove Disallow for important paths
Redirect / 404 | Fix URL or add redirect
Duplicate / Canonical | Set correct canonical; usually OK
Soft-404 | Page returns 200 but content says "not found" or is empty; Google may treat it as 404. Fix: return 404 status for truly missing pages, or add real content for 200 pages

Soft-404

A soft-404 occurs when a page returns HTTP 200 but the content indicates the page doesn't exist (e.g. "Page not found" message, empty state). Google may treat it as a 404 and exclude it from the index.

Fix | When
Return 404 | Page truly doesn't exist; use proper 404 status
Add content | Page is intentional (e.g. empty search results); ensure substantive content or use noindex
Redirect | If URL moved, use 301 to correct destination
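The three fixes can be read as one decision made before the response is sent. The sketch below is framework-agnostic. The `PageLookup` shape, the `decideResponse` name and the `minBodyLength` threshold of 200 characters are all assumptions chosen for illustration, so tune or replace them for a real site.

```typescript
// Hypothetical result of looking a URL up in the site's content store.
type PageLookup =
  | { kind: "found"; body: string }
  | { kind: "moved"; location: string }
  | { kind: "missing" };

type HttpDecision = { status: 200 | 301 | 404; location?: string; noindex?: boolean };

// Pick a status code so that missing pages never answer 200.
function decideResponse(lookup: PageLookup, minBodyLength = 200): HttpDecision {
  switch (lookup.kind) {
    case "missing":
      // Page truly does not exist: send a real 404.
      return { status: 404 };
    case "moved":
      // URL moved: send a permanent redirect to the new location.
      return { status: 301, location: lookup.location };
    case "found":
      // Intentional but thin page (e.g. empty search results): keep 200, add noindex.
      return lookup.body.trim().length < minBodyLength
        ? { status: 200, noindex: true }
        : { status: 200 };
  }
}

console.log(decideResponse({ kind: "missing" }));
console.log(decideResponse({ kind: "moved", location: "/new-path" }));
console.log(decideResponse({ kind: "found", body: "No results." }));
```

Character count is only a crude stand-in for "substantive content"; the point is that the status code follows from the lookup result and not from the template that happens to render.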

Noindex Usage

  • How: metadata.robots = { index: false } (Next.js Metadata API), or <meta name="robots" content="noindex"> in the page head, or an X-Robots-Tag: noindex HTTP response header
  • Rationale: Not all site content should be indexed; noindex is a valid choice for many pages
  • Caution: Avoid noindex on important content pages
  • With robots.txt: robots.txt = path-level crawl control; noindex = page-level index control. Do not block noindex pages in robots.txt—crawlers must access the page to read the directive. Use both: robots for /admin/, /api/; noindex for /login/, /thank-you/, etc. See robots-txt for when to use which.
  • nofollow ≠ noindex: nofollow controls link equity only; it does not prevent indexing. To exclude from search, use noindex. See page-metadata for meta robots implementation.
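A minimal sketch of the points above: one helper renders the directive as a meta tag or as a header, and another flags noindex pages that robots.txt would hide from crawlers. All function names are hypothetical. The robots.txt check is deliberately simplified: it treats each Disallow value as a plain path prefix and ignores wildcards, Allow rules and per-agent groups.

```typescript
type RobotsDirective = { index: boolean; follow: boolean };

// Render the directive value, e.g. "noindex,follow".
function robotsContent(d: RobotsDirective): string {
  return `${d.index ? "index" : "noindex"},${d.follow ? "follow" : "nofollow"}`;
}

// Page-level form: goes inside <head>.
function robotsMetaTag(d: RobotsDirective): string {
  return `<meta name="robots" content="${robotsContent(d)}">`;
}

// Header form: useful for non-HTML responses such as PDFs or static assets.
function robotsHeader(d: RobotsDirective): [string, string] {
  return ["X-Robots-Tag", robotsContent(d)];
}

// Simplified prefix match against Disallow values (assumption: no wildcards).
function isBlockedByRobots(path: string, disallowPrefixes: string[]): boolean {
  return disallowPrefixes.some((prefix) => prefix !== "" && path.startsWith(prefix));
}

// Paths that carry noindex but cannot be crawled, so the directive is never read.
function noindexConflicts(noindexPaths: string[], disallowPrefixes: string[]): string[] {
  return noindexPaths.filter((path) => isBlockedByRobots(path, disallowPrefixes));
}

const noindexFollow: RobotsDirective = { index: false, follow: true };
console.log(robotsMetaTag(noindexFollow));
console.log(robotsHeader(noindexFollow));
console.log(noindexConflicts(["/login/", "/thank-you/", "/admin/reports/"], ["/admin/", "/api/"]));
```

With these sample inputs only /admin/reports/ is flagged: it sits under a disallowed prefix, so a noindex directive on it would never be read.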

Page Types That Typically Need Noindex

Category | Page Types | Typical Meta | Reason
Auth & Account | Login, Signup, Password reset, Account dashboard | Login: noindex,nofollow; Signup: noindex,follow | No search value; login indexed = security risk; signup follow allows crawl of Privacy/Terms links
Admin & Private | Admin, Staging, Test pages, Internal tools | noindex,nofollow | Not for public; avoid discovery
Conversion Endpoints | Thank-you, Confirmation, Checkout success, Download gate | noindex,follow | Post-conversion; no SERP value; allow link equity
System & Utility | 404, Internal search results, Faceted/filter URLs | noindex,follow or noindex,nofollow | Thin/duplicate; 404 = error state
Legal | Privacy, Terms, Cookie Policy (optional) | Often noindex,follow | Low-value indexed; reduces clutter
Duplicate & Thin | Printer-friendly, Parameter URLs, Near-duplicate | noindex,follow or canonical | Duplicate content; canonical preferred when possible
Low-Value | Media kit, Feedback board (external), Thin press | noindex or index for brand queries | Case-by-case

noindex,follow vs noindex,nofollow: Use noindex,follow for most cases—excludes from SERP but allows link equity. Use noindex,nofollow only for login (security), staging, or temporary test pages.
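One way to keep these defaults consistent across a codebase is a small lookup keyed by page type. The sketch below covers only a handful of rows. The `PageType` names are hypothetical, the rows that allow two options are resolved to noindex,follow per the rule of thumb above, and the index,follow fallback for unlisted pages is an assumption.

```typescript
// Hypothetical page-type labels; a real site would define its own.
type PageType =
  | "login"
  | "signup"
  | "admin"
  | "staging"
  | "thank-you"
  | "internal-search"
  | "printer-friendly";

// Typical robots values taken from the table; two-option rows use noindex,follow.
const TYPICAL_ROBOTS: Record<PageType, string> = {
  login: "noindex,nofollow",
  signup: "noindex,follow",
  admin: "noindex,nofollow",
  staging: "noindex,nofollow",
  "thank-you": "noindex,follow",
  "internal-search": "noindex,follow",
  "printer-friendly": "noindex,follow",
};

// Pages without a listed type are assumed to be regular, indexable content.
function robotsFor(pageType?: PageType): string {
  return pageType ? TYPICAL_ROBOTS[pageType] : "index,follow";
}

console.log(robotsFor("login"));
console.log(robotsFor("thank-you"));
console.log(robotsFor());
```

Case-by-case categories such as Legal and Low-Value are left out on purpose, since the table does not give them a single default.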

Google Indexing API

Type | Typical use
JobPosting | Job boards
BroadcastEvent | Live platforms

Requirements: enable the Indexing API in a Google Cloud project, create a service account, add that service account as an owner of the property in Search Console, and request more quota if needed (default 200 URLs/day).
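Once those requirements are met, a notification is a single authenticated POST to the `urlNotifications:publish` endpoint. The sketch below builds that request and leaves token handling out: it assumes an OAuth 2.0 access token for the service account with the `https://www.googleapis.com/auth/indexing` scope has already been obtained. The helper names, the injected `send` function and the example URL are illustrative assumptions.

```typescript
type NotificationType = "URL_UPDATED" | "URL_DELETED";

type PublishRequest = {
  endpoint: string;
  init: { method: "POST"; headers: Record<string, string>; body: string };
};

const PUBLISH_ENDPOINT = "https://indexing.googleapis.com/v3/urlNotifications:publish";
const DEFAULT_DAILY_QUOTA = 200; // default stated in the requirements above

// Build the request without sending it, so it can be inspected or logged first.
function buildPublishRequest(url: string, type: NotificationType, accessToken: string): PublishRequest {
  return {
    endpoint: PUBLISH_ENDPOINT,
    init: {
      method: "POST",
      headers: {
        "Content-Type": "application/json",
        Authorization: `Bearer ${accessToken}`,
      },
      body: JSON.stringify({ url, type }),
    },
  };
}

// Keep only as many URLs as the daily quota allows; the rest wait for the next day.
function withinDailyQuota(urls: string[], quota: number = DEFAULT_DAILY_QUOTA): string[] {
  return urls.slice(0, quota);
}

// `send` is injected so the sketch does not depend on a particular HTTP client.
async function publishNotification(
  request: PublishRequest,
  send: (endpoint: string, init: PublishRequest["init"]) => Promise<{ status: number }>,
): Promise<number> {
  const response = await send(request.endpoint, request.init);
  return response.status;
}

const sampleRequest = buildPublishRequest("https://example.com/jobs/123", "URL_UPDATED", "ACCESS_TOKEN");
console.log(sampleRequest.endpoint);
console.log(sampleRequest.init.body);
```

In a runtime with a global `fetch` (for example Node 18 or later), `publishNotification(sampleRequest, fetch)` would send the request; a 200 status indicates the notification was accepted, not that the URL has been indexed.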

Output Format

Related Skills

  • google-search-console: Find and diagnose indexing issues in GSC
  • robots-txt: Path-level crawl control; when to use robots.txt vs noindex; do not block /_next/ or noindex pages
  • page-metadata: Meta robots implementation; noindex vs nofollow
  • xml-sitemap: Submit and maintain sitemap
  • indexnow: Faster indexing for Bing
  • canonical-tag: Resolve duplicate content

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.
