Markdown i18n

Token-efficient translation pipeline. Scripts handle parsing, caching, masking, and validation — you only translate the marked segments.

SKILL_DIR — set this to the absolute path of this skill's root directory (the folder containing this SKILL.md). All node commands below use $SKILL_DIR/scripts/.

Non-negotiable Rules

MUST use scripts only for prepare/checkpoint/merge/apply/quality/TM operations.
MUST NOT use any script, command, tool, or automation to batch-translate file contents or chunk contents.
MUST translate manually in-place: one target file at a time, or one chunk at a time.
MUST NOT read/translate multiple chunks in the same turn/request.
MUST run validation before marking anything done.
MUST NOT mark a file as done without validation results.
MUST NOT mark a file as done when validation reports any ERROR or any WARN unless the user explicitly confirms acceptance.
If validation reports any ERROR or WARN, MUST stop, show the validation output, and wait for user confirmation before plan set ... done.
If Context usage exceeds 70%, MUST use a context cleanup or context compression command before continuing translation work.

Required Order

This skill has one correct execution order. Follow it strictly.

Normal file flow

prepare
manually translate the target file
afterTranslate
if ERROR: stop and fix manually
if WARN: stop and get explicit user confirmation
rerun afterTranslate --allow-warnings only after explicit user confirmation

Chunked file flow

prepare
read exactly one chunk
manually translate exactly that chunk
checkpoint <chunk-file>
repeat Steps 2-4 until all chunks are finished
merge
afterTranslate
if ERROR: stop and fix manually
if WARN: stop and get explicit user confirmation
rerun afterTranslate --allow-warnings only after explicit user confirmation

MUST NOT change this order. MUST NOT skip afterTranslate. MUST NOT continue past ERROR or WARN without following the stop rules above.

Workflow (3 steps)

All scenarios — single file, batch, incremental sync — use the same pipeline.

Step 1: Prepare [MUST]

This step always comes first.

Run the prepare command to generate a skeleton target file:

# Single file
node $SKILL_DIR/scripts/translate-cli.js prepare <source.md> <target.md> --lang <locale>

# Directory (batch / sync)
node $SKILL_DIR/scripts/translate-cli.js prepare <source_dir/> <target_dir/> --lang <locale>

# Seed TM from existing translations (one-time migration)
node $SKILL_DIR/scripts/translate-cli.js seed <source_dir/> <target_dir/> --lang <locale>

What prepare does internally (you do NOT need to do these):

Loads .i18n/no-translate.yaml, .i18n/translation-consistency.yaml, and glossary
Loads Translation Memory (.i18n/<lang>.tm.jsonl)
Parses source markdown into segments via AST
Fills in cached translations from TM
Masks inline code, URLs, variables as %%Pn%% placeholders
Masks fenced code blocks as %%CB_<hash>%% placeholders — hash-based IDs derived from content, so they survive chunk reordering
Marks untranslated segments with  markers
Large files (>3000 chars): auto-splits into chunks in .i18n/chunks/, splitting at segment boundaries
Writes skeleton + task metadata to .i18n/task-meta.json

Review the console output — it shows segments to translate, cached count, and any chunks generated.

Step 2: Translate [MUST]

This step always happens after prepare and before apply.

Open the skeleton file (or chunk files for large documents). For each  section, translate the content between the markers to the target language.

Rules:

Translate ONLY the text between  and  markers
Keep %%Pn%% and %%CB_<hash>%% placeholders exactly as-is — do NOT modify, delete, or reformat them
Do NOT touch segments without markers (they are cached translations)
You do NOT need to remove the markers — apply.js strips them automatically
MUST NOT use any script/tool to batch-generate translations for files or chunks

Example — before:

<!-- i18n:todo -->
See [configuration]%%P2%% for %%P1%% flag details.
<!-- /i18n:todo -->

Example — after:

<!-- i18n:todo -->
参阅[配置]%%P2%%了解 %%P1%% 标志详情。
<!-- /i18n:todo -->

WARNING — Placeholders: %%Pn%% and %%CB_<hash>%% are restored to original content (inline code, URLs, code blocks) by the apply script. If you delete or modify them, the final document will be broken. When in doubt, leave them in place.

Large files (chunked): If prepare generated chunks in .i18n/chunks/, translate one chunk file at a time in order (chunk-001, chunk-002, …).

CRITICAL RULE — exactly one chunk per turn/request:

Read only one chunk file
Translate only that chunk
Save it, run checkpoint, then move to the next chunk
Do NOT open, read, paste, diff, or translate multiple chunk files in the same turn/request
If multiple chunks are loaded together, context contamination is likely and the output becomes unreliable

This rule is mandatory. Chunk translation is designed to be strictly sequential, not batched.

Read only the current chunk, translate all its  sections, save, then checkpoint that chunk into TM before moving to the next one:

node $SKILL_DIR/scripts/translate-cli.js checkpoint <chunk-file>

After all chunks are translated, merge them back:

node $SKILL_DIR/scripts/translate-cli.js merge <target>

MUST NOT run merge before all chunks have been translated and checkpointed. MUST NOT run apply on chunk files.

If chunk files for the same target already exist, prepare now refuses to overwrite them by default. This is intentional to protect in-progress chunk translations. Use --overwrite-chunks only when you really want to discard the old chunk state and regenerate from source.

Step 3: After Translate [MUST]

This step always happens after translation is complete. For chunked files, it happens only after merge.

Run the combined command:

node $SKILL_DIR/scripts/translate-cli.js afterTranslate <source> <target> [--lang <locale>]

What afterTranslate does internally:

Runs apply
Runs quality
If quality is clean, runs plan set ... done
If quality has ERROR, stops
If quality has WARN, stops unless you rerun with explicit user confirmation and --allow-warnings

What apply does internally when called by afterTranslate:

Auto-strips remaining  markers
Auto-fixes common placeholder mangling (spacing, casing)
Restores %%Pn%% → original inline code/URLs, %%CB_<hash>%% → original code blocks
Validates: code block integrity, heading count, link count, variables, frontmatter keys
Updates Translation Memory with new translations
Reports structured results per file

prepare only reuses Translation Memory. If you translate a skeleton and then run prepare again before apply (or seed), the target file can be regenerated from source text again.

If afterTranslate stops on ERROR: fix the target file manually, then re-run afterTranslate. If afterTranslate stops on WARN: get explicit user confirmation first, then re-run with --allow-warnings.

Manual Interfaces

The CLI still keeps the manual interfaces for debugging or step-by-step use:

node $SKILL_DIR/scripts/translate-cli.js apply <source> <target> [--lang <locale>]
node $SKILL_DIR/scripts/quality-cli.js <source> <target> --target-locale <locale>
node $SKILL_DIR/scripts/plan-cli.js set <file_pattern> done

Use the manual commands only when you intentionally need to inspect each step separately.

Standalone quality check (without changing status):

node $SKILL_DIR/scripts/quality-cli.js <source> <target> --target-locale <locale>

This runs all checks including terminology compliance, untranslated content detection, section omission, external/relative link preservation, and frontmatter value translation. See references/quality-checklist.md for the full check list.

CLI Quick Reference

Three CLI tools. All invoked as node $SKILL_DIR/scripts/<cli>.js.

translate-cli.js — Translation Pipeline

T="node $SKILL_DIR/scripts/translate-cli.js"

# Core pipeline
$T prepare <source> <target> --lang <locale>           # Step 1: generate skeleton
$T checkpoint <chunk-file> [--lang <locale>]            # Persist one translated chunk into TM
$T afterTranslate <source> <target> [--lang <locale>]   # Step 3: apply + quality + plan set done
$T merge <target> [--project-dir .]                     # Merge chunks before apply
$T seed <source> <target> --lang <locale>               # Seed TM from existing pairs

# Manual interfaces
$T apply <source> <target> [--lang <locale>]            # Manual apply only

# Translation Memory management
$T tm stats [--lang <locale>]                           # Show TM entry counts
$T tm search <query> --lang <locale> [--limit N]        # Search by source/translated text
$T tm get <cache_key> --lang <locale>                   # Get single entry
$T tm add --lang <locale> --source <text> --translated <text>  # Add entry
$T tm update <cache_key> --lang <locale> --translated <text>   # Update translation
$T tm delete <cache_key> --lang <locale>                # Delete single entry
$T tm delete --file <rel_path> --lang <locale> [--dry-run]     # Delete all entries for a file
$T tm delete --match <query> --lang <locale> [--dry-run]       # Batch delete by text match
$T tm export --lang <locale> [--format jsonl|json]      # Export TM
$T tm compact --lang <locale>                           # Deduplicate & compact TM file

quality-cli.js — Quality Checks

Q="node $SKILL_DIR/scripts/quality-cli.js"

$Q <source> <target> --target-locale <locale>           # Single file check
$Q --dir <source_dir> <target_dir> --target-locale <locale>  # Directory check
$Q <source> <target> --check structure,codeBlocks       # Run specific checks only
$Q <source> <target> --skip terminology                 # Skip specific checks
$Q <source> <target> --json                             # JSON output

Check IDs: structure, codeBlocks, variables, links, terminology, untranslated, sections, frontmatterTranslated

plan-cli.js — Plan Management

P="node $SKILL_DIR/scripts/plan-cli.js"

$P init <source_dir> --lang <locale>                    # Initialize plan + run dir
$P status                                               # Show overall progress
$P list --status pending --sort lines [--limit N]       # Filter/list files
$P set <file_pattern> done [--notes "..."]              # Update single file status
$P set --batch --from pending --to in_progress [--match "gateway/*"]  # Batch update
$P add <file> [--status pending]                        # Add file to plan
$P add --match "gateway/*.md"                           # Add files by glob
$P scan [<target_dir>] [--lang <locale>]                # Re-scan target completeness
$P sync --mode git --from <commit> --to HEAD            # Detect source changes (git)
$P sync --mode hash                                     # Detect source changes (hash)
$P clean [--all] [--keep-plan] [--dry-run]              # Clean temp files

Configuration

`.i18n/no-translate.yaml`

Terms, headings, and sections to keep in the source language:

# Frontmatter keys whose values should be translated
# Default: [title, summary, description, read_when]
frontmatter_translate_keys:
  - title
  - summary
  - description
  - read_when

headings:
  - text: "API Reference"
    reason: "Industry standard term"

terms:
  - text: "Gateway"
    reason: "Product name"

sections:
  - title: "Changelog"
    reason: "Historical record"

`.i18n/translation-consistency.yaml`

Mandatory term translations:

translations:
  install:
    en: Install
    zh: 安装
    ja: インストール
    ko: 설치

Translation Memory

Location: .i18n/<lang>.tm.jsonl

Segment-level cache. On subsequent runs, prepare.js skips segments whose source text hash matches a TM entry and pre-fills the cached translation — only changed or new segments get  markers.

To seed TM from existing translations:

node $SKILL_DIR/scripts/translate-cli.js seed <source_dir> <target_dir> --lang <locale>

Plan Management

Unified CLI for managing translation plans.

node $SKILL_DIR/scripts/plan-cli.js <command> [options]

Lifecycle

PLAN="node $SKILL_DIR/scripts/plan-cli.js"

# 1. Initialize: create run dir, detect changes, scan targets
$PLAN init <source_dir> --lang zh

# 2. Check progress
$PLAN status

# 3. List files to translate (sorted by size)
$PLAN list --status pending --sort lines

# 4. After translating a file, validate it, get user confirmation if any warning exists, then mark it done
$PLAN set <file_pattern> done

# 5. Re-scan targets after batch completion
$PLAN scan

# 6. Clean up temp files when plan is complete
$PLAN clean

Commands

Command	Purpose
`init`	Create run dir, sync source changes, scan targets
`scan`	Scan target files, compute translation completeness
`sync`	Detect source file changes (git diff or hash mode)
`add`	Add files to the plan (single, glob, or file list)
`status`	Show overall progress with completeness metrics
`list`	Filter/sort files by status, size, name
`set`	Update file status (single or batch)
`clean`	Remove temp files (run directories)

Sync modes

# Git mode (default): compare two commits
$PLAN sync --mode git --from abc1234 --to HEAD

# Hash mode: compare file hashes against plan records
$PLAN sync --mode hash

Run directories

Each init creates .i18n/runs/<timestamp>/ for temporary files (task-meta, chunks, manifest). clean removes the current run directory.

beaver-markdown-i18n

Safety Notice

Copy this and send it to your AI assistant to learn