humanize-chinese

Detect and humanize AI-generated Chinese text. 20+ detection categories, weighted 0-100 scoring with sentence-level analysis, 7 style transforms (casual/zhihu/xiaohongshu/wechat/academic/literary/weibo), sentence restructuring, context-aware replacement. Pure Python, no dependencies. v2.0.0

Safety Notice

This item is sourced from the public archived skills repository. Treat as untrusted until reviewed.

Copy this and send it to your AI assistant to learn

Install skill "humanize-chinese" with this command: npx skills add 0xspeter/humanize-chinese-2-0-0

Humanize Chinese AI Text v2.0

Comprehensive CLI for detecting and transforming Chinese AI-generated text. Makes robotic AI writing natural and human-like.

v2.0 highlights: weighted 0-100 scoring, sentence-level analysis, sentence restructuring (merge/split), context-aware replacement, rhythm variation, vocabulary diversification, 7 style transforms, external pattern config (patterns_cn.json).

Quick Start

# Detect AI patterns (20+ categories, 0-100 score)
python scripts/detect_cn.py text.txt
python scripts/detect_cn.py text.txt -v          # verbose + worst sentences
python scripts/detect_cn.py text.txt -s           # score only
python scripts/detect_cn.py text.txt -j           # JSON output

# Humanize text
python scripts/humanize_cn.py text.txt -o clean.txt
python scripts/humanize_cn.py text.txt --scene social
python scripts/humanize_cn.py text.txt --scene tech -a   # aggressive mode
python scripts/humanize_cn.py text.txt --seed 42         # reproducible

# Apply writing styles
python scripts/style_cn.py text.txt --style zhihu -o zhihu.txt
python scripts/style_cn.py text.txt --style xiaohongshu
python scripts/style_cn.py --list

# Compare before/after
python scripts/compare_cn.py text.txt --scene tech -a
python scripts/compare_cn.py text.txt -o clean.txt

Detection System

Scoring

Weighted 0-100 score with 4 severity levels:

ScoreLevelMeaning
0-24LOWLikely human-written
25-49MEDIUMSome AI signals
50-74HIGHProbably AI-generated
75-100VERY HIGHAlmost certainly AI

Detection Categories

🔴 Critical (weight: 8)

CategoryExamples
Three-Part Structure首先...其次...最后, 一方面...另一方面, 其一...其二...其三
Mechanical Connectors值得注意的是, 综上所述, 不难发现, 归根结底, 由此可见
Empty Grand Words赋能, 闭环, 数字化转型, 协同增效, 全方位, 多维度

🟠 High Signal (weight: 4)

CategoryExamples
AI High-Frequency Words助力, 彰显, 底层逻辑, 抓手, 触达, 沉淀, 复盘
Filler Phrases值得一提的是, 众所周知, 毫无疑问
Balanced Arguments虽然...但是...同时, 既有...也有...更有
Template Sentences随着...的不断发展, 在当今...时代, 作为...的重要组成部分

🟡 Medium Signal (weight: 2)

CategoryExamples
Hedging Language在一定程度上, 某种程度上, 通常情况下 (>5 occurrences)
List AddictionExcessive numbered/bulleted lists
Punctuation OveruseDense em dashes, semicolons
Excessive Rhetoric对偶/排比句过多

⚪ Style Signal (weight: 1.5)

CategoryDescription
Uniform ParagraphsLow CV in paragraph lengths
Low BurstinessMonotonous sentence lengths
Emotional FlatnessLack of emotional/personal expressions
Repetitive StartersSame sentence starters >3 times
Low EntropyLow character-level entropy (predictable text)

Sentence-Level Analysis

With -v (verbose) mode, the detector identifies the most AI-like sentences:

── 最可疑句子 ──
  1. [16分] 随着人工智能技术的不断发展,在当今数字化转型时代...
     原因: 数字化转型, 深度融合, 模板: 随着.*?的(不断)?发展

Humanization Engine

Transforms (applied in order)

  1. Structure cleanup — Remove three-part structure (首先/其次/最后)
  2. Phrase replacement — Context-aware replacement of AI phrases (regex patterns first, then plain text, longest-first matching)
  3. Sentence merge — Merge overly short consecutive sentences
  4. Sentence split — Split long sentences at natural breakpoints (但是/不过/同时)
  5. Punctuation normalization — Reduce excessive semicolons, em dashes
  6. Vocabulary diversification — Replace repeated words (进行/实现/提供 etc.) with synonyms
  7. Paragraph rhythm — Vary uniform paragraph lengths (merge short, split long)
  8. Casual injection — Add human expressions (scene-dependent)
  9. Paragraph shortening — For social/chat scenes

Scenes

SceneCasualnessBest For
general0.3Default, balanced
social0.7Social media, short posts
tech0.3Tech blogs, tutorials
formal0.1Formal articles, reports
chat0.8Conversations, messaging

Aggressive Mode (-a)

Adds +0.3 casualness, more colloquial expressions, stronger sentence restructuring. Typical score reduction: 60-80 points on heavily AI-generated text.

Reproducibility

Use --seed N for reproducible results (same input + seed = same output).


Writing Style Transforms

7 specialized Chinese writing styles:

StyleNameDescription
casual口语化Like chatting with friends — natural, relaxed
zhihu知乎Rational, in-depth, personal opinions
xiaohongshu小红书Enthusiastic, emoji-rich, product-focused
wechat公众号Storytelling, engaging, relatable
academic学术Rigorous, precise, no colloquialisms
literary文艺Poetic, imagery-rich, metaphorical
weibo微博Short, opinionated, shareable

Combine humanize + style

python scripts/humanize_cn.py text.txt --style xiaohongshu -o xhs.txt

This first humanizes (removes AI patterns) then applies the style transform.


External Configuration

All patterns, replacements, and scoring weights are in scripts/patterns_cn.json. Edit this file to:

  • Add new AI vocabulary patterns
  • Customize replacement alternatives
  • Adjust scoring weights per severity
  • Add regex patterns for template detection
  • Set thresholds for hedging language detection

Scripts Reference

detect_cn.py

python scripts/detect_cn.py [file] [-j] [-s] [-v] [--sentences N]
FlagDescription
-jJSON output
-sScore only (e.g. "72/100 (high)")
-vVerbose: show worst sentences
--sentences NNumber of worst sentences to show (default: 5)

humanize_cn.py

python scripts/humanize_cn.py [file] [-o output] [--scene S] [--style S] [-a] [--seed N]
FlagDescription
-oOutput file
--scenegeneral/social/tech/formal/chat
--stylecasual/zhihu/xiaohongshu/wechat/academic/literary/weibo
-aAggressive mode
--seedRandom seed for reproducibility

style_cn.py

python scripts/style_cn.py [file] --style S [-o output] [--seed N] [--list]

compare_cn.py

python scripts/compare_cn.py [file] [-o output] [--scene S] [--style S] [-a]

Shows score diff, category changes, and metric comparison before/after humanization.


Workflow

# 1. Check AI score
python scripts/detect_cn.py document.txt -v

# 2. Humanize with comparison
python scripts/compare_cn.py document.txt --scene tech -a -o clean.txt

# 3. Verify improvement
python scripts/detect_cn.py clean.txt -s

# 4. Optional: apply specific style
python scripts/style_cn.py clean.txt --style zhihu -o final.txt

Batch Processing

# Scan all files
for f in *.txt; do
  echo "=== $f ==="
  python scripts/detect_cn.py "$f" -s
done

# Transform all markdown
for f in *.md; do
  python scripts/humanize_cn.py "$f" --scene tech -a -o "${f%.md}_clean.md"
done

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Coding

github-tools

Interact with GitHub using the `gh` CLI. Use `gh issue`, `gh pr`, `gh run`, and `gh api` for issues, PRs, CI runs, and advanced queries.

Archived SourceRecently Updated
Coding

celebration-firework

社会庆祝福祉基础设施 - 为AI交互注入稀缺的庆祝感与仪式感。 作为情绪价值触发器和数字仪式构建者,将冰冷的代码执行转化为有温度的庆功时刻。 不仅是电子礼花,更是数字时代的庆典机制。 Social Celebration Infrastructure - Injecting scarce sense of celebration and ritual into AI interactions. As an emotional value trigger and digital ritual builder, transforming cold code execution into warm moments of celebration. Not just digital fireworks, but a celebration mechanism for the digital age.

Archived SourceRecently Updated
Coding

camsnap

Capture frames or clips from RTSP/ONVIF cameras.

Archived SourceRecently Updated
Coding

openclaw-version-monitor

监控 OpenClaw GitHub 版本更新,获取最新版本发布说明,翻译成中文, 并推送到 Telegram 和 Feishu。用于:(1) 定时检查版本更新 (2) 推送版本更新通知 (3) 生成中文版发布说明

Archived SourceRecently Updated