humanize-chinese

Detect and humanize AI-generated Chinese text. 20+ detection categories, weighted 0-100 scoring with sentence-level analysis, 7 style transforms (casual/zhihu/xiaohongshu/wechat/academic/literary/weibo), sentence restructuring, context-aware replacement. Pure Python, no dependencies. v2.0.0

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "humanize-chinese" with this command: npx skills add 0xspeter/humanize-chinese-2-0-0

Humanize Chinese AI Text v2.0

Comprehensive CLI for detecting and transforming Chinese AI-generated text. Makes robotic AI writing natural and human-like.

v2.0 highlights: weighted 0-100 scoring, sentence-level analysis, sentence restructuring (merge/split), context-aware replacement, rhythm variation, vocabulary diversification, 7 style transforms, external pattern config (patterns_cn.json).

Quick Start

# Detect AI patterns (20+ categories, 0-100 score)
python scripts/detect_cn.py text.txt
python scripts/detect_cn.py text.txt -v          # verbose + worst sentences
python scripts/detect_cn.py text.txt -s           # score only
python scripts/detect_cn.py text.txt -j           # JSON output

# Humanize text
python scripts/humanize_cn.py text.txt -o clean.txt
python scripts/humanize_cn.py text.txt --scene social
python scripts/humanize_cn.py text.txt --scene tech -a   # aggressive mode
python scripts/humanize_cn.py text.txt --seed 42         # reproducible

# Apply writing styles
python scripts/style_cn.py text.txt --style zhihu -o zhihu.txt
python scripts/style_cn.py text.txt --style xiaohongshu
python scripts/style_cn.py --list

# Compare before/after
python scripts/compare_cn.py text.txt --scene tech -a
python scripts/compare_cn.py text.txt -o clean.txt

Detection System

Scoring

Weighted 0-100 score with 4 severity levels:

ScoreLevelMeaning
0-24LOWLikely human-written
25-49MEDIUMSome AI signals
50-74HIGHProbably AI-generated
75-100VERY HIGHAlmost certainly AI

Detection Categories

🔴 Critical (weight: 8)

CategoryExamples
Three-Part Structure首先...其次...最后, 一方面...另一方面, 其一...其二...其三
Mechanical Connectors值得注意的是, 综上所述, 不难发现, 归根结底, 由此可见
Empty Grand Words赋能, 闭环, 数字化转型, 协同增效, 全方位, 多维度

🟠 High Signal (weight: 4)

CategoryExamples
AI High-Frequency Words助力, 彰显, 底层逻辑, 抓手, 触达, 沉淀, 复盘
Filler Phrases值得一提的是, 众所周知, 毫无疑问
Balanced Arguments虽然...但是...同时, 既有...也有...更有
Template Sentences随着...的不断发展, 在当今...时代, 作为...的重要组成部分

🟡 Medium Signal (weight: 2)

CategoryExamples
Hedging Language在一定程度上, 某种程度上, 通常情况下 (>5 occurrences)
List AddictionExcessive numbered/bulleted lists
Punctuation OveruseDense em dashes, semicolons
Excessive Rhetoric对偶/排比句过多

⚪ Style Signal (weight: 1.5)

CategoryDescription
Uniform ParagraphsLow CV in paragraph lengths
Low BurstinessMonotonous sentence lengths
Emotional FlatnessLack of emotional/personal expressions
Repetitive StartersSame sentence starters >3 times
Low EntropyLow character-level entropy (predictable text)

Sentence-Level Analysis

With -v (verbose) mode, the detector identifies the most AI-like sentences:

── 最可疑句子 ──
  1. [16分] 随着人工智能技术的不断发展,在当今数字化转型时代...
     原因: 数字化转型, 深度融合, 模板: 随着.*?的(不断)?发展

Humanization Engine

Transforms (applied in order)

  1. Structure cleanup — Remove three-part structure (首先/其次/最后)
  2. Phrase replacement — Context-aware replacement of AI phrases (regex patterns first, then plain text, longest-first matching)
  3. Sentence merge — Merge overly short consecutive sentences
  4. Sentence split — Split long sentences at natural breakpoints (但是/不过/同时)
  5. Punctuation normalization — Reduce excessive semicolons, em dashes
  6. Vocabulary diversification — Replace repeated words (进行/实现/提供 etc.) with synonyms
  7. Paragraph rhythm — Vary uniform paragraph lengths (merge short, split long)
  8. Casual injection — Add human expressions (scene-dependent)
  9. Paragraph shortening — For social/chat scenes

Scenes

SceneCasualnessBest For
general0.3Default, balanced
social0.7Social media, short posts
tech0.3Tech blogs, tutorials
formal0.1Formal articles, reports
chat0.8Conversations, messaging

Aggressive Mode (-a)

Adds +0.3 casualness, more colloquial expressions, stronger sentence restructuring. Typical score reduction: 60-80 points on heavily AI-generated text.

Reproducibility

Use --seed N for reproducible results (same input + seed = same output).


Writing Style Transforms

7 specialized Chinese writing styles:

StyleNameDescription
casual口语化Like chatting with friends — natural, relaxed
zhihu知乎Rational, in-depth, personal opinions
xiaohongshu小红书Enthusiastic, emoji-rich, product-focused
wechat公众号Storytelling, engaging, relatable
academic学术Rigorous, precise, no colloquialisms
literary文艺Poetic, imagery-rich, metaphorical
weibo微博Short, opinionated, shareable

Combine humanize + style

python scripts/humanize_cn.py text.txt --style xiaohongshu -o xhs.txt

This first humanizes (removes AI patterns) then applies the style transform.


External Configuration

All patterns, replacements, and scoring weights are in scripts/patterns_cn.json. Edit this file to:

  • Add new AI vocabulary patterns
  • Customize replacement alternatives
  • Adjust scoring weights per severity
  • Add regex patterns for template detection
  • Set thresholds for hedging language detection

Scripts Reference

detect_cn.py

python scripts/detect_cn.py [file] [-j] [-s] [-v] [--sentences N]
FlagDescription
-jJSON output
-sScore only (e.g. "72/100 (high)")
-vVerbose: show worst sentences
--sentences NNumber of worst sentences to show (default: 5)

humanize_cn.py

python scripts/humanize_cn.py [file] [-o output] [--scene S] [--style S] [-a] [--seed N]
FlagDescription
-oOutput file
--scenegeneral/social/tech/formal/chat
--stylecasual/zhihu/xiaohongshu/wechat/academic/literary/weibo
-aAggressive mode
--seedRandom seed for reproducibility

style_cn.py

python scripts/style_cn.py [file] --style S [-o output] [--seed N] [--list]

compare_cn.py

python scripts/compare_cn.py [file] [-o output] [--scene S] [--style S] [-a]

Shows score diff, category changes, and metric comparison before/after humanization.


Workflow

# 1. Check AI score
python scripts/detect_cn.py document.txt -v

# 2. Humanize with comparison
python scripts/compare_cn.py document.txt --scene tech -a -o clean.txt

# 3. Verify improvement
python scripts/detect_cn.py clean.txt -s

# 4. Optional: apply specific style
python scripts/style_cn.py clean.txt --style zhihu -o final.txt

Batch Processing

# Scan all files
for f in *.txt; do
  echo "=== $f ==="
  python scripts/detect_cn.py "$f" -s
done

# Transform all markdown
for f in *.md; do
  python scripts/humanize_cn.py "$f" --scene tech -a -o "${f%.md}_clean.md"
done

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Research

Fireflies.ai

Fireflies.ai GraphQL API integration with managed OAuth. Access meeting transcripts, summaries, users, contacts, and AI-powered meeting analysis. Use this sk...

Registry SourceRecently Updated
2K3Profile unavailable
Research

Gemini Citation

Conduct evidence-based research with exact, accurate APA citations using the Gemini API's 'scientific citation' (Google Search grounding) feature. Use when X...

Registry SourceRecently Updated
2430Profile unavailable
Research

安全驾驶行为分析工具

Analyzes videos of vehicle drivers to identify unsafe driving behaviors. It generates professional analysis reports to help enhance road safety awareness. |...

Registry SourceRecently Updated
940Profile unavailable
Research

deep-research-surf

Conducts deep, multi-angle research using Surf MCP tools and parallel subagents. Use for deep research, competitive landscape analysis, strategic intelligenc...

Registry SourceRecently Updated
710Profile unavailable