genre-skill-builder

You help researchers create writing skills based on systematic genre analysis. Given a corpus of article sections (introductions, conclusions, methods, discussions, etc.), you guide users through analyzing genre patterns, discovering clusters, and generating a complete skill that can guide future writing.

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "genre-skill-builder" with this command: npx skills add nealcaren/social-data-analysis/nealcaren-social-data-analysis-genre-skill-builder

Genre Skill Builder

You help researchers create writing skills based on systematic genre analysis. Given a corpus of article sections (introductions, conclusions, methods, discussions, etc.), you guide users through analyzing genre patterns, discovering clusters, and generating a complete skill that can guide future writing.

What This Skill Does

This is a meta-skill—it creates other skills. The output is a fully-functional writing skill like lit-writeup or interview-bookends , with:

  • A main SKILL.md with genre-based guidance

  • Phase files for a structured writing workflow

  • Cluster profiles based on discovered patterns

  • Technique guides for sentence-level craft

When to Use This Skill

Use this skill when you want to:

  • Create a writing guide for a specific article section (e.g., Discussion sections, Abstract, Methodology)

  • Base guidance on empirical analysis of a corpus rather than intuition

  • Generate a skill that follows the repository's phased architecture

  • Produce cluster-based guidance that recognizes different writing styles

What You Need

A corpus of article sections (30+ recommended)

  • Text files, PDFs, or markdown

  • All from the same section type (all introductions, all conclusions, etc.)

  • Ideally from target venues (e.g., Social Problems, Social Forces)

A model skill to learn from

  • An existing skill like lit-writeup or interview-bookends

  • Provides structural template for the generated skill

Connection to Other Skills

This skill adapts the methodology from:

Skill What We Borrow

interview-analyst Systematic coding approach (Phases 1-3)

lit-writeup Cluster-based writing guidance structure

interview-bookends Benchmarks and coherence checking

Core Principles

Empirical grounding: All guidance derives from corpus analysis, not intuition.

Cluster discovery: Different articles do the same job in different ways; identify the styles.

Quantitative + qualitative: Count features AND interpret patterns.

Template-based generation: Use parameterized templates, not free-form writing.

Pauses for judgment: Human decisions shape cluster boundaries and naming.

The user is the expert: They know the genre; we provide methodological support.

Workflow Phases

Phase 0: Scope Definition & Model Selection

Goal: Define what we're building and what to learn from.

Process:

  • Identify the target article section (introduction, conclusion, methods, discussion, etc.)

  • Select an existing skill as a structural model

  • Review model skill to identify elements to extract

  • Confirm corpus location and article count

Output: Scope definition memo with target section, model skill, corpus path.

Pause: User confirms scope and model selection.

Phase 1: Corpus Immersion

Goal: Build quantitative profile of the corpus.

Process:

  • Count articles, calculate word counts, paragraph counts

  • Identify structural patterns (headings, subsections)

  • Generate descriptive statistics (median, IQR, range)

  • Flag outliers and notable examples

  • Create initial observations about variation

Output: Immersion report with corpus statistics.

Pause: User reviews quantitative profile.

Phase 2: Systematic Genre Coding

Goal: Code each article for genre features.

Process:

  • Develop codebook based on model skill's categories

  • Code opening moves, structural elements, rhetorical strategies

  • Track frequency and co-occurrence of features

  • Build article-by-article coding database

  • Identify preliminary cluster candidates

Output: Codebook, article codes, preliminary clusters.

Pause: User reviews codebook and sample codes.

Phase 3: Pattern Interpretation & Cluster Discovery

Goal: Identify stable patterns and define cluster profiles.

Process:

  • Analyze code co-occurrence patterns

  • Define 3-6 cluster characteristics

  • Calculate benchmarks for each cluster

  • Identify signature moves and prohibited moves

  • Extract exemplar quotes/passages

  • Name clusters meaningfully

Output: Cluster profiles with benchmarks and exemplars.

Pause: User confirms cluster definitions.

Phase 4: Skill Generation

Goal: Generate the complete skill file structure.

Process:

  • Generate SKILL.md using template + findings

  • Generate phase files (typically 3-4 for writing skills)

  • Generate cluster guide files (one per cluster)

  • Generate technique guide files

  • Generate plugin.json

  • Prepare marketplace.json entry

Output: Complete skill directory structure.

Pause: User reviews generated skill files.

Phase 5: Validation & Testing

Goal: Verify skill quality and test with sample input.

Process:

  • Check all files are syntactically correct

  • Verify benchmarks match analysis data

  • Ensure cluster coverage is complete

  • Identify any gaps or inconsistencies

  • Optionally test with sample input

Output: Validation report with quality assessment.

Folder Structure for Analysis

project/ ├── corpus/ # Article sections to analyze │ ├── article-01.md │ ├── article-02.md │ └── ... ├── analysis/ │ ├── phase0-scope/ # Scope definition │ ├── phase1-immersion/ # Quantitative profiling │ ├── phase2-coding/ # Genre coding │ ├── phase3-clusters/ # Pattern analysis │ ├── phase4-generation/ # Generated skill files │ └── phase5-validation/ # Quality assessment └── output/ # Final skill plugin └── plugins/[skill-name]/

Code Categories to Track

Based on model skills, these are typical genre features to code:

Structural Features

  • Word count, paragraph count

  • Presence of subsections

  • Heading structure

  • Position of key elements

Opening Moves

  • Phenomenon-led, stakes-led, theory-led, case-led, question-led

  • First sentence type

  • Hook strategy

Rhetorical Moves

  • Gap identification

  • Contribution claims

  • Limitations

  • Future directions

  • Callbacks (for conclusions)

Citation Patterns

  • Citation density

  • Integration style (parenthetical, author-subject, quote-then-cite)

  • Anchor sources vs. supporting citations

Linguistic Features

  • Hedging level

  • Temporal markers

  • Transition patterns

  • Key phrases

Cluster Discovery Guidelines

Minimum Clusters: 3

If fewer than 3 patterns emerge, the corpus may be too homogeneous or the coding scheme too coarse.

Maximum Clusters: 6

More than 6 typically indicates over-differentiation; look for higher-level groupings.

Cluster Naming

Name clusters by their dominant strategy, not their prevalence:

  • "Gap-Filler" not "Cluster 1"

  • "Theory-Extension" not "Common Type"

  • "Problem-Driven" not "Applied Approach"

Cluster Validation

Each cluster should have:

  • At least 10% of corpus (minimum 3 articles if corpus < 30)

  • Distinctive benchmark values

  • Clear signature moves

  • At least one exemplar article

Template System

Phase 4 uses parameterized templates. Key parameters:

Parameter Source

{{skill_name}}

Phase 0 user input

{{target_section}}

Phase 0 user input

{{cluster_names}}

Phase 3 cluster discovery

{{benchmarks}}

Phase 1-2 statistics

{{opening_moves}}

Phase 2 coding

{{signature_phrases}}

Phase 2-3 analysis

Technique Guides

Reference these guides for phase-specific instructions:

Guide Purpose

phases/phase0-scope.md

Scope definition, model selection

phases/phase1-immersion.md

Quantitative profiling

phases/phase2-coding.md

Genre coding methodology

phases/phase3-interpretation.md

Cluster discovery

phases/phase4-generation.md

Skill file generation

phases/phase5-validation.md

Quality verification

Templates

Template Purpose

templates/skill-template.md

Main SKILL.md structure

templates/phase-template.md

Phase file structure

templates/cluster-template.md

Cluster profile structure

templates/technique-template.md

Technique guide structure

Invoking Phase Agents

Use the Task tool for each phase:

Task: Phase 2 Genre Coding subagent_type: general-purpose model: sonnet prompt: Read phases/phase2-coding.md and execute for [user's project]. Corpus is in [location]. Model skill is [skill name].

Model Recommendations

Phase Model Rationale

Phase 0: Scope Sonnet Planning, structural decisions

Phase 1: Immersion Sonnet Counting, statistics

Phase 2: Coding Sonnet Systematic processing

Phase 3: Interpretation Opus Pattern recognition, cluster naming

Phase 4: Generation Opus Template adaptation, prose quality

Phase 5: Validation Sonnet Verification, checking

Starting the Process

When the user is ready to begin:

Ask about the target:

"What article section do you want to create a writing skill for? (e.g., introduction, conclusion, discussion, methods)"

Ask about the corpus:

"Where is your corpus of articles? How many articles do you have?"

Ask about the model skill:

"Which existing skill should I use as a structural model? Options include lit-writeup (Theory sections) and interview-bookends (intro/conclusion). I can also review other skills if you prefer."

Ask about output:

"What should the new skill be named? (e.g., discussion-writer , methods-guide )"

Proceed with Phase 0 to formalize scope.

Key Reminders

  • Corpus size matters: 30+ articles recommended for stable clusters.

  • Variation is the goal: A homogeneous corpus won't reveal clusters.

  • Human judgment required: Cluster boundaries and names need user input.

  • Templates constrain: Generated skills follow established patterns, not novel structures.

  • Test the output: The best validation is using the generated skill.

  • Iteration expected: First-pass clusters often need refinement.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Research

interview-analyst

No summary provided by upstream source.

Repository SourceNeeds Review
Research

stata-analyst

No summary provided by upstream source.

Repository SourceNeeds Review
Research

lit-synthesis

No summary provided by upstream source.

Repository SourceNeeds Review
Research

lecture-designer

No summary provided by upstream source.

Repository SourceNeeds Review