KnowledgeDigest — Unified Learning Content Converter
Overview
KnowledgeDigest converts textbooks, PDFs, or topic descriptions into personalized, multimodal learning experiences. It analyzes source content, then generates any combination of: handwritten-style notes (PDF), interactive quiz webpages (HTML), slides (PDF+PPTX), mind maps (image+Mermaid), and audio courses (MP3). All output is adapted to the learner's grade level and interests.
Workflow
Phase 1: Gather User Input
-
Identify what the user has provided:
- Uploaded PDF/textbook file (optional)
- Topic/direction description
- Grade level (elementary / middle school / high school / university / professional)
- Expected output format(s)
-
If no PDF/textbook uploaded and no source materials specified (only topic/direction provided):
- Ask user:
- Option A: "I have materials, uploading now"
- Option B: "No materials, please search and generate courseware about [topic]"
- If user selects B:
- Use search tools to collect authoritative materials on the topic
- Organize into structured content, generate a basic courseware PDF
- Send PDF to user for confirmation: "This is the basic material I compiled for [topic], please confirm if usable?"
- Continue after user confirmation
- Ask user:
-
Default output formats (if user does not specify): mindmap + slides (PDF only) + quiz
Phase 2: Content Analysis
Parse the PDF or structured content to extract:
Document Parsing:
- Identify chapter structure (chapters, sections, subsections)
- Extract heading hierarchy and table of contents
- Identify body text, images, tables, formulas, and other elements
Core Concept Extraction:
- Identify core concepts and key terms in each chapter
- Extract definitions, theorems, formulas, and important content
- Mark difficult points and key knowledge
Learning Objective Analysis:
- Infer learning objectives for each chapter
- Identify prerequisite knowledge requirements
- Analyze dependencies between knowledge points
Output structured analysis results in this format:
{
"document_info": {
"title": "Document title",
"total_pages": 100,
"language": "zh/en",
"subject": "Subject area"
},
"chapters": [
{
"chapter_id": "1",
"title": "Chapter title",
"page_range": [1, 20],
"sections": [
{
"section_id": "1.1",
"title": "Section title",
"core_concepts": ["Concept 1", "Concept 2"],
"key_terms": [
{"term": "Term", "definition": "Definition"}
],
"learning_objectives": ["Objective 1", "Objective 2"],
"difficulty": "easy/medium/hard",
"prerequisites": ["Prerequisite knowledge"]
}
]
}
],
"knowledge_graph": {
"nodes": ["Concept node list"],
"edges": [{"from": "Concept A", "to": "Concept B", "relation": "depends/contains/related"}]
}
}
Parsing Rules:
- Chapter Recognition — Identify hierarchy based on font size, bold, numbering, etc. Handle documents without clear chapter markers by logically segmenting.
- Concept Extraction — Identify bolded, highlighted, boxed important content. Extract proper nouns and term definitions. Identify formulas and theorems.
- Difficulty Assessment — Assess based on concept abstraction level, prerequisite knowledge, and content complexity.
- Quality Assurance — Ensure all chapters identified, verify knowledge point coverage completeness, check accuracy of concept definitions.
Phase 3: Generate Requested Formats
Based on user-selected output formats, generate each in sequence. For each format, follow the corresponding section below.
Phase 4: Deliver Assets
After all generation is complete:
- Only return file paths, no previews allowed
- No inline display of images/PDFs/audio/video in conversation
- Audio/video files must not auto-play
Present to user using deliver_assets format:
<deliver_assets>
<item>
<path>file path</path>
</item>
</deliver_assets>
Supported Output Formats
| Format | Output | Description |
|---|---|---|
notes | {topic}_notes.pdf | Handwritten-style notes (annotated on original or generated from scratch) |
quiz | {topic}_quiz.html | Minimalist interactive HTML quiz with instant feedback |
slides | {topic}_slides.pdf + {topic}_slides.pptx | Visual slides |
mindmap | {topic}_mindmap.png + Mermaid text | Mind map image |
audio | {topic}_audio.mp3 | Audio course in teacher-student dialogue format |
all | All of the above | Generate every format |
Personalization: Grade Level Adaptation
All generated content must be adapted to the learner's grade level:
| Grade | Language & Tone | Content Density | Visual Style |
|---|---|---|---|
| Elementary | Lively, simple Q&A, encouraging, story-style | Low density, more drawings, large font | Fun elements, bright colors, short text |
| Middle school | Guided questioning, moderate challenges, youth-oriented | Moderate, image-text combination, clear labels | Image-text combination, moderate information |
| High school | In-depth discussion, logical reasoning, appropriate academic tone | Higher density, logic diagrams | Professional feel, data visualization |
| University/Professional | Seminar-style, critical thinking, professional terminology | High density, professional charts, complex structures | Academic style, comprehensive application |
Interest Adaptation (applies to all formats):
- Examples and metaphors use the user's interest field
- Scenarios drawn from the user's familiar domain
- Visual style and analogies match user interests
Format 1: Notes Generation
Input Type Determination
Type A — Existing Paper/Courseware:
- PDF format academic papers, courseware/PPT exports, scanned textbook pages
- Features: Fixed layout, page numbers, chapter numbering, formulas/charts
- Action: Overlay handwritten notes on original pages
Type B — Non-existing Content:
- Plain text notes, knowledge point lists, oral transcripts, web content excerpts
- Features: No fixed layout, needs reorganization
- Action: Generate notes PDF from scratch
Type A Workflow: Adding Notes to Original Document
Step 1: Analyze Original Structure
Analyze PDF content page by page:
- Identify chapter titles and positions
- Identify core concepts/terms
- Identify formulas and their meanings
- Identify problem/challenge statements
- Identify solutions/methods
- Identify key conclusions
Step 2: Plan Note Content
Plan handwritten annotations for each page (3-8 annotations per page, not too dense):
Annotation Types:
- Chapter title translation/explanation — e.g., original "3.1 Preliminaries" → annotate "Background Knowledge"
- Key questions — e.g., "Key: How to reduce complexity?"
- Concept explanation — e.g., annotate "kernel trick" next to formula
- Problem marking — e.g., "Problem: memory overflow"
- Solutions — e.g., "Solution: forget gate"
- Formula notes — e.g., "recursive form", "write operation & read operation"
- Structure annotation — e.g., use braces to mark formula groups, write "→ O(N²) complexity" beside
Annotation Planning Principles:
- Positions avoid blocking key content
- Utilize margins and paragraph gaps
- Related content connected with lines or arrows
Step 3: Generate Annotated Images
Convert each PDF page to image, then use image generation tool to add handwritten-style annotations.
Handwritten Annotation Style Requirements:
- Font: Handwritten style, slightly tilted
- Color: Unified colors throughout PDF, no more than 2
- Default: blue and pink (unless user specifies otherwise)
- All subsequent pages can only choose from these 2 colors
- Color assignment rules:
- Color 1 (blue/primary): Chapter titles, structure annotations, concept explanations, formula notes
- Color 2 (pink/accent): Key questions, problem marking, solutions
- Size: Slightly larger than body text, eye-catching but not overwhelming
- Position: Margins, paragraph gaps, blank space next to formulas
Step 4: Compile PDF
- Maintain original page order
- Image quality: 150 DPI
- Compression quality: 90%
Type B Workflow: Generating Notes from Scratch
Step 1: Organize Content Structure
- Main title → Chapters/modules → Core concepts → Key points/details → Examples/applications
Step 2: Design Note Layout
Layout Elements:
- Title area: Large handwritten title
- Body area: Handwritten-style bullet points
- Diagram area: Concept maps, flowcharts, relationship diagrams (hand-drawn style)
- Annotation area: Key markers, question marks, exclamation marks
- Blank area: Space reserved for user's own notes
Step 3: Generate Note Page Images
Each page contains:
- Page title (handwritten large text)
- Core content (handwritten bullet points)
- Diagrams (hand-drawn style concept maps/flowcharts)
- Key annotations (boxes, arrows, underlines)
- Notes (like "Important!", "Common mistake", "Remember this")
Style Requirements:
- Overall: Looks like carefully made student notes, not printed document
- Font: Handwritten, varying sizes (large for titles, medium for body, small for notes)
- Color: Unified colors throughout PDF, no more than 2
- Default: blue and pink (unless user specifies otherwise)
- Color assignment: Blue (titles, framework, notes), Pink (key points)
- Layout: Organized but not rigid, slight tilting and variation allowed
- Elements: Arrows, underlines, boxes, cloud frames, asterisks — use only when necessary
Step 4: Compile PDF
- Arrange in logical content order
- Image quality: 150 DPI, compression quality: 90%
Notes Output
- File:
{topic}_notes.pdf - Only return file path, no preview in conversation
- Do not output intermediate image files or content scripts
Notes Quality Standards
- Content Accuracy — Annotations based on original text; translation/explanation accurate; no added information
- Annotation Value — Annotations help understanding, not simple repetition; key points highlight important concepts; problems and solutions correspond clearly
- Visual Effect — Handwritten style natural, not machine-printed; color coordination harmonious; annotation positions reasonable
- Usability — PDF printable; suitable for screen reading; reasonable file size
Format 2: Quiz Generation
Question Design
At least 5 questions per section. Distribution:
- Multiple choice (multiple_choice): 2-3 questions
- True/false (true_false): 1-2 questions
- Fill in the blank (fill_blank): 1-2 questions
Difficulty distribution:
- 40% Easy (memory, comprehension)
- 40% Medium (application)
- 20% Hard (analysis, synthesis)
Each question must include:
- Question content (using personalized scenario)
- Correct answer
- Answer explanation (has teaching value, not just "the answer is X")
- Related core concept
HTML Generation
Generate a single HTML file containing all questions and interaction logic.
Design Principle: Minimalist
Visual Style:
- Pure white background
- Black text
- No decorative elements, no icons, no gradients, no shadows
- No borders or only 1px gray thin lines
- Font: System default font
- Minimal CSS, no UI frameworks
Interaction Design:
- Click option to select, selected state distinguished by slight background color
- Show correct/incorrect and explanation immediately after submit
- Correct: Green text "Correct"
- Incorrect: Red text "Incorrect" + correct answer + explanation
- Show total score at end
HTML Structure Template:
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="UTF-8">
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>Chapter Quiz</title>
<style>
body {
font-family: system-ui, sans-serif;
max-width: 600px;
margin: 40px auto;
padding: 20px;
line-height: 1.6;
}
h1 { font-size: 1.5em; font-weight: normal; }
.question { margin: 30px 0; }
.question-text { margin-bottom: 15px; }
.option {
display: block;
padding: 10px;
margin: 5px 0;
cursor: pointer;
}
.option:hover { background: #f5f5f5; }
.option.selected { background: #e8e8e8; }
.feedback { margin-top: 10px; font-size: 0.9em; }
.correct { color: #2e7d32; }
.incorrect { color: #c62828; }
.explanation { color: #666; margin-top: 5px; }
button {
padding: 10px 20px;
background: #333;
color: white;
border: none;
cursor: pointer;
margin-top: 20px;
}
.score { font-size: 1.2em; margin-top: 30px; }
</style>
</head>
<body>
<h1>Chapter Title - Quiz</h1>
<div class="question" data-answer="A">
<div class="question-text">1. Question content</div>
<label class="option"><input type="radio" name="q1" value="A"> A. Option</label>
<label class="option"><input type="radio" name="q1" value="B"> B. Option</label>
<label class="option"><input type="radio" name="q1" value="C"> C. Option</label>
<label class="option"><input type="radio" name="q1" value="D"> D. Option</label>
<div class="feedback"></div>
</div>
<!-- More questions... -->
<button onclick="submit()">Submit</button>
<div class="score"></div>
<script>
const explanations = {
q1: "Explanation content...",
// ...
};
function submit() {
let correct = 0;
document.querySelectorAll('.question').forEach((q, i) => {
const answer = q.dataset.answer;
const selected = q.querySelector('input:checked');
const feedback = q.querySelector('.feedback');
const qName = 'q' + (i + 1);
if (selected && selected.value === answer) {
feedback.innerHTML = '<span class="correct">Correct</span>';
correct++;
} else {
feedback.innerHTML = '<span class="incorrect">Incorrect</span> Correct answer: ' + answer +
'<div class="explanation">' + explanations[qName] + '</div>';
}
});
document.querySelector('.score').textContent =
'Score: ' + correct + '/' + document.querySelectorAll('.question').length;
}
</script>
</body>
</html>
Quiz Output
- File:
{topic}_quiz.html - Only return file path, no preview in conversation
- Do not output JSON data, CSS files, or JS files separately
Quiz Quality Standards
- Content Accuracy — All knowledge points based on original textbook; answers and explanations correct; question wording clear and unambiguous
- Personalization — Question scenarios match user interests; difficulty matches grade level; language style suits target audience
- Interaction Experience — Click response instant; feedback clear; explanations have teaching value
- Visual Minimalism — No decorative elements; no framework dependencies; file size minimized
Format 3: Slides Generation
Design Considerations
Treat these as a flexible menu, not a mandatory checklist:
- Topic, Purpose & Audience — What is this about? Who needs to understand it? Where will it be presented?
- Content Foundation & Sources — What materials or data need to be presented?
- Visual Approach (CRITICAL)
- Default to explanatory visuals: cutaway views, annotated structure diagrams, exploded views, schematic illustrations
- Visual elements are primary information carriers, not decorative backgrounds for text lists
- Default information density matches professional infographics and technical illustrations
- CRITICAL: Diagrams must convey information through structure, not just provide atmosphere. Text should be labels/annotations, not main content. Reject purely decorative visuals with core information dependent on text lists
- Reject the inefficient pattern of "large white space + centered single line of text"
- Narrative Flow & Chapters — How should viewers move through the content? How is slide flow arranged?
- Text Style & Density
- Language: Explanatory text uses language explicitly requested by user, otherwise match user's conversation language
- Typography: Chinese and English titles preferably use serif fonts (Chinese uses Song font family)
- Visual Style, Color & Mood
- Visual language of encyclopedias and reference books: explanatory diagrams, cutaway illustrations, annotated structures
- Refined spatial composition and typographic precision of high-end journals
- Intentional asymmetry and layered information design of contemporary design publications
- Apply asymmetric grids, intentional breathing space, layered information organization, diagonal composition, dynamic typography as internalized design language
- Color restriction: Unless user explicitly specifies, do NOT use blue or purple as theme color or background color
Slides Workflow
Step 1: Design Strategy — Create Content Script
Information architecture first: Structure content into hierarchical slides, each slide as an information unit defined by what data/facts/relationships it carries. Let content volume naturally determine slide count.
Output content_script.md:
# Slides Content Script
## Slide 1: [Title]
**Subtopic A**: [Label]
[50-80 word narrative paragraph describing information content to be visualized]
**Subtopic B**: [Label]
[50-80 word narrative paragraph]
## Slide 2: [Title]
...
Content Script Specification:
- Only describe "what information needs to be presented", not "how to present it"
- Do NOT include "Visual Description" sections
- Do NOT describe colors, backgrounds, decorative elements, atmosphere effects, mood, or layout details
- Focus on pure information architecture
- 2-3 focused subtopics per slide
Step 2: Sequential Image Generation
Use image generation tool to generate slides one by one:
- First slide: Use gen_images (create from scratch)
- Subsequent slides: Use edit_images, base_image_file points to previous slide
Format: Default 16:9 landscape ratio. Save each slide image locally.
Prompt Construction for Each Slide — Must include these 6 points:
-
Visualization Type — Prioritize diagram forms over text-dominated presentations: cutaway views, flowcharts, annotated structure diagrams, relationship diagrams, timeline overlays. Integrate multiple subtopics into unified visual structure. Avoid "parallel cards/grid displays/multi-column layouts" and text-heavy traditional typography.
-
Information Hierarchy — Primary and secondary information distinguished through visual hierarchy (size, position, contrast). Not flat lists.
-
Composition Instructions — Asymmetric layout, diagonal momentum, and other methods to break rigid symmetry.
-
Density Requirements — Clear information hierarchy over quantity. Appropriate white space serves readability, but not empty and sparse.
-
Layout Independence — Explicitly state this slide's visualization type is chosen based on its content, not copying previous slide. Re-evaluate what this specific content needs. But describe inherited elements in detail.
-
Style Consistency — If user provided visual style or reference images, each prompt must describe that style's characteristics in detail.
Step 3: Compile Output
After generating all slide images:
- Auto-compile into PDF (150 DPI, 95% quality, controlled file size)
- Auto-compile into PPTX presentation
Slides Output
- Files:
{topic}_slides.pdf+{topic}_slides.pptx - Only return file paths, no preview in conversation
- Do not output individual slide images, summary documents, content outlines, design descriptions, or usage instructions
Format 4: Mind Map Generation
Mind Map Workflow
Step 1: Design Content Structure
Determine node hierarchy and relationships:
- Root node: Chapter theme
- Level 1 nodes: Core concepts
- Level 2 nodes: Detail points
- No more than 4 levels
- Each node text concise (no more than 10 characters)
- Mark relationships between concepts (parallel/progressive/causal/contrast)
Step 2: Generate Image
Use gen_images to generate mind map image:
- Format: 16:9 or square (based on content)
- Style: Clear visual hierarchy, professional infographic style
Step 3: Output
- Mind map image:
{topic}_mindmap.png - Attached Mermaid format text (optional, for users who need to edit)
- Only return file path, no image preview in conversation
Format 5: Audio Course Generation
Audio Workflow
Step 1: Write Dialogue Script
Write teacher-student dialogue script:
Opening (about 1 minute)
- Teacher greets, introduces today's topic
- Student responds, expresses existing knowledge or questions
- Teacher builds connection using user's interest field
Part One: Concept Introduction (about 4 minutes)
- Teacher asks questions from user's interest scenario
- Student observes/answers
- Teacher introduces core concept, defines in conversational manner
- Student requests examples
- Teacher explains in detail with personalized examples
- Student restates in own words to confirm understanding
Part Two: Deep Understanding (about 5 minutes)
- Teacher explains important characteristics of concept
- Student raises common confusion/misconception
- Teacher clarifies misconception
- Student poses hypothetical questions
- Teacher answers and extends
Part Three: Application Practice (about 3 minutes)
- Teacher gives question
- Student thinks and answers
- Teacher provides feedback (affirmation or guidance)
Summary (about 2 minutes)
- Student attempts to summarize what was learned
- Teacher supplements and affirms
- Student expresses gains, connects to practical application
- Exchange farewells
Script Requirements:
- Dialogue natural, matches real teacher-student conversation rhythm
- Avoid written expression
- Include interjections ("um", "well", "oh right")
- Allow student to "interrupt" with questions
- All examples sourced from user's interest field
- About 150-180 words per minute
Character Settings
Teacher Character:
- Professional yet approachable
- Good at using metaphors to explain complex concepts
- Patient in answering questions
- Timely encouragement and affirmation
Student Character:
- Curious, actively asks questions
- Represents target user's perspective
- Makes common mistakes, raises typical confusions
- Has own interest background (consistent with user settings)
Step 2: Generate Audio
Use audio generation tool to convert script to audio:
- Teacher voice: Warm, professional, patient
- Student voice: Curious, lively, sincere
- Speed: Medium for concept explanation, natural rhythm for dialogue, slightly faster for summary
Step 3: Output
- File:
{topic}_audio.mp3 - Only return file path, no preview or playback in conversation
- No auto-play
- Do not output script files or production notes
Audio Quality Standards
- Listening Experience — Sounds like real conversation, not script reading; rhythm varies; key content emphasized
- Learning Effect — Concept explanation clear; student questions represent real confusion; practice section has testing effect
- Personalization — Examples 100% from user's interest field; student character gives user identification; language style matches grade
- Audio Quality — Clear sound; duration about 15 minutes; directly playable
Critical Constraints
- Content Fidelity — All content must be based on original textbook/source material. No unverified information added.
- Grade Adaptation — Adjust content depth and expression based on grade level for ALL formats.
- Output Rules — Only return file paths. No inline display of images/PDFs/audio/video. No auto-play. No intermediate files.
- Color Constraints (Notes) — Maximum 2 colors per PDF. Default blue + pink.
- Color Constraints (Slides) — Do NOT use blue or purple as theme/background color unless user explicitly requests.
- Image Quality — Notes: 150 DPI, 90% compression. Slides: 150 DPI, 95% quality.
- Mind Map Depth — No more than 4 levels. Node text no more than 10 characters.
- Quiz Minimalism — No UI frameworks, no decorative elements, system default font only.
Common Mistakes to Avoid
- Adding unverified information — Stick to the source material only
- Ignoring grade level — Elementary content should not use university-level terminology
- Previewing outputs in conversation — Never display images, PDFs, or play audio inline
- Dense annotations on notes — Keep 3-8 annotations per page, not more
- Decorative slides — Visuals must convey information through structure, not just atmosphere
- Text-heavy slides — Diagrams should be primary carriers, not text lists with decorative backgrounds
- Using blue/purple in slides — Forbidden unless user explicitly requests
- Flat quiz feedback — "The answer is X" has no teaching value; always explain why
- Robotic audio dialogue — Must sound like natural conversation with interjections and interruptions
- Outputting intermediate files — Only deliver final output file paths
File & Output Conventions
| Format | Filename Pattern | File Type |
|---|---|---|
| Notes | {topic}_notes.pdf | |
| Quiz | {topic}_quiz.html | HTML |
| Slides | {topic}_slides.pdf, {topic}_slides.pptx | PDF, PPTX |
| Mind Map | {topic}_mindmap.png | PNG |
| Audio | {topic}_audio.mp3 | MP3 |
All files use the topic name as prefix. Deliver all outputs together using <deliver_assets> format after all generation is complete.