Add AI hybrid generation workflow note

2026-04-04 17:32:59 +07:00
parent 12ec26be5f
commit 08a1352268
1 changed files with 262 additions and 0 deletions
--- a/AI_HYBRID_GENERATION_WORKFLOW.md
+++ b/AI_HYBRID_GENERATION_WORKFLOW.md
@@ -0,0 +1,262 @@
+# AI Hybrid Generation Workflow
+
+## Goal
+
+Allow admins to generate either:
+
+- a single AI question
+- or multiple AI questions in one run
+
+without losing control over the quality of each generated item.
+
+The system should support both precision workflows and exploration workflows.
+
+## Core Principle
+
+Generation request and generated items must be treated as different things.
+
+That means:
+
+1. One admin action creates a **generation run**
+2. One generation run can produce one or many **generated variants**
+3. Each generated variant remains an individually reviewable item
+
+This is the cleanest way to support both single and bulk generation.
+
+## Why This Is Better
+
+Admins do not always have the same intent.
+
+### Precision mode
+
+The admin wants:
+
+- one strong output
+- high control
+- easy review
+
+This is best served by single generation.
+
+### Exploration mode
+
+The admin wants:
+
+- multiple candidates
+- idea exploration
+- later curation
+
+This is best served by bulk generation.
+
+A rigid one-size-fits-all generation flow is worse for both modes.
+
+## Recommended Model
+
+### Parent / Basis Question
+
+The canonical source or promoted basis item.
+
+### Generation Run
+
+Represents one AI request.
+
+Suggested fields:
+
+- parent question id
+- source question version id
+- target difficulty
+- requested count
+- model
+- prompt version
+- created by
+- created at
+- optional operator notes
+
+### Generated Variant
+
+Each output item from the generation run.
+
+Suggested fields:
+
+- generation run id
+- parent question id
+- source version id
+- difficulty
+- status
+- stem
+- options
+- answer
+- explanation
+- review notes
+- reviewer
+- reviewed at
+
+## Required Lifecycle
+
+Each generated item must be individually manageable.
+
+Suggested statuses:
+
+- `draft`
+- `approved`
+- `rejected`
+- `archived`
+- `stale`
+
+This is required even when a run generates many items at once.
+
+## UX Principle
+
+Do not treat bulk output as one indivisible package.
+
+Bulk generation should be:
+
+- one producer action
+- many independently reviewable outputs
+
+This means the admin can:
+
+- approve 2 items
+- reject 1 item
+- archive 1 item
+- regenerate only one item
+
+from the same generation run.
+
+## Recommended Admin UX
+
+Inside the parent question page:
+
+### Generation Form
+
+- target difficulty
+- model
+- count
+- optional notes or style instructions
+- generate button
+
+### Guidance Text
+
+The system should guide, not over-restrict.
+
+Recommended copy:
+
+- “You can generate one or many variants in one run.”
+- “Recommended: 1–3 variants per run for better consistency and easier review.”
+- “Larger runs may reduce cost per item but increase overlap, correlated mistakes, and review effort.”
+
+### Result View
+
+After generation, show each item separately with actions:
+
+- approve
+- reject
+- archive
+- edit
+- regenerate this item
+- compare with parent
+
+## Recommendation vs Restriction
+
+The product should not hard-limit normal admin workflow at very low counts like 2 or 3.
+
+Instead:
+
+- provide recommendation text in the UI
+- allow single and bulk generation
+- preserve admin control
+
+However, the backend should still apply a technical safety ceiling.
+
+Example:
+
+- no UX hard limit at 2 or 3
+- backend safety limit at something like 20 or 50
+
+This is not a workflow restriction. It is abuse and cost protection.
+
+## Recommended Count Guidance
+
+### 1 item
+
+Best for:
+
+- high quality
+- careful review
+- final production-ready generation
+
+### 2–3 items
+
+Best default.
+
+Good balance of:
+
+- cost
+- quality
+- review effort
+
+### 4–8 items
+
+Useful for exploration.
+
+Tradeoff:
+
+- more candidate variety
+- heavier review burden
+- higher chance of repeated structure and correlated errors
+
+### More than 8 items
+
+Should still be allowed if product policy permits, but treated as exploration mode.
+
+The UI should warn that:
+
+- review effort increases
+- quality consistency may drop
+- variants may become repetitive
+
+## Cost and Quality Insight
+
+Bulk generation can reduce cost per item because:
+
+- parent context is sent once
+- prompt overhead is amortized
+
+But quality risk increases because:
+
+- errors can repeat across all outputs in a run
+- structure can become too similar
+- weaker prompts can produce multiple low-quality siblings
+
+So the best product design is:
+
+- permit bulk
+- recommend lower counts
+- review outputs individually
+
+## Recommended Policy
+
+1. Allow both single and bulk generation
+2. Keep generated items reviewable one-by-one
+3. Store lineage through generation run metadata
+4. Provide UI recommendations instead of rigid low hard caps
+5. Enforce only a high backend safety cap
+6. Keep parent question as the operational center of the workflow
+
+## Product Direction
+
+The ideal flow is:
+
+1. Admin opens parent question
+2. Admin chooses difficulty, model, and count
+3. System creates one generation run
+4. System creates one or many generated child variants
+5. Admin reviews each child separately
+6. Admin approves, rejects, archives, or regenerates per item
+
+This gives:
+
+- low friction
+- high control
+- strong auditability
+- better quality governance
+- flexibility for different admin intents