
WP Agentic Writer: Model Selection & Preset Packs

Executive Summary

This document defines 3 curated model packs for WP Agentic Writer on OpenRouter, optimized for different user budgets and quality requirements. Each pack includes models for 6 tasks: chat, clarity checking, planning, writing, refinement, and image generation.

Key principle: Users bring their own OpenRouter API key. The plugin ships with sensible presets, so users just pick "Budget", "Balanced", or "Premium"; no model switching is needed unless they want to customize.

Table of Contents

Model Recommendation Strategy
Preset Pack 1: Budget
Preset Pack 2: Balanced (Recommended)
Preset Pack 3: Premium
Cost Estimation Guide
When to Use Each Pack
Implementation Config

Model Recommendation Strategy

Task-by-Task Rationale

1. Chat (Discussion, Recommendation, Research)

What it does: A multi-turn conversation in which the user discusses the topic, asks questions, and researches ideas before committing to writing.

Quality metrics:

Context understanding
Iterative reasoning
Long-context support (for multi-message research threads)
Cost per token (since users may have many back-and-forth turns)

Recommendation by tier:

Tier	Model	Reason
Budget	DeepSeek V3.x	Strong reasoning, excellent value pricing (~$0.55/1M input tokens)
Balanced	Gemini 3 Flash Preview	Built for multi-turn agentic workflows, 1M context window, cheaper than Pro
Premium	Gemini 3 Flash Preview or GPT-5.2 Chat	Flash for cost savings on research; GPT-5.2 if user wants single OpenAI vendor

2. Clarity Check (Prompt QA + Quiz Generation)

What it does: Analyzes the user's article topic/prompt for ambiguity, generates clarifying questions, suggests research gaps, and optionally generates a self-assessment quiz.

Quality metrics:

Meta-reasoning (ability to critique its own instructions)
Quiz/checklist generation quality
Cost per query (typically short, one-off)

Recommendation by tier:

Tier	Model	Reason
Budget	DeepSeek V3.x	Good at structured reasoning, checklist generation; very cheap
Balanced	Gemini 3 Flash Preview	Excellent at prompt analysis and quiz generation; fast feedback loop
Premium	Claude Sonnet 4	Nuanced feedback; exceptional at Socratic question generation

3. Planning (Article Outline Generation)

What it does: Takes finalized topic + research notes → generates structured article outline (sections, subsections, key points) as JSON or markdown.

Quality metrics:

Structured output (JSON/markdown reliability)
Long-context input (researched notes, competitor articles, etc.)
Cost (ideally one-off, but might regenerate)
Speed (user should see outline quickly)

Recommendation by tier:

Tier	Model	Reason
Budget	Gemini 3 Flash Preview	1M context window, fast, cheap, excellent JSON output
Balanced	Gemini 3 Flash Preview	Same: primary "thinking" engine; doesn't need premium pricing
Premium	Gemini 3 Flash Preview	Same: planning quality doesn't scale with cost, so Flash remains the optimal choice

4. Writing (Article Draft Generation)

What it does: Transforms outline + research notes → full article draft (2–5k words), with proper tone, code examples if relevant, and flow.

Quality metrics:

Long-form coherence (2–5k words)
Tone consistency (match blog voice)
Code + explanation blending (if dev/tech topic)
Cost per article (this is the "heavy lift")

Recommendation by tier:

Tier	Model	Reason
Budget	Mistral Small	Fast, cheap (~$0.14/1M input); acceptable first drafts for dev blogs; editable output
Balanced	Claude Sonnet 3.5 or 4	Industry standard for long-form; strong at code blocks + prose blend; great value/quality ratio
Premium	GPT-5.2 or Claude Opus 4.5	Frontier models; superior narrative flow, voice consistency, subtle nuance across 2–5k words

5. Refinement (Paragraph/Section Edits)

What it does: User selects 1–3 paragraphs → asks AI to rewrite, expand, shorten, simplify, or adjust tone.

Quality metrics:

Precision editing (preserve surrounding context)
Tone control (match existing prose)
Cost efficiency (small rewrites should be cheap)

Recommendation by tier:

Tier	Model	Reason
Budget	DeepSeek V3.x	Cheap, capable at local edits; good enough for "shorten this" / "make beginner-friendly"
Balanced	Claude Sonnet 3.5 or 4	Same model as writing phase for consistency; strong at nuanced rewrites
Premium	GPT-5.2 or Claude Opus 4.5	Same frontier writer for final polish; maintains voice across refinements

6. Image Generation

What it does: Generates 1–4 hero or inline images per article based on outline + user direction.

Quality metrics:

Visual quality (coherence, aesthetic fit for blog)
Prompt adherence (matches user's description)
Cost per image (users may want multiple attempts)
Speed (not blocking)

Recommendation by tier:

Tier	Model	Reason
Budget	FLUX.2 [klein] 4B	Optimized for cost (~$0.014 USD/MP base); acceptable for blog illustrations
Balanced	Riverflow V2 Max or FLUX.2 Pro	Higher visual quality; flat ~$0.03–0.04 USD per image; good for professional blogs
Premium	FLUX.2 [max]	Frontier image quality; best prompt adherence; hero/marketing images (~$0.07 USD/MP base)

Preset Pack 1: Budget

Target User

Indie dev blogger or beginner content creator
Cost-sensitive; prioritizes shipping over perfection
Acceptable output: readable first drafts, simple blog images
Typical use: 2–3 articles/month

Complete Model Pack

Task	Model	Provider	Rationale
Chat	DeepSeek V3.x	OpenRouter	Powerful reasoning, 1/10th the cost of GPT-4.5
Clarity	DeepSeek V3.x	OpenRouter	Meta-reasoning for prompt analysis
Planning	Gemini 3 Flash Preview	OpenRouter	1M context, fast outlining, dirt cheap
Writing	Mistral Small	OpenRouter	Budget-friendly long-form; acceptable for drafts
Refinement	DeepSeek V3.x	OpenRouter	Cost-efficient edits; reuse for multiple refinements
Image	FLUX.2 [klein] 4B	OpenRouter (Black Forest Labs)	Optimized for cost; good enough for blog headers

Cost Breakdown (Per 2,500-word Article + 3 Images)

Text costs (tokens):

Typical usage:

Chat phase: ~3,000 input + 500 output tokens (discussion)
Clarity: ~1,000 input + 300 output tokens (prompt analysis)
Planning: ~2,000 input + 800 output tokens (outline)
Writing: ~4,000 input + 2,500 output tokens (draft generation)
Refinement: ~1,000 input + 400 output tokens (one round of edits)
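
The per-task figures derive from token counts multiplied by per-million-token prices. A minimal sketch of that arithmetic follows. The function name is hypothetical, the $0.55 input price is the approximate DeepSeek rate cited earlier, and the $1.00 output price is a placeholder assumption, so the result is illustrative and will not match the table that follows.

```php
<?php
// Sketch: cost of one task from token counts and per-million-token prices.
// The output price used below is a HYPOTHETICAL placeholder, not a quoted rate.
function estimate_task_cost(int $input_tokens, int $output_tokens, float $input_price_per_m, float $output_price_per_m): float
{
    $input_cost  = $input_tokens / 1000000 * $input_price_per_m;
    $output_cost = $output_tokens / 1000000 * $output_price_per_m;
    return round($input_cost + $output_cost, 6);
}

// Chat phase: 3,000 input + 500 output tokens.
// 0.55 = approximate input price cited in the rationale; 1.00 = placeholder output price.
$chat_cost = estimate_task_cost(3000, 500, 0.55, 1.00);
printf('Chat phase: $%.5f' . PHP_EOL, $chat_cost);
```

Swapping in the live input and output prices from the OpenRouter model page would turn this into the real per-task estimate.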

Task	Input Tokens	Output Tokens	Cost (USD)
Chat (DeepSeek)	3,000	500	$0.0019
Clarity (DeepSeek)	1,000	300	$0.0007
Planning (Flash)	2,000	800	$0.0009
Writing (Mistral Small)	4,000	2,500	$0.0090
Refinement (DeepSeek)	1,000	400	$0.0008
Text subtotal			$0.0133

Image costs:

3 images × ~1 MP each via FLUX.2 klein
First MP per image: $0.014
Subsequent MP per image: $0.001
3 images × $0.014 ≈ $0.042
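
The tiered pricing above (one rate for the first megapixel, a lower rate for each further one) can be sketched as a small helper. The function name is hypothetical, and rounding partial megapixels up is an assumption, since the source does not say how fractions are billed.

```php
<?php
// Sketch: tiered per-megapixel pricing (first MP at one rate, each further MP at another).
// ASSUMPTION: partial megapixels are rounded up to the next whole megapixel.
function estimate_image_cost(float $megapixels, float $first_mp_price, float $extra_mp_price): float
{
    $billable_mp = max(1, (int) ceil($megapixels));
    return $first_mp_price + ($billable_mp - 1) * $extra_mp_price;
}

// Three 1 MP images at the FLUX.2 klein rates listed above.
$per_image   = estimate_image_cost(1.0, 0.014, 0.001);
$image_total = round(3 * $per_image, 4);
printf('3 images: $%.3f' . PHP_EOL, $image_total);
```

Under the same assumption, a 2 MP image would cost $0.014 + $0.001 = $0.015.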

OpenRouter platform fee:

5.5% of subtotal ≈ 5.5% × ($0.0133 + $0.0420) ≈ $0.0030

Category	Cost (USD)
Text (all tasks)	$0.0133
Images (3 × 1 MP)	$0.0420
OpenRouter platform fee (5.5%)	$0.0030
Total/Article	$0.0583

💰 Budget pack = ~$0.06 USD/article (or ~$0.09–0.12 USD if the user refines/regenerates once, applying the ×1.5–2.0 regeneration factor)

Preset Pack 2: Balanced (Recommended)

Target User

Active dev/content creator or small agency
Shipping 4–10 articles/month
Quality matters; willing to pay for strong writing
Acceptable output: polished, professional prose; nice images
Balance: cost-efficient without sacrificing quality

Complete Model Pack

Task	Model	Provider	Rationale
Chat	Gemini 3 Flash Preview	OpenRouter	Multi-turn agentic chat, 1M context, research-ready
Clarity	Gemini 3 Flash Preview	OpenRouter	Same engine; great at prompt analysis, quiz generation
Planning	Gemini 3 Flash Preview	OpenRouter	Primary "thinking" engine; excellent JSON outline generation
Writing	Claude Sonnet 3.5 or 4	OpenRouter	Industry standard long-form; strong code + prose blend
Refinement	Claude Sonnet 3.5 or 4	OpenRouter	Same as writing; consistency in rewrites and tone
Image	Riverflow V2 Max or FLUX.2 Pro	OpenRouter	High visual quality; flat ~$0.03–0.04 USD per image

Cost Breakdown (Per 2,500-word Article + 3 Images)

Text costs (tokens):

Task	Input Tokens	Output Tokens	Cost (USD)
Chat (Flash)	3,000	500	$0.0015
Clarity (Flash)	1,000	300	$0.0005
Planning (Flash)	2,000	800	$0.0008
Writing (Claude Sonnet)	4,000	2,500	$0.0300
Refinement (Claude Sonnet)	1,000	400	$0.0060
Text subtotal			$0.0388

Image costs:

3 images × Riverflow V2 Max (flat pricing ~$0.03 per image)
3 images × $0.03 ≈ $0.0900

OpenRouter platform fee:

5.5% × ($0.0388 + $0.0900) ≈ $0.0071

Category	Cost (USD)
Text (all tasks)	$0.0388
Images (3 × flat rate)	$0.0900
OpenRouter platform fee (5.5%)	$0.0071
Total/Article	$0.1359
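
The breakdown above is text plus images, with the platform fee applied to their sum. As a rough sketch, the same calculation can be written as a helper that returns the four figures under the key names used in the preset config's `cost_per_article` object. The function name is hypothetical; the 5.5% rate is the one used throughout this document.

```php
<?php
// Sketch: per-article total = text + images + platform fee on both.
function estimate_article_total(float $text_cost, float $image_cost, float $fee_rate = 0.055): array
{
    $subtotal = $text_cost + $image_cost;
    $fee      = $subtotal * $fee_rate;
    return [
        'text'         => round($text_cost, 4),
        'images'       => round($image_cost, 4),
        'platform_fee' => round($fee, 4),
        'total'        => round($subtotal + $fee, 4),
    ];
}

// Balanced pack inputs from the tables above.
$balanced = estimate_article_total(0.0388, 0.0900);
printf('Fee: $%.4f, total: $%.4f' . PHP_EOL, $balanced['platform_fee'], $balanced['total']);
```

Running the other packs through the same helper is a quick way to confirm that each table's fee and total agree with its subtotals.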

💰 Balanced pack = ~$0.14 USD/article (or ~$0.25–0.35 USD with one round of refinement/regen)

Preset Pack 3: Premium

Target User

Content agencies or full-time creators
Publishing 15+ articles/month or selling content
Quality is non-negotiable (flagship posts, sales pages, thought leadership)
Acceptable output: publication-ready prose; hero images
Budget: cost is secondary to impact

Complete Model Pack

Task	Model	Provider	Rationale
Chat	Gemini 3 Flash Preview or GPT-5.2 Chat	OpenRouter	Flash for efficient research; GPT-5.2 if user wants single OpenAI vendor
Clarity	Claude Sonnet 4	OpenRouter	Exceptional at nuanced prompt feedback and Socratic questioning
Planning	Gemini 3 Flash Preview	OpenRouter	Long-context planner; cost doesn't improve quality for outlining
Writing	GPT-5.2 or Claude Opus 4.5	OpenRouter	Frontier long-form quality; superior narrative flow, voice, nuance
Refinement	GPT-5.2 or Claude Opus 4.5	OpenRouter	Same frontier writer; final editorial polish
Image	FLUX.2 [max]	OpenRouter (Black Forest Labs)	Top-tier image quality; best prompt following; hero/marketing grade

Cost Breakdown (Per 2,500-word Article + 3 Images)

Text costs (tokens):

Task	Input Tokens	Output Tokens	Cost (USD)
Chat (Flash)	3,000	500	$0.0015
Clarity (Sonnet 4)	1,000	300	$0.0015
Planning (Flash)	2,000	800	$0.0008
Writing (GPT-5.2 or Opus)	4,000	2,500	$0.0700
Refinement (GPT-5.2 or Opus)	1,000	400	$0.0140
Text subtotal			$0.0878

Image costs:

3 images × ~1 MP each via FLUX.2 [max]
First MP per image: $0.07
Subsequent MP per image: $0.03
3 images × $0.07 ≈ $0.2100

OpenRouter platform fee:

5.5% × ($0.0878 + $0.2100) ≈ $0.0164

Category	Cost (USD)
Text (all tasks)	$0.0878
Images (3 × max quality)	$0.2100
OpenRouter platform fee (5.5%)	$0.0164
Total/Article	$0.3142

💰 Premium pack = ~$0.31 USD/article (or ~$0.50–0.75 USD with multiple refinement passes or image regen)

Cost Estimation Guide

How Users Can Calculate Their Needs

Use this framework to estimate monthly costs:

Monthly AI Cost = (Cost/Article) × (Articles/Month) × (Regenerations Factor)

Step 1: Pick your tier and note cost/article

Budget: $0.06
Balanced: $0.14
Premium: $0.31

Step 2: Estimate articles per month

Blogger: 2–4 articles/month
Small content team: 5–10 articles/month
Agency: 15–30 articles/month

Step 3: Apply regeneration factor

First drafts only (happy with output): ×1.0
One round of refinement/regeneration: ×1.5–2.0
Heavy iteration (multiple regen cycles): ×2.5–3.0

Example calculations:

Profile	Tier	Articles/mo	Regens	Monthly Cost
Solo dev blogger	Budget	2	×1.5	$0.06 × 2 × 1.5 = $0.18
Content team	Balanced	8	×2.0	$0.14 × 8 × 2.0 = $2.24
Agency (flagship posts)	Premium	12	×2.5	$0.31 × 12 × 2.5 = $9.30
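
The monthly formula and the three example rows can be reproduced with a few lines. This is only a sketch; the function name is hypothetical and the inputs are the per-article costs and factors from the table.

```php
<?php
// Sketch of the monthly formula: cost/article x articles/month x regeneration factor.
function estimate_monthly_cost(float $cost_per_article, int $articles_per_month, float $regen_factor = 1.0): float
{
    return round($cost_per_article * $articles_per_month * $regen_factor, 2);
}

// The three example profiles from the table above.
$solo   = estimate_monthly_cost(0.06, 2, 1.5);
$team   = estimate_monthly_cost(0.14, 8, 2.0);
$agency = estimate_monthly_cost(0.31, 12, 2.5);
printf('Solo: $%.2f, team: $%.2f, agency: $%.2f' . PHP_EOL, $solo, $team, $agency);
```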

Important Notes

Token counts are estimates. Actual usage depends on:
- Outline complexity
- Number of research notes
- Image resolution and complexity
- Iteration/refinement cycles
OpenRouter base price + 5.5% platform fee is included in all estimates above.
No hidden costs. Users only pay for what they use. If they skip image generation or skip refinement, cost drops.
Image costs scale with resolution. If user requests higher resolution (>1 MP per image), multiply image cost accordingly.

When to Use Each Pack

Budget Pack: Best For

✅ Use when:

First time using AI writing; want to test the plugin
Publishing 1–3 articles/month
Topic: dev blogs, quick tutorials (where first drafts are acceptable)
Budget: <$5/month

❌ Not ideal for:

Sales pages or high-stakes content
Audiences expecting polish
Topics requiring heavy editing

Workflow expectation: User accepts 1–2 refinement cycles before publishing.

Balanced Pack: Best For (RECOMMENDED DEFAULT)

✅ Use when:

Regular blogging (4–10 articles/month)
Mixed content: tutorials, reviews, opinion pieces
Publishing to professional blog or portfolio
Budget: $5–20/month

✅ Default recommendation because:

Gemini Flash is the best pure planner on the market (cost doesn't improve planning)
Claude Sonnet is the industry-standard long-form writer
Cost:quality ratio is unbeatable
Users can hit "publish" with minimal editing

❌ Not ideal for:

One-off flagship posts (Premium is worth it)
Micro-budget users (use Budget instead)

Workflow expectation: User does one refinement cycle; publishes with high confidence.

Premium Pack: Best For

✅ Use when:

Publishing flagship posts, thought leadership, or sales content
Publishing 10+ articles/month (agency/professional creator)
Audiences/stakeholders expect flawless prose
Images need to be hero/standout quality
Budget: $20–100+/month

✅ Worth the cost because:

GPT-5.2 or Opus produce superior long-form narrative
Superior voice consistency across 2–5k words
FLUX.2 [max] images are publication-ready
Minimal editing required

❌ Overkill for:

Quick dev blogs or tutorials
Solo bloggers publishing <5/month

Workflow expectation: User does light editing (if any); publishes immediately.

Implementation Config

JSON Schema for Plugin Settings

Save presets as JSON config so users can swap or customize:

{
  "presets": {
    "budget": {
      "name": "Budget: DeepSeek + Flash + Mistral + FLUX.2 klein",
      "description": "Super affordable ($0.06/article). Great for testing or budget-conscious bloggers.",
      "models": {
        "chat": {
          "model": "deepseek-v3",
          "provider": "openrouter",
          "description": "DeepSeek V3: Fast, cheap, great reasoning"
        },
        "clarity": {
          "model": "deepseek-v3",
          "provider": "openrouter",
          "description": "DeepSeek V3: Meta-reasoning for prompt analysis"
        },
        "planning": {
          "model": "google/gemini-3-flash-preview",
          "provider": "openrouter",
          "description": "Gemini 3 Flash: 1M context, fast outlining"
        },
        "writing": {
          "model": "mistral/mistral-small",
          "provider": "openrouter",
          "description": "Mistral Small: Budget-friendly long-form"
        },
        "refinement": {
          "model": "deepseek-v3",
          "provider": "openrouter",
          "description": "DeepSeek V3: Cost-efficient paragraph edits"
        },
        "image": {
          "model": "black-forest-labs/flux.2-klein",
          "provider": "openrouter",
          "description": "FLUX.2 klein: Optimized for cost"
        }
      },
      "cost_per_article": {
        "text": 0.0133,
        "images": 0.0420,
        "platform_fee": 0.0030,
        "total": 0.0583,
        "currency": "USD"
      }
    },
    "balanced": {
      "name": "Balanced (RECOMMENDED): Gemini Flash + Claude Sonnet + Riverflow",
      "description": "Professional quality ($0.14/article). Default for most creators.",
      "models": {
        "chat": {
          "model": "google/gemini-3-flash-preview",
          "provider": "openrouter",
          "description": "Gemini 3 Flash: Multi-turn agentic chat, 1M context"
        },
        "clarity": {
          "model": "google/gemini-3-flash-preview",
          "provider": "openrouter",
          "description": "Gemini 3 Flash: Excellent at prompt analysis"
        },
        "planning": {
          "model": "google/gemini-3-flash-preview",
          "provider": "openrouter",
          "description": "Gemini 3 Flash: Primary thinking engine"
        },
        "writing": {
          "model": "anthropic/claude-3.5-sonnet",
          "provider": "openrouter",
          "description": "Claude Sonnet: Industry standard long-form"
        },
        "refinement": {
          "model": "anthropic/claude-3.5-sonnet",
          "provider": "openrouter",
          "description": "Claude Sonnet: Consistent rewrites"
        },
        "image": {
          "model": "sourceful/riverflow-v2-max",
          "provider": "openrouter",
          "description": "Riverflow V2 Max: High-quality images"
        }
      },
      "cost_per_article": {
        "text": 0.0388,
        "images": 0.0900,
        "platform_fee": 0.0071,
        "total": 0.1359,
        "currency": "USD"
      }
    },
    "premium": {
      "name": "Premium: GPT-5.2/Opus + Gemini Flash + FLUX.2 max",
      "description": "Flagship quality ($0.31/article). For agencies and thought leaders.",
      "models": {
        "chat": {
          "model": "google/gemini-3-flash-preview",
          "provider": "openrouter",
          "description": "Gemini 3 Flash: Efficient research"
        },
        "clarity": {
          "model": "anthropic/claude-sonnet-4",
          "provider": "openrouter",
          "description": "Claude Sonnet 4: Exceptional feedback"
        },
        "planning": {
          "model": "google/gemini-3-flash-preview",
          "provider": "openrouter",
          "description": "Gemini 3 Flash: Long-context planner"
        },
        "writing": {
          "model": "openai/gpt-5.2",
          "provider": "openrouter",
          "description": "GPT-5.2: Frontier long-form quality"
        },
        "refinement": {
          "model": "openai/gpt-5.2",
          "provider": "openrouter",
          "description": "GPT-5.2: Final editorial polish"
        },
        "image": {
          "model": "black-forest-labs/flux.2-max",
          "provider": "openrouter",
          "description": "FLUX.2 max: Hero-grade images"
        }
      },
      "cost_per_article": {
        "text": 0.0878,
        "images": 0.2100,
        "platform_fee": 0.0164,
        "total": 0.3142,
        "currency": "USD"
      }
    }
  }
}
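
Because users may edit this file, it can help to check that every preset still defines a model for each of the six tasks before using it. The following is a minimal sketch of such a check; the function name is hypothetical, and the inline JSON is a deliberately incomplete stand-in rather than the real config.

```php
<?php
// Sketch: list "preset.task" pairs that have no model configured.
function find_missing_tasks(array $config): array
{
    // The six task keys used throughout this document.
    $required_tasks = ['chat', 'clarity', 'planning', 'writing', 'refinement', 'image'];
    $missing = [];
    foreach ($config['presets'] ?? [] as $preset_name => $preset) {
        foreach ($required_tasks as $task) {
            if (empty($preset['models'][$task]['model'])) {
                $missing[] = "{$preset_name}.{$task}";
            }
        }
    }
    return $missing;
}

// A deliberately incomplete preset to show the check firing.
$sample  = json_decode('{"presets": {"demo": {"models": {"chat": {"model": "google/gemini-3-flash-preview"}}}}}', true);
$missing = find_missing_tasks($sample);
print_r($missing);
```

An empty result means every preset covers all six tasks; anything else names the gaps.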

How to Use in Plugin

Load preset on plugin activation:

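// Read the saved preset name, defaulting to 'balanced'.
$preset  = get_option('agentic_writer_preset', 'balanced');
$presets = json_decode(file_get_contents(__DIR__ . '/model-presets.json'), true);
// Fall back to 'balanced' if the stored name is not a known preset.
$preset  = isset($presets['presets'][$preset]) ? $preset : 'balanced';
$active_models = $presets['presets'][$preset]['models'];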

Route API calls based on preset:

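// Every task key in the preset (chat, clarity, planning, writing, refinement, image) maps to a model slug.
if (!isset($active_models[$task_type])) {
    throw new InvalidArgumentException("Unknown task type: {$task_type}");
}
$model = $active_models[$task_type]['model'];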
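
Once a model slug is selected, a text task can be sent to OpenRouter's OpenAI-compatible chat completions endpoint. The sketch below only builds the request so that it runs outside WordPress. The function name, the single-message layout, the timeout, and the example key are illustrative assumptions, and image tasks are not covered.

```php
<?php
// Sketch: build the URL and arguments for a chat-completions request to OpenRouter.
// Nothing is sent here; the array is shaped so it could be passed to wp_remote_post().
function build_openrouter_request(string $model, string $prompt, string $api_key): array
{
    return [
        'url'  => 'https://openrouter.ai/api/v1/chat/completions',
        'args' => [
            'headers' => [
                'Authorization' => 'Bearer ' . $api_key,
                'Content-Type'  => 'application/json',
            ],
            'body'    => json_encode([
                'model'    => $model,
                'messages' => [['role' => 'user', 'content' => $prompt]],
            ]),
            'timeout' => 60, // assumed value; long drafts may need more
        ],
    ];
}

// 'sk-or-EXAMPLE' is a placeholder, not a real key.
$request = build_openrouter_request('google/gemini-3-flash-preview', 'Outline a post about caching.', 'sk-or-EXAMPLE');
// Inside WordPress: $response = wp_remote_post($request['url'], $request['args']);
```

Keeping request construction separate from sending makes the routing logic easy to unit test without network access.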

Display cost estimate in UI:

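$cost = $presets['presets'][$preset]['cost_per_article']['total'];
// Single quotes keep the dollar sign literal; "${$cost}" would be parsed as a variable-variable.
printf('Estimated cost: $%.2f/article', $cost);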

Summary Table

Quick reference for all 3 packs:

Aspect	Budget	Balanced	Premium
Chat	DeepSeek	Gemini Flash	Gemini Flash
Clarity	DeepSeek	Gemini Flash	Claude Sonnet 4
Planning	Gemini Flash	Gemini Flash	Gemini Flash
Writing	Mistral Small	Claude Sonnet	GPT-5.2/Opus
Refinement	DeepSeek	Claude Sonnet	GPT-5.2/Opus
Image	FLUX.2 klein	Riverflow V2 Max	FLUX.2 max
Cost/Article	$0.06	$0.14	$0.31
Monthly (8 articles)	$0.48	$1.12	$2.48
Monthly (20 articles)	$1.20	$2.80	$6.20
Target User	Hobbyist/test	Active creator	Agency/Pro
Default?	❌	✅ RECOMMENDED	❌

Next Steps

Implement preset switcher in plugin settings – Let users pick Budget / Balanced / Premium
Add cost calculator to UI – Show estimated cost before user generates
Support preset customization – Allow power users to swap individual models
Track actual costs – Log usage and compare to estimates for billing transparency
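
For the customization step, one possible approach is to overlay per-task overrides on the selected preset's model map. This is a sketch under assumptions: the function name and the shape of `$overrides` (task name mapped to model slug, for example loaded from a plugin option) are hypothetical.

```php
<?php
// Sketch: overlay user overrides on a preset's model map.
function apply_model_overrides(array $preset_models, array $overrides): array
{
    foreach ($overrides as $task => $model_slug) {
        // Skip tasks the preset does not define, so typos cannot add new keys.
        if (isset($preset_models[$task]) && is_string($model_slug) && $model_slug !== '') {
            $preset_models[$task]['model'] = $model_slug;
        }
    }
    return $preset_models;
}

// Trimmed copy of the Balanced model map, for illustration only.
$balanced_models = [
    'writing'    => ['model' => 'anthropic/claude-3.5-sonnet', 'provider' => 'openrouter'],
    'refinement' => ['model' => 'anthropic/claude-3.5-sonnet', 'provider' => 'openrouter'],
];
// 'video' is not a task in the preset, so it is ignored.
$custom = apply_model_overrides($balanced_models, ['writing' => 'anthropic/claude-sonnet-4', 'video' => 'x']);
```

Note that the stored cost estimates would no longer apply once a model is swapped, so the UI should probably flag customized presets.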

Appendix: Model Slugs

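These are the model identifiers used in the presets above for OpenRouter API calls (as of January 2026). OpenRouter identifiers take the vendor/model form, so the bare deepseek-v3 entry is a placeholder that must be replaced with its full slug before use: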

deepseek-v3
google/gemini-3-flash-preview
anthropic/claude-3.5-sonnet
anthropic/claude-sonnet-4
mistral/mistral-small
openai/gpt-5.2
black-forest-labs/flux.2-klein
black-forest-labs/flux.2-max
sourceful/riverflow-v2-max

Note: Model names and slugs may change slightly as providers update. Verify against OpenRouter Models before deploying.
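
The verification step can be partly automated. The sketch below compares configured slugs against a decoded model list. It assumes the list comes from OpenRouter's `GET https://openrouter.ai/api/v1/models` endpoint and is shaped as a `data` array of objects with an `id` field; the function name is hypothetical, and the response used here is a stand-in, not real API output.

```php
<?php
// Sketch: report configured slugs that are absent from a decoded model list.
// ASSUMPTION: $models_response looks like ['data' => [['id' => 'vendor/model'], ...]].
function find_unknown_slugs(array $configured_slugs, array $models_response): array
{
    $known = array_column($models_response['data'] ?? [], 'id');
    return array_values(array_diff($configured_slugs, $known));
}

// Stand-in response for illustration; a real one would come from the API.
$fake_response = ['data' => [
    ['id' => 'google/gemini-3-flash-preview'],
    ['id' => 'anthropic/claude-sonnet-4'],
]];
$unknown = find_unknown_slugs(['google/gemini-3-flash-preview', 'deepseek-v3'], $fake_response);
print_r($unknown);
```

Running a check like this at plugin activation, or on a schedule, would surface renamed or retired models before a user hits a failed request.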

Document version: 1.0
Date: January 22, 2026
Author: WP Agentic Writer Product Team
Status: Ready for Implementation
