Files
wp-agentic-writer/model-preset-brief.md
2026-01-28 00:26:00 +07:00

22 KiB
Raw Permalink Blame History

WP Agentic Writer: Model Selection & Preset Packs

Executive Summary

This document defines 3 curated model packs for WP Agentic Writer on OpenRouter, optimized for different user budgets and quality requirements. Each pack includes models for 6 tasks: chat, clarity checking, planning, writing, refinement, and image generation.

Key principle: Users bring their own OpenRouter API key. Plugin ships with sensible presets so users just pick "Budget / Balanced / Premium"—no model switching needed unless they want to customize.


Table of Contents

  1. Model Recommendation Strategy
  2. Preset Pack 1: Budget
  3. Preset Pack 2: Balanced (Recommended)
  4. Preset Pack 3: Premium
  5. Cost Estimation Guide
  6. When to Use Each Pack
  7. Implementation Config

Model Recommendation Strategy

Task-by-Task Rationale

1. Chat (Discussion, Recommendation, Research)

What it does: Multi-turn conversation where user discusses topic, asks questions, researches ideas before committing to writing.

Quality metrics:

  • Context understanding
  • Iterative reasoning
  • Long-context support (for multi-message research threads)
  • Cost per token (since users may have many back-and-forth turns)

Recommendation by tier:

Tier Model Reason
Budget DeepSeek V3.x Strong reasoning, excellent value pricing (~$0.55/1M input tokens)
Balanced Gemini 3 Flash Preview Built for multi-turn agentic workflows, 1M context window, cheaper than Pro
Premium Gemini 3 Flash Preview or GPT-5.2 Chat Flash for cost savings on research; GPT-5.2 if user wants single OpenAI vendor

2. Clarity Check (Prompt QA + Quiz Generation)

What it does: Analyzes user's article topic/prompt for ambiguity, generates clarifying questions, suggests research gaps, and optionally generates self-assessment quiz.

Quality metrics:

  • Meta-reasoning (ability to critique its own instructions)
  • Quiz/checklist generation quality
  • Cost per query (typically short, one-off)

Recommendation by tier:

Tier Model Reason
Budget DeepSeek V3.x Good at structured reasoning, checklist generation; very cheap
Balanced Gemini 3 Flash Preview Excellent at prompt analysis and quiz generation; fast feedback loop
Premium Claude Sonnet 4 Nuanced feedback; exceptional at Socratic question generation

3. Planning (Article Outline Generation)

What it does: Takes finalized topic + research notes → generates structured article outline (sections, subsections, key points) as JSON or markdown.

Quality metrics:

  • Structured output (JSON/markdown reliability)
  • Long-context input (researched notes, competitor articles, etc.)
  • Cost (ideally one-off, but might regenerate)
  • Speed (user should see outline quickly)

Recommendation by tier:

Tier Model Reason
Budget Gemini 3 Flash Preview 1M context window, fast, cheap, excellent JSON output
Balanced Gemini 3 Flash Preview Same: primary "thinking" engine; doesn't need premium pricing
Premium Gemini 3 Flash Preview Same: planning quality doesn't scale with cost; still Flash is optimal

4. Writing (Article Draft Generation)

What it does: Transforms outline + research notes → full article draft (25k words), with proper tone, code examples if relevant, and flow.

Quality metrics:

  • Long-form coherence (25k words)
  • Tone consistency (match blog voice)
  • Code + explanation blending (if dev/tech topic)
  • Cost per article (this is the "heavy lift")

Recommendation by tier:

Tier Model Reason
Budget Mistral Small Fast, cheap (~$0.14/1M input); acceptable first drafts for dev blogs; editable output
Balanced Claude Sonnet 3.5 or 4 Industry standard for long-form; strong at code blocks + prose blend; great value/quality ratio
Premium GPT-5.2 or Claude Opus 4.5 Frontier models; superior narrative flow, voice consistency, subtle nuance across 25k words

5. Refinement (Paragraph/Section Edits)

What it does: User selects 13 paragraphs → asks AI to rewrite, expand, shorten, simplify, or adjust tone.

Quality metrics:

  • Precision editing (preserve surrounding context)
  • Tone control (match existing prose)
  • Cost efficiency (small rewrites should be cheap)

Recommendation by tier:

Tier Model Reason
Budget DeepSeek V3.x Cheap, capable at local edits; good enough for "shorten this" / "make beginner-friendly"
Balanced Claude Sonnet 3.5 or 4 Same model as writing phase for consistency; strong at nuanced rewrites
Premium GPT-5.2 or Claude Opus 4.5 Same frontier writer for final polish; maintains voice across refinements

6. Image Generation

What it does: Generates 14 hero or inline images per article based on outline + user direction.

Quality metrics:

  • Visual quality (coherence, aesthetic fit for blog)
  • Prompt adherence (matches user's description)
  • Cost per image (users may want multiple attempts)
  • Speed (not blocking)

Recommendation by tier:

Tier Model Reason
Budget FLUX.2 [klein] 4B Optimized for cost (~$0.014 USD/MP base); acceptable for blog illustrations
Balanced Riverflow V2 Max or FLUX.2 Pro Higher visual quality; flat ~$0.030.04 USD per image; good for professional blogs
Premium FLUX.2 [max] Frontier image quality; best prompt adherence; hero/marketing images (~$0.07 USD/MP base)

Preset Pack 1: Budget

Target User

  • Indie dev blogger or beginner content creator
  • Cost-sensitive; prioritizes shipping over perfection
  • Acceptable output: readable first drafts, simple blog images
  • Typical use: 23 articles/month

Complete Model Pack

Task Model Provider Rationale
Chat DeepSeek V3.x OpenRouter Powerful reasoning, 1/10th the cost of GPT-4.5
Clarity DeepSeek V3.x OpenRouter Meta-reasoning for prompt analysis
Planning Gemini 3 Flash Preview OpenRouter 1M context, fast outlining, dirt cheap
Writing Mistral Small OpenRouter Budget-friendly long-form; acceptable for drafts
Refinement DeepSeek V3.x OpenRouter Cost-efficient edits; reuse for multiple refinements
Image FLUX.2 [klein] 4B OpenRouter (Black Forest Labs) Optimized for cost; good enough for blog headers

Cost Breakdown (Per 2,500-word Article + 3 Images)

Text costs (tokens):

Typical usage:

  • Chat phase: ~3,000 input + 500 output tokens (discussion)
  • Clarity: ~1,000 input + 300 output tokens (prompt analysis)
  • Planning: ~2,000 input + 800 output tokens (outline)
  • Writing: ~4,000 input + 2,500 output tokens (draft generation)
  • Refinement: ~1,000 input + 400 output tokens (one round of edits)
Task Input Tokens Output Tokens Cost (USD)
Chat (DeepSeek) 3,000 500 $0.0019
Clarity (DeepSeek) 1,000 300 $0.0007
Planning (Flash) 2,000 800 $0.0009
Writing (Mistral Small) 4,000 2,500 $0.0090
Refinement (DeepSeek) 1,000 400 $0.0008
Text subtotal $0.0133

Image costs:

  • 3 images × ~1 MP each via FLUX.2 klein
  • First MP per image: $0.014
  • Subsequent MP per image: $0.001
  • 3 images × $0.014 ≈ $0.042

OpenRouter platform fee:

  • 5.5% of total ≈ 5.5% × ($0.0133 + $0.042) ≈ $0.0045
Category Cost (USD)
Text (all tasks) $0.0133
Images (3 × 1 MP) $0.0420
OpenRouter platform fee (5.5%) $0.0045
Total/Article $0.0598

💰 Budget pack = ~$0.06 USD/article (or ~$0.180.30 USD if user refines/regenerates once)


Target User

  • Active dev/content creator or small agency
  • Shipping 410 articles/month
  • Quality matters; willing to pay for strong writing
  • Acceptable output: polished, professional prose; nice images
  • Balance: cost-efficient without sacrificing quality

Complete Model Pack

Task Model Provider Rationale
Chat Gemini 3 Flash Preview OpenRouter Multi-turn agentic chat, 1M context, research-ready
Clarity Gemini 3 Flash Preview OpenRouter Same engine; great at prompt analysis, quiz generation
Planning Gemini 3 Flash Preview OpenRouter Primary "thinking" engine; excellent JSON outline generation
Writing Claude Sonnet 3.5 or 4 OpenRouter Industry standard long-form; strong code + prose blend
Refinement Claude Sonnet 3.5 or 4 OpenRouter Same as writing; consistency in rewrites and tone
Image Riverflow V2 Max or FLUX.2 Pro OpenRouter High visual quality; flat ~$0.030.04 USD per image

Cost Breakdown (Per 2,500-word Article + 3 Images)

Text costs (tokens):

Task Input Tokens Output Tokens Cost (USD)
Chat (Flash) 3,000 500 $0.0015
Clarity (Flash) 1,000 300 $0.0005
Planning (Flash) 2,000 800 $0.0008
Writing (Claude Sonnet) 4,000 2,500 $0.0300
Refinement (Claude Sonnet) 1,000 400 $0.0060
Text subtotal $0.0388

Image costs:

  • 3 images × Riverflow V2 Max (flat pricing ~$0.03 per image)
  • 3 images × $0.03 ≈ $0.0900

OpenRouter platform fee:

  • 5.5% × ($0.0388 + $0.0900) ≈ $0.0071
Category Cost (USD)
Text (all tasks) $0.0388
Images (3 × flat rate) $0.0900
OpenRouter platform fee (5.5%) $0.0071
Total/Article $0.1359

💰 Balanced pack = ~$0.14 USD/article (or ~$0.250.35 USD with one round of refinement/regen)


Preset Pack 3: Premium

Target User

  • Content agencies or full-time creators
  • Publishing 15+ articles/month or selling content
  • Quality is non-negotiable (flagship posts, sales pages, thought leadership)
  • Acceptable output: publication-ready prose; hero images
  • Budget: cost is secondary to impact

Complete Model Pack

Task Model Provider Rationale
Chat Gemini 3 Flash Preview or GPT-5.2 Chat OpenRouter Flash for efficient research; GPT-5.2 if user wants single OpenAI vendor
Clarity Claude Sonnet 4 OpenRouter Exceptional at nuanced prompt feedback and Socratic questioning
Planning Gemini 3 Flash Preview OpenRouter Long-context planner; cost doesn't improve quality for outlining
Writing GPT-5.2 or Claude Opus 4.5 OpenRouter Frontier long-form quality; superior narrative flow, voice, nuance
Refinement GPT-5.2 or Claude Opus 4.5 OpenRouter Same frontier writer; final editorial polish
Image FLUX.2 [max] OpenRouter (Black Forest Labs) Top-tier image quality; best prompt following; hero/marketing grade

Cost Breakdown (Per 2,500-word Article + 3 Images)

Text costs (tokens):

Task Input Tokens Output Tokens Cost (USD)
Chat (Flash) 3,000 500 $0.0015
Clarity (Sonnet 4) 1,000 300 $0.0015
Planning (Flash) 2,000 800 $0.0008
Writing (GPT-5.2 or Opus) 4,000 2,500 $0.0700
Refinement (GPT-5.2 or Opus) 1,000 400 $0.0140
Text subtotal $0.0878

Image costs:

  • 3 images × ~1 MP each via FLUX.2 [max]
  • First MP per image: $0.07
  • Subsequent MP per image: $0.03
  • 3 images × $0.07 ≈ $0.2100

OpenRouter platform fee:

  • 5.5% × ($0.0878 + $0.2100) ≈ $0.0164
Category Cost (USD)
Text (all tasks) $0.0878
Images (3 × max quality) $0.2100
OpenRouter platform fee (5.5%) $0.0164
Total/Article $0.3142

💰 Premium pack = ~$0.31 USD/article (or ~$0.500.75 USD with multiple refinement passes or image regen)


Cost Estimation Guide

How Users Can Calculate Their Needs

Use this framework to estimate monthly costs:

Monthly AI Cost = (Cost/Article) × (Articles/Month) × (Regenerations Factor)

Step 1: Pick your tier and note cost/article

  • Budget: $0.06
  • Balanced: $0.14
  • Premium: $0.31

Step 2: Estimate articles per month

  • Blogger: 24 articles/month
  • Small content team: 510 articles/month
  • Agency: 1530 articles/month

Step 3: Apply regeneration factor

  • First drafts only (happy with output): ×1.0
  • One round of refinement/regeneration: ×1.52.0
  • Heavy iteration (multiple regen cycles): ×2.53.0

Example calculations:

Profile Tier Articles/mo Regens Monthly Cost
Solo dev blogger Budget 2 ×1.5 $0.06 × 2 × 1.5 = $0.18
Content team Balanced 8 ×2.0 $0.14 × 8 × 2.0 = $2.24
Agency (flagship posts) Premium 12 ×2.5 $0.31 × 12 × 2.5 = $9.30

Important Notes

  1. Token counts are estimates. Actual usage depends on:

    • Outline complexity
    • Number of research notes
    • Image resolution and complexity
    • Iteration/refinement cycles
  2. OpenRouter base price + 5.5% platform fee is included in all estimates above.

  3. No hidden costs. Users only pay for what they use. If they skip image generation or skip refinement, cost drops.

  4. Image costs scale with resolution. If user requests higher resolution (>1 MP per image), multiply image cost accordingly.


When to Use Each Pack

Budget Pack: Best For

Use when:

  • First time using AI writing; want to test the plugin
  • Publishing 13 articles/month
  • Topic: dev blogs, quick tutorials (where first drafts are acceptable)
  • Budget: <$5/month

Not ideal for:

  • Sales pages or high-stakes content
  • Audiences expecting polish
  • Topics requiring heavy editing

Workflow expectation: User accepts 12 refinement cycles before publishing.


Use when:

  • Regular blogging (410 articles/month)
  • Mixed content: tutorials, reviews, opinion pieces
  • Publishing to professional blog or portfolio
  • Budget: $520/month

Default recommendation because:

  • Gemini Flash is the best pure planner on the market (cost doesn't improve planning)
  • Claude Sonnet is the industry-standard long-form writer
  • Cost:quality ratio is unbeatable
  • Users can hit "publish" with minimal editing

Not ideal for:

  • One-off flagship posts (Premium is worth it)
  • Micro-budget users (use Budget instead)

Workflow expectation: User does one refinement cycle; publishes with high confidence.


Premium Pack: Best For

Use when:

  • Publishing flagship posts, thought leadership, or sales content
  • Publishing 10+ articles/month (agency/professional creator)
  • Audiences/stakeholders expect flawless prose
  • Images need to be hero/standout quality
  • Budget: $20100+/month

Worth the cost because:

  • GPT-5.2 or Opus produce superior long-form narrative
  • Superior voice consistency across 25k words
  • FLUX.2 [max] images are publication-ready
  • Minimal editing required

Overkill for:

  • Quick dev blogs or tutorials
  • Solo bloggers publishing <5/month

Workflow expectation: User does light editing (if any); publishes immediately.


Implementation Config

JSON Schema for Plugin Settings

Save presets as JSON config so users can swap or customize:

{
  "presets": {
    "budget": {
      "name": "Budget: DeepSeek + Flash + Mistral + FLUX.2 klein",
      "description": "Super affordable ($0.06/article). Great for testing or budget-conscious bloggers.",
      "models": {
        "chat": {
          "model": "deepseek-v3",
          "provider": "openrouter",
          "description": "DeepSeek V3: Fast, cheap, great reasoning"
        },
        "clarity": {
          "model": "deepseek-v3",
          "provider": "openrouter",
          "description": "DeepSeek V3: Meta-reasoning for prompt analysis"
        },
        "planning": {
          "model": "google/gemini-3-flash-preview",
          "provider": "openrouter",
          "description": "Gemini 3 Flash: 1M context, fast outlining"
        },
        "writing": {
          "model": "mistral/mistral-small",
          "provider": "openrouter",
          "description": "Mistral Small: Budget-friendly long-form"
        },
        "refinement": {
          "model": "deepseek-v3",
          "provider": "openrouter",
          "description": "DeepSeek V3: Cost-efficient paragraph edits"
        },
        "image": {
          "model": "black-forest-labs/flux.2-klein",
          "provider": "openrouter",
          "description": "FLUX.2 klein: Optimized for cost"
        }
      },
      "cost_per_article": {
        "text": 0.0133,
        "images": 0.0420,
        "platform_fee": 0.0045,
        "total": 0.0598,
        "currency": "USD"
      }
    },
    "balanced": {
      "name": "Balanced (RECOMMENDED): Gemini Flash + Claude Sonnet + Riverflow",
      "description": "Professional quality ($0.14/article). Default for most creators.",
      "models": {
        "chat": {
          "model": "google/gemini-3-flash-preview",
          "provider": "openrouter",
          "description": "Gemini 3 Flash: Multi-turn agentic chat, 1M context"
        },
        "clarity": {
          "model": "google/gemini-3-flash-preview",
          "provider": "openrouter",
          "description": "Gemini 3 Flash: Excellent at prompt analysis"
        },
        "planning": {
          "model": "google/gemini-3-flash-preview",
          "provider": "openrouter",
          "description": "Gemini 3 Flash: Primary thinking engine"
        },
        "writing": {
          "model": "anthropic/claude-3.5-sonnet",
          "provider": "openrouter",
          "description": "Claude Sonnet: Industry standard long-form"
        },
        "refinement": {
          "model": "anthropic/claude-3.5-sonnet",
          "provider": "openrouter",
          "description": "Claude Sonnet: Consistent rewrites"
        },
        "image": {
          "model": "sourceful/riverflow-v2-max",
          "provider": "openrouter",
          "description": "Riverflow V2 Max: High-quality images"
        }
      },
      "cost_per_article": {
        "text": 0.0388,
        "images": 0.0900,
        "platform_fee": 0.0071,
        "total": 0.1359,
        "currency": "USD"
      }
    },
    "premium": {
      "name": "Premium: GPT-5.2/Opus + Gemini Flash + FLUX.2 max",
      "description": "Flagship quality ($0.31/article). For agencies and thought leaders.",
      "models": {
        "chat": {
          "model": "google/gemini-3-flash-preview",
          "provider": "openrouter",
          "description": "Gemini 3 Flash: Efficient research"
        },
        "clarity": {
          "model": "anthropic/claude-sonnet-4",
          "provider": "openrouter",
          "description": "Claude Sonnet 4: Exceptional feedback"
        },
        "planning": {
          "model": "google/gemini-3-flash-preview",
          "provider": "openrouter",
          "description": "Gemini 3 Flash: Long-context planner"
        },
        "writing": {
          "model": "openai/gpt-5.2",
          "provider": "openrouter",
          "description": "GPT-5.2: Frontier long-form quality"
        },
        "refinement": {
          "model": "openai/gpt-5.2",
          "provider": "openrouter",
          "description": "GPT-5.2: Final editorial polish"
        },
        "image": {
          "model": "black-forest-labs/flux.2-max",
          "provider": "openrouter",
          "description": "FLUX.2 max: Hero-grade images"
        }
      },
      "cost_per_article": {
        "text": 0.0878,
        "images": 0.2100,
        "platform_fee": 0.0164,
        "total": 0.3142,
        "currency": "USD"
      }
    }
  }
}

How to Use in Plugin

  1. Load preset on plugin activation:

    $preset = get_option('agentic_writer_preset', 'balanced');
    $presets = json_decode(file_get_contents(__DIR__ . '/model-presets.json'), true);
    $active_models = $presets['presets'][$preset]['models'];
    
  2. Route API calls based on preset:

    switch ($task_type) {
        case 'chat':
            $model = $active_models['chat']['model'];
            break;
        case 'writing':
            $model = $active_models['writing']['model'];
            break;
        // ... etc
    }
    
  3. Display cost estimate in UI:

    $cost = $presets['presets'][$preset]['cost_per_article']['total'];
    echo "Estimated cost: ${$cost}/article";
    

Summary Table

Quick reference for all 3 packs:

Aspect Budget Balanced Premium
Chat DeepSeek Gemini Flash Gemini Flash
Clarity DeepSeek Gemini Flash Claude Sonnet 4
Planning Gemini Flash Gemini Flash Gemini Flash
Writing Mistral Small Claude Sonnet GPT-5.2/Opus
Refinement DeepSeek Claude Sonnet GPT-5.2/Opus
Image FLUX.2 klein Riverflow V2 Max FLUX.2 max
Cost/Article $0.06 $0.14 $0.31
Monthly (8 articles) $0.48 $1.12 $2.48
Monthly (20 articles) $1.20 $2.80 $6.20
Target User Hobbyist/test Active creator Agency/Pro
Default? RECOMMENDED

Next Steps

  1. Implement preset switcher in plugin settings Let users pick Budget / Balanced / Premium
  2. Add cost calculator to UI Show estimated cost before user generates
  3. Support preset customization Allow power users to swap individual models
  4. Track actual costs Log usage and compare to estimates for billing transparency

Appendix: Model Slugs

These are the exact model identifiers for OpenRouter API calls (as of January 2026):

deepseek-v3
google/gemini-3-flash-preview
anthropic/claude-3.5-sonnet
anthropic/claude-sonnet-4
mistral/mistral-small
openai/gpt-5.2
black-forest-labs/flux.2-klein
black-forest-labs/flux.2-max
sourceful/riverflow-v2-max

Note: Model names and slugs may change slightly as providers update. Verify against OpenRouter Models before deploying.


Document version: 1.0
Date: January 22, 2026
Author: WP Agentic Writer Product Team
Status: Ready for Implementation