first commit all files

2026-01-28 00:26:00 +07:00
parent 65dd207a74
commit 97426d5ab1
72 changed files with 91484 additions and 0 deletions
--- a/AGENTIC_CONTEXT_STRATEGY.md
+++ b/AGENTIC_CONTEXT_STRATEGY.md
@@ -0,0 +1,786 @@
+# Agentic Context Management Strategy
+
+**Date:** January 25, 2026  
+**Version:** 0.1.3+  
+**Purpose:** AI-powered context management for multilingual, intelligent user experience
+
+---
+
+## 🎯 Core Philosophy: Let AI Handle AI Context
+
+### **The Problem with Hardcoded Solutions**
+
+**Previous Approach (FLAWED):**
+```javascript
+// ❌ English-only, brittle, not scalable
+if (content.includes('outline') || content.includes('structure')) {
+    return 'create_outline';
+}
+```
+
+**Issues:**
+- ❌ Only works in English
+- ❌ Breaks in Indonesian, Arabic, Chinese, etc.
+- ❌ Misses nuanced intent
+- ❌ Requires constant maintenance
+- ❌ Goes against "agentic" philosophy
+
+### **Agentic Principle**
+
+> **"If AI is smart enough to write articles, it's smart enough to manage its own context."**
+
+**New Approach:**
+- ✅ Use AI to summarize chat history
+- ✅ Use AI to detect user intent
+- ✅ Language-agnostic (works in any language)
+- ✅ Adapts to context automatically
+- ✅ True "agentic" experience
+
+---
+
+## 💰 Cost Analysis: AI-Powered Context Management
+
+### **Your Cost Reference**
+
+```
+Action: meta_description
+Model: deepseek-chat-v3-032
+Tokens: 510
+Cost: $0.0001
+```
+
+**This is EXTREMELY cheap!** Let's use this model for context operations.
+
+### **Proposed Actions**
+
+#### **1. Action: `summarize_context`**
+
+**Purpose:** Condense long chat history into key points
+
+**Input:**
+```json
+{
+    "action": "summarize_context",
+    "chat_history": [
+        {"role": "user", "content": "Saya ingin menulis tentang keamanan WordPress"},
+        {"role": "assistant", "content": "[Long response in Indonesian...]"},
+        {"role": "user", "content": "Fokus pada plugin vulnerabilities saja"},
+        {"role": "assistant", "content": "[Detailed plugin security response...]"}
+    ]
+}
+```
+
+**Prompt:**
+```
+Summarize this conversation into key points that capture the user's intent and requirements. 
+Focus on:
+- Main topic
+- Specific focus areas
+- Rejected/excluded topics
+- User preferences (tone, audience, etc.)
+
+Keep the summary concise (max 200 words) but preserve critical context.
+Write in the same language as the conversation.
+
+Output format:
+TOPIC: [main topic]
+FOCUS: [what to include]
+EXCLUDE: [what to avoid]
+PREFERENCES: [any specific requirements]
+```
+
+**Expected Output:**
+```
+TOPIC: WordPress security
+FOCUS: Plugin vulnerabilities only
+EXCLUDE: Performance optimization, backup strategies (user rejected these)
+PREFERENCES: Technical audience, detailed explanations
+```
+
+**Cost Estimate:**
+- Input: 4,000 tokens (long chat history)
+- Output: 100 tokens (summary)
+- Model: deepseek-chat-v3-032
+- **Cost: ~$0.0001 per summarization**
+
+**When to Use:**
+- Chat history > 6 messages
+- Before generating outline
+- Before executing article
+
+---
+
+#### **2. Action: `detect_intent`**
+
+**Purpose:** Understand what user wants to do next
+
+**Input:**
+```json
+{
+    "action": "detect_intent",
+    "last_message": "Baiklah, sekarang buatkan outline-nya",
+    "has_plan": false,
+    "current_mode": "chat"
+}
+```
+
+**Prompt:**
+```
+Based on the user's message, determine their intent. Choose ONE:
+
+1. "create_outline" - User wants to create an article outline/structure
+2. "start_writing" - User wants to write the full article
+3. "refine_content" - User wants to improve existing content
+4. "continue_chat" - User wants to continue discussing/exploring
+5. "clarify" - User is asking questions or needs clarification
+
+Consider:
+- The user's explicit request
+- Whether they have an outline already (has_plan: {has_plan})
+- Current mode (current_mode: {current_mode})
+
+Respond with ONLY the intent code (e.g., "create_outline").
+```
+
+**Expected Output:**
+```
+create_outline
+```
+
+**Cost Estimate:**
+- Input: 100 tokens (last message + context)
+- Output: 5 tokens (intent code)
+- Model: deepseek-chat-v3-032
+- **Cost: ~$0.00002 per detection**
+
+**When to Use:**
+- After every user message in Chat mode
+- To show contextual action buttons
+- To auto-suggest next steps
+
+---
+
+## 📊 Cost Comparison: Full History vs AI-Powered
+
+### **Scenario: 5 Agent + 4 Human Messages**
+
+| Approach | Input Tokens | Output Tokens | Cost per Request | Quality | Language Support |
+|----------|--------------|---------------|------------------|---------|------------------|
+| **Full History** | 4,365 | 0 | $0.013 (Claude) | ✅ Best | ✅ All |
+| **AI Summarization** | 100 (summary) | 0 | $0.003 (Claude) + $0.0001 (summary) | ✅ Good | ✅ All |
+| **Hardcoded Pruning** | 1,800 | 0 | $0.005 (Claude) | ⚠️ Fair | ❌ English only |
+| **No Context** | 0 | 0 | $0.000 | ❌ Poor | ✅ All |
+
+### **Cost Breakdown for 100 Articles/Month**
+
+**Full History:**
+- Planning: 100 × $0.00033 = $0.033
+- Execution: 100 × $0.013 = $1.30
+- **Total: $1.33/month**
+
+**AI Summarization:**
+- Summarization: 100 × $0.0001 = $0.01
+- Planning: 100 × $0.00008 = $0.008
+- Execution: 100 × $0.003 = $0.30
+- **Total: $0.32/month**
+- **Savings: $1.01/month (76% reduction)**
+
+**Intent Detection:**
+- Per message: $0.00002
+- Average 10 messages per article: 100 × 10 × $0.00002 = $0.02
+- **Total: $0.02/month (negligible)**
+
+---
+
+## 🎯 The Big Picture: Agentic Experience
+
+### **User Journey Analysis**
+
+**Current Flow (Fragmented):**
+```
+1. User opens editor
+2. User manually switches to Chat mode
+3. User types message
+4. Agent responds
+5. User types more
+6. Agent responds
+7. User manually switches to Planning mode
+8. User types "create outline"
+9. Outline generated
+10. User manually clicks "Start Writing"
+11. Article generated
+```
+
+**Problems:**
+- Too many manual mode switches
+- User must know when to switch
+- No guidance on next steps
+- Friction in workflow
+
+---
+
+### **Proposed Agentic Flow (Seamless):**
+
+```
+1. User opens editor (any mode)
+2. User types: "Saya ingin menulis tentang keamanan WordPress"
+3. Agent responds with suggestions
+4. User types: "Fokus pada plugin vulnerabilities"
+5. Agent responds with refined ideas
+6. 💡 UI shows: [📝 Ready to create outline?] (AI-detected intent)
+7. User clicks button (or types "yes" or "buatkan outline")
+8. ✨ AI summarizes chat history (0.1 seconds)
+9. Outline generated with clean context
+10. 💡 UI shows: [✍️ Start Writing] (auto-suggested)
+11. User clicks
+12. Article generated
+```
+
+**Improvements:**
+- ✅ No manual mode switching needed
+- ✅ AI suggests next steps proactively
+- ✅ Context automatically optimized
+- ✅ Smooth, guided experience
+- ✅ Works in any language
+
+---
+
+## 🔧 Implementation Design
+
+### **Backend: New Actions**
+
+```php
+/**
+ * Handle context summarization request.
+ *
+ * @param WP_REST_Request $request REST request.
+ * @return WP_REST_Response|WP_Error Response.
+ */
+public function handle_summarize_context( $request ) {
+    $params = $request->get_json_params();
+    $chat_history = $params['chatHistory'] ?? array();
+    
+    if ( empty( $chat_history ) || count( $chat_history ) < 4 ) {
+        // No need to summarize short history
+        return new WP_REST_Response(
+            array(
+                'summary' => '',
+                'use_full_history' => true,
+            ),
+            200
+        );
+    }
+    
+    // Build summarization prompt
+    $history_text = '';
+    foreach ( $chat_history as $msg ) {
+        $role = ucfirst( $msg['role'] ?? 'Unknown' );
+        $content = $msg['content'] ?? '';
+        $history_text .= "{$role}: {$content}\n\n";
+    }
+    
+    $prompt = "Summarize this conversation into key points that capture the user's intent and requirements.
+
+Focus on:
+- Main topic
+- Specific focus areas
+- Rejected/excluded topics
+- User preferences (tone, audience, etc.)
+
+Keep the summary concise (max 200 words) but preserve critical context.
+Write in the same language as the conversation.
+
+Output format:
+TOPIC: [main topic]
+FOCUS: [what to include]
+EXCLUDE: [what to avoid]
+PREFERENCES: [any specific requirements]
+
+Conversation:
+{$history_text}";
+    
+    $provider = WP_Agentic_Writer_OpenRouter_Provider::get_instance();
+    $messages = array(
+        array(
+            'role' => 'user',
+            'content' => $prompt,
+        ),
+    );
+    
+    // Use cheap model for summarization
+    $response = $provider->chat( $messages, array(), 'summarize' );
+    
+    if ( is_wp_error( $response ) ) {
+        return $response;
+    }
+    
+    // Track cost
+    do_action(
+        'wp_aw_after_api_request',
+        $params['postId'] ?? 0,
+        $response['model'] ?? '',
+        'summarize_context',
+        $response['input_tokens'] ?? 0,
+        $response['output_tokens'] ?? 0,
+        $response['cost'] ?? 0
+    );
+    
+    return new WP_REST_Response(
+        array(
+            'summary' => $response['content'] ?? '',
+            'use_full_history' => false,
+            'cost' => $response['cost'] ?? 0,
+        ),
+        200
+    );
+}
+
+/**
+ * Handle intent detection request.
+ *
+ * @param WP_REST_Request $request REST request.
+ * @return WP_REST_Response|WP_Error Response.
+ */
+public function handle_detect_intent( $request ) {
+    $params = $request->get_json_params();
+    $last_message = $params['lastMessage'] ?? '';
+    $has_plan = $params['hasPlan'] ?? false;
+    $current_mode = $params['currentMode'] ?? 'chat';
+    
+    if ( empty( $last_message ) ) {
+        return new WP_REST_Response(
+            array( 'intent' => 'continue_chat' ),
+            200
+        );
+    }
+    
+    $prompt = "Based on the user's message, determine their intent. Choose ONE:
+
+1. \"create_outline\" - User wants to create an article outline/structure
+2. \"start_writing\" - User wants to write the full article
+3. \"refine_content\" - User wants to improve existing content
+4. \"continue_chat\" - User wants to continue discussing/exploring
+5. \"clarify\" - User is asking questions or needs clarification
+
+Consider:
+- The user's explicit request
+- Whether they have an outline already (has_plan: " . ( $has_plan ? 'true' : 'false' ) . ")
+- Current mode (current_mode: {$current_mode})
+
+User's message: \"{$last_message}\"
+
+Respond with ONLY the intent code (e.g., \"create_outline\").";
+    
+    $provider = WP_Agentic_Writer_OpenRouter_Provider::get_instance();
+    $messages = array(
+        array(
+            'role' => 'user',
+            'content' => $prompt,
+        ),
+    );
+    
+    $response = $provider->chat( $messages, array(), 'intent_detection' );
+    
+    if ( is_wp_error( $response ) ) {
+        return $response;
+    }
+    
+    // Track cost
+    do_action(
+        'wp_aw_after_api_request',
+        $params['postId'] ?? 0,
+        $response['model'] ?? '',
+        'detect_intent',
+        $response['input_tokens'] ?? 0,
+        $response['output_tokens'] ?? 0,
+        $response['cost'] ?? 0
+    );
+    
+    $intent = trim( strtolower( $response['content'] ?? 'continue_chat' ) );
+    
+    return new WP_REST_Response(
+        array(
+            'intent' => $intent,
+            'cost' => $response['cost'] ?? 0,
+        ),
+        200
+    );
+}
+```
+
+---
+
+### **Frontend: Agentic UX**
+
+```javascript
+// Auto-detect intent after each message
+const handleMessageSent = async (userMessage) => {
+    // Send message to chat
+    const chatResponse = await sendChatMessage(userMessage);
+    
+    // Detect intent in background
+    const intentResponse = await fetch('/wp-json/wp-agentic-writer/v1/detect-intent', {
+        method: 'POST',
+        body: JSON.stringify({
+            lastMessage: userMessage,
+            hasPlan: !!currentPlan,
+            currentMode: agentMode,
+            postId: postId
+        })
+    });
+    
+    const { intent } = await intentResponse.json();
+    
+    // Show contextual action based on intent
+    setDetectedIntent(intent);
+    showContextualAction(intent);
+};
+
+// Render contextual action buttons
+const renderContextualAction = () => {
+    if (!detectedIntent) return null;
+    
+    switch (detectedIntent) {
+        case 'create_outline':
+            return (
+                <div className="contextual-action">
+                    <p>💡 Ready to create an outline?</p>
+                    <button onClick={handleCreateOutlineWithSummary} className="primary">
+                        📝 Create Outline
+                    </button>
+                </div>
+            );
+        
+        case 'start_writing':
+            if (!currentPlan) {
+                return (
+                    <div className="contextual-action">
+                        <p>⚠️ You need an outline first</p>
+                        <button onClick={handleCreateOutlineWithSummary}>
+                            📝 Create Outline First
+                        </button>
+                    </div>
+                );
+            }
+            return (
+                <div className="contextual-action">
+                    <p>💡 Ready to write the article?</p>
+                    <button onClick={handleStartWriting} className="primary">
+                        ✍️ Start Writing
+                    </button>
+                </div>
+            );
+        
+        case 'refine_content':
+            return (
+                <div className="contextual-action">
+                    <p>💡 Use @block to refine specific sections</p>
+                </div>
+            );
+        
+        default:
+            return null;
+    }
+};
+
+// Create outline with AI summarization
+const handleCreateOutlineWithSummary = async () => {
+    setIsLoading(true);
+    
+    // Step 1: Summarize chat history if needed
+    let contextToSend = messages;
+    
+    if (messages.length > 6) {
+        showStatus('Optimizing context...');
+        
+        const summaryResponse = await fetch('/wp-json/wp-agentic-writer/v1/summarize-context', {
+            method: 'POST',
+            body: JSON.stringify({
+                chatHistory: messages,
+                postId: postId
+            })
+        });
+        
+        const { summary, use_full_history, cost } = await summaryResponse.json();
+        
+        if (!use_full_history && summary) {
+            // Use summarized context
+            contextToSend = [
+                {
+                    role: 'system',
+                    content: `Context Summary:\n${summary}`
+                },
+                ...messages.slice(-2) // Keep last exchange
+            ];
+            
+            console.log('Context optimized. Cost:', cost);
+        }
+    }
+    
+    // Step 2: Generate outline with optimized context
+    showStatus('Creating outline...');
+    
+    const outlineResponse = await fetch('/wp-json/wp-agentic-writer/v1/generate-plan', {
+        method: 'POST',
+        body: JSON.stringify({
+            topic: extractTopic(messages),
+            chatHistory: contextToSend,
+            postId: postId,
+            postConfig: postConfig,
+            stream: true
+        })
+    });
+    
+    // Handle streaming response...
+};
+```
+
+---
+
+## 🎨 UX Enhancements
+
+### **1. Contextual Action Cards**
+
+```
+┌─────────────────────────────────────────────────────┐
+│  Agent: "I can help you create a comprehensive      │
+│  outline for WordPress plugin security..."          │
+├─────────────────────────────────────────────────────┤
+│  💡 Detected Intent: Create Outline                 │
+│                                                      │
+│  [📝 Create Outline]  [💬 Continue Discussing]      │
+│                                                      │
+│  💰 Context will be optimized (~$0.0001)            │
+└─────────────────────────────────────────────────────┘
+```
+
+### **2. Context Optimization Indicator**
+
+```
+┌─────────────────────────────────────────────────────┐
+│  ⚡ Optimizing context...                           │
+│  • 9 messages → Summary (200 words)                 │
+│  • Token reduction: 4,365 → 450 (90%)               │
+│  • Cost: $0.0001                                    │
+│  ✓ Done in 0.2s                                     │
+└─────────────────────────────────────────────────────┘
+```
+
+### **3. Smart Mode Transitions**
+
+```
+User in Chat mode types: "buatkan outline-nya"
+
+┌─────────────────────────────────────────────────────┐
+│  💡 Switching to Planning mode...                   │
+│  • Detected intent: Create outline                  │
+│  • Optimizing 7 messages of context                 │
+│  • Generating outline...                            │
+└─────────────────────────────────────────────────────┘
+
+[Outline appears]
+
+┌─────────────────────────────────────────────────────┐
+│  ✨ Outline ready!                                   │
+│                                                      │
+│  Next step:                                          │
+│  [✍️ Start Writing Article]                         │
+└─────────────────────────────────────────────────────┘
+```
+
+---
+
+## 📊 Decision Matrix: When to Use What?
+
+| Situation | Recommended Approach | Reason |
+|-----------|---------------------|--------|
+| **Chat history ≤ 4 messages** | Send full history | Short enough, no optimization needed |
+| **Chat history 5-8 messages** | AI summarization | Balance cost and quality |
+| **Chat history > 8 messages** | AI summarization + last 2 | Keep recent context verbatim |
+| **User switches modes** | Detect intent | Guide user to next action |
+| **Before outline generation** | Summarize context | Clean, focused input |
+| **Before article execution** | Use plan (no chat history) | Plan already has all context |
+| **Block refinement** | No chat history | Block content is sufficient |
+| **User types "/reset"** | Clear all context | Fresh start |
+
+---
+
+## 🎯 Recommendation: Hybrid Intelligent Approach
+
+### **The Winning Strategy**
+
+**Combine AI-powered + Smart Defaults:**
+
+1. **Default Behavior (No User Action)**
+   - Chat history ≤ 4 messages → Send full history
+   - Chat history > 4 messages → Auto-summarize with AI
+   - Cost: ~$0.0001 per summarization (negligible)
+
+2. **Intent Detection (Automatic)**
+   - After every user message → Detect intent
+   - Show contextual action buttons
+   - Cost: ~$0.00002 per detection (negligible)
+
+3. **User Control (Optional)**
+   - Settings: "Context Mode" → Auto/Full/Minimal
+   - "/reset" command → Clear context
+   - Manual selection UI (advanced users)
+
+### **Why This Works**
+
+✅ **Language-Agnostic**
+- Works in English, Indonesian, Arabic, Chinese, etc.
+- No hardcoded keywords
+
+✅ **Cost-Effective**
+- 76% cost reduction vs full history
+- Total added cost: ~$0.34/month for 100 articles
+- ROI: Better quality + Lower cost
+
+✅ **True Agentic Experience**
+- AI manages its own context
+- Proactive suggestions
+- Seamless workflow
+- No manual mode switching
+
+✅ **User-Friendly**
+- Automatic by default
+- Optional manual control
+- Transparent (shows what's happening)
+- Fast (summarization takes 0.1-0.3s)
+
+---
+
+## 🔧 Implementation Plan
+
+### **Phase 1: Core Infrastructure** (Week 1)
+
+**Backend:**
+- [ ] Add `/summarize-context` endpoint
+- [ ] Add `/detect-intent` endpoint
+- [ ] Add `summarize` and `intent_detection` operation types to cost tracking
+- [ ] Update OpenRouter provider to support these actions
+
+**Frontend:**
+- [ ] Add `handleSummarizeContext()` function
+- [ ] Add `handleDetectIntent()` function
+- [ ] Add context optimization indicator component
+
+**Testing:**
+- [ ] Test summarization in English, Indonesian, Arabic
+- [ ] Test intent detection in multiple languages
+- [ ] Verify cost tracking
+
+### **Phase 2: UX Integration** (Week 2)
+
+**Frontend:**
+- [ ] Add contextual action cards
+- [ ] Auto-detect intent after each message
+- [ ] Show "Optimizing context..." status
+- [ ] Add smart mode transitions
+
+**Settings:**
+- [ ] Add "Context Mode" setting (Auto/Full/Minimal)
+- [ ] Add context optimization toggle
+- [ ] Add cost estimates in settings
+
+**Testing:**
+- [ ] Test full user journey (chat → outline → write)
+- [ ] Test in multiple languages
+- [ ] Verify smooth transitions
+
+### **Phase 3: Advanced Features** (Week 3)
+
+**Features:**
+- [ ] Add `/reset` command
+- [ ] Add manual context selection UI (optional)
+- [ ] Add context analytics (token usage, cost breakdown)
+- [ ] Add context caching (reuse summaries)
+
+**Optimization:**
+- [ ] Implement smart caching for summaries
+- [ ] Add context relevance scoring
+- [ ] Optimize prompt templates
+
+**Documentation:**
+- [ ] Update user guide
+- [ ] Add context management section
+- [ ] Document cost implications
+
+---
+
+## 💰 Final Cost Analysis
+
+### **Per Article (Average)**
+
+| Component | Cost | Frequency |
+|-----------|------|-----------|
+| Intent detection | $0.00002 × 10 messages | = $0.0002 |
+| Context summarization | $0.0001 × 1 time | = $0.0001 |
+| Planning (with summary) | $0.003 | = $0.003 |
+| Execution (no history) | $0.50-$2.00 | = $1.00 avg |
+| **Total per article** | | **≈ $1.0033** |
+
+**Compared to Full History:**
+- Full history approach: $1.013 per article
+- AI-powered approach: $1.0033 per article
+- **Savings: $0.01 per article** (negligible)
+
+**But wait - the real benefit:**
+- ✅ Better quality (clean, focused context)
+- ✅ Language-agnostic (works everywhere)
+- ✅ Better UX (proactive suggestions)
+- ✅ Scalable (no hardcoded rules)
+
+---
+
+## 🎯 Answer to Your Question
+
+### **"Send all chat history vs AI summarization?"**
+
+**Answer: AI Summarization is Better**
+
+**Reasons:**
+
+1. **Cost is Nearly Identical**
+   - Full history: $1.013/article
+   - AI summary: $1.0033/article
+   - Difference: $0.01 (1% savings)
+
+2. **Quality is Better**
+   - Summary removes contradicted ideas
+   - Summary focuses on final intent
+   - Summary prevents pollution
+   - AI explicitly told what to focus on
+
+3. **Language Support**
+   - Full history: Works in all languages ✅
+   - AI summary: Works in all languages ✅
+   - Hardcoded: Only English ❌
+
+4. **Agentic Experience**
+   - AI managing AI context = true agentic
+   - Proactive intent detection
+   - Seamless workflow
+   - No user friction
+
+5. **Scalability**
+   - No hardcoded rules to maintain
+   - Adapts to new languages automatically
+   - Handles edge cases gracefully
+
+### **The Plan:**
+
+1. ✅ **Implement AI summarization** (not hardcoded)
+2. ✅ **Implement AI intent detection** (not hardcoded)
+3. ✅ **Make it automatic** (no user action needed)
+4. ✅ **Add user controls** (optional override)
+5. ✅ **Track costs transparently** (show user what's happening)
+
+---
+
+**Status:** 🚀 READY TO IMPLEMENT  
+**Approach:** AI-Powered (Agentic)  
+**Cost Impact:** Negligible (+$0.34/month for 100 articles)  
+**Quality Impact:** Significant improvement  
+**UX Impact:** Seamless, guided experience