AI Analysis and Content Strategy fixes. Enhanced Strategy Routes refactoring.

2026-01-10 19:32:50 +05:30
parent 0b63ae7fc1
commit 8193cdba67
298 changed files with 45678 additions and 10952 deletions
--- a/docs/Billing_Subscription/PRODUCTION_PRICING_STRATEGY.md
+++ b/docs/Billing_Subscription/PRODUCTION_PRICING_STRATEGY.md
@@ -0,0 +1,899 @@
+# Production Pricing Strategy - Basic Tier Launch (OSS-Focused)
+
+## Executive Summary
+
+This document provides a comprehensive pricing strategy for ALwrity's production launch with **Basic Tier only**. All features and tools will be accessible to Basic tier users, requiring careful cost calculation and limit setting to ensure sustainability while providing value.
+
+**Critical Goals**:
+1. **OSS-First Strategy**: Prioritize Open-Source AI models (WaveSpeed OSS models) for cost efficiency
+2. **Hard Cost Cap**: $40-50 per user per month maximum (protects against losses)
+3. **Maximum User Value**: Provide generous limits while staying within cost constraints
+4. **Fair Pricing**: Balance between sustainability and user value (not excessive profit margins)
+
+**Strategy**: Use WaveSpeed's OSS models (Qwen, FLUX, Ideogram, WAN 2.5) which offer better pricing than proprietary alternatives, allowing us to provide more value to users while maintaining profitability.
+
+---
+
+## Current State Analysis
+
+### Current Basic Tier (Code Implementation)
+
+**Price**: $29/month ($290/year)
+
+**Limits**:
+- **AI Text Generation**: 10 unified calls/month (across all LLM providers)
+- **Tokens**: 20,000 per provider (Gemini, OpenAI, Anthropic, Mistral)
+- **Search APIs**: 200 Tavily, 200 Serper, 100 Metaphor, 100 Firecrawl, 500 Exa
+- **Image Generation**: 5 Stability AI images/month
+- **Image Editing**: 30 AI image edits/month
+- **Video Generation**: 20 videos/month
+- **Audio Generation**: 50 TTS generations/month
+- **Monthly Cost Cap**: $50.00
+
+**Problem**: 10 unified AI text generation calls is **too restrictive** for production launch where users need to experience all features.
+
+---
+
+## ALwrity Tools & Content Generation Analysis
+
+### Content Generation Tools
+
+#### 1. **Text Generation Tools** (Primary LLM Usage)
+
+| Tool | API Calls per Generation | Typical Usage | Cost per Generation |
+|------|--------------------------|---------------|---------------------|
+| **Blog Writer** | 3-5 calls | 1 blog = research (1) + outline (1) + content (1-3) | $0.01 - $0.05 |
+| **Story Writer** | 2-3 calls | 1 story = outline (1) + script (1-2) | $0.01 - $0.03 |
+| **Podcast Maker** | 3-4 calls | 1 podcast = research (1) + script (1) + outline (1-2) | $0.01 - $0.04 |
+| **Facebook Writer** | 1-2 calls | 1 post = generation (1) + optional optimization (1) | $0.005 - $0.01 |
+| **LinkedIn Writer** | 1-2 calls | 1 post = generation (1) + optional optimization (1) | $0.005 - $0.01 |
+| **SEO Tools** | 1-3 calls | Varies by tool complexity | $0.005 - $0.02 |
+| **Content Planning** | 2-4 calls | Strategy generation + analysis | $0.01 - $0.03 |
+
+**Average**: ~2-3 LLM calls per content generation workflow
+
+#### 2. **Image Generation Tools**
+
+| Tool | API Calls | Cost per Generation |
+|------|-----------|---------------------|
+| **Image Generator** | 1 Stability call | $0.04 per image |
+| **Image Editor** | 1 Image Edit call | $0.04 per edit operation |
+
+**Current Limit**: 5 images/month (too low for production)
+
+#### 3. **Video Generation Tools**
+
+| Tool | API Calls | Cost per Video | Notes |
+|------|-----------|-----------------|-------|
+| **Video Studio** | 1 video call | $0.10 - $0.42 | Depends on model/duration |
+| **YouTube Creator** | 1 video call per scene | $0.10 - $0.42 per scene | 5 scenes = $0.50 - $2.10 |
+| **Story Writer Video** | 1 video call per scene | $0.10 - $0.42 per scene | Variable scenes |
+| **Podcast Maker Video** | 1 video call per scene | $0.10 - $0.42 per scene | Optional video generation |
+
+**Current Limit**: 20 videos/month (reasonable)
+
+#### 4. **Audio Generation Tools**
+
+| Tool | API Calls | Cost per Generation | Notes |
+|------|-----------|---------------------|-------|
+| **Audio Generator** | 1 audio call | $0.05 per 1,000 chars | ~$0.10 - $0.50 per audio |
+| **Podcast Maker TTS** | 1 audio call per scene | $0.05 per 1,000 chars | Multiple scenes |
+| **Story Writer Narration** | 1 audio call per scene | $0.05 per 1,000 chars | Multiple scenes |
+
+**Current Limit**: 50 audio generations/month (reasonable)
+
+---
+
+## API Cost Breakdown
+
+### LLM Provider Costs (Per 1M Tokens)
+
+| Provider | Model | Input Cost | Output Cost | Typical Use |
+|----------|-------|------------|-------------|-------------|
+| **Gemini** | 2.5 Flash | $0.30 | $2.50 | Default (cost-effective) |
+| **Gemini** | 2.5 Pro | $1.25 | $10.00 | Premium quality |
+| **OpenAI** | GPT-4o Mini | $0.15 | $0.60 | Cost-effective |
+| **OpenAI** | GPT-4o | $2.50 | $10.00 | Premium quality |
+| **Anthropic** | Claude 3.5 Sonnet | $3.00 | $15.00 | Premium quality |
+| **HuggingFace** | GPT-OSS-120B | $1.00 | $3.00 | Alternative option |
+
+**Average Cost per LLM Call** (assuming 1K input + 2K output tokens):
+- Gemini Flash: ~$0.0056 per call
+- GPT-4o Mini: ~$0.0015 per call
+- Claude 3.5: ~$0.033 per call
+
+**Recommendation**: Use Gemini Flash as default for cost efficiency.
+
+### Search API Costs
+
+| Provider | Cost per Search | Typical Usage |
+|----------|----------------|---------------|
+| **Tavily** | $0.001 | Research operations |
+| **Serper** | $0.001 | Research operations |
+| **Metaphor** | $0.003 | Research operations |
+| **Exa** | $0.005 | Neural search (premium) |
+| **Firecrawl** | $0.002 | Web page extraction |
+
+**Average**: ~$0.002 per search operation
+
+### Media Generation Costs (OSS-Focused via WaveSpeed)
+
+#### **Image Generation** (OSS Models via WaveSpeed)
+| Model | Cost | Type | Notes |
+|------|------|------|-------|
+| **Qwen Image** | $0.03 per image | OSS | Fast generation, cost-effective |
+| **Ideogram V3 Turbo** | $0.05 per image | OSS | Photorealistic, text rendering |
+| **Default (Qwen)** | $0.03 per image | OSS | **Recommended for Basic tier** |
+
+#### **Image Editing** (OSS Models via WaveSpeed)
+| Model | Cost | Type | Use Case |
+|------|------|------|----------|
+| **Qwen Image Edit** | $0.02 per edit | OSS | Budget editing, bilingual |
+| **Qwen Image Edit Plus** | $0.02 per edit | OSS | Multi-image editing |
+| **FLUX Kontext Pro** | $0.04 per edit | OSS | Typography, professional |
+| **Default (Qwen Edit)** | $0.02 per edit | OSS | **Recommended for Basic tier** |
+
+#### **Video Generation** (OSS Models via WaveSpeed)
+| Model | Cost | Type | Duration | Notes |
+|------|------|------|----------|-------|
+| **WAN 2.5** | $0.05/sec | OSS | 5-15 sec | Text-to-Video, Image-to-Video |
+| **Seedance 1.5 Pro** | $0.08/sec | OSS | 10-30 sec | Longer duration |
+| **Kling v2.5 Turbo (5s)** | $0.21 per video | OSS | 5 sec | Image-to-Video |
+| **Kling v2.5 Turbo (10s)** | $0.42 per video | OSS | 10 sec | Extended duration |
+| **Default (WAN 2.5)** | $0.25 per video | OSS | ~5 sec | **Recommended for Basic tier** |
+
+#### **Audio Generation** (OSS Models via WaveSpeed)
+| Model | Cost | Type | Notes |
+|------|------|------|-------|
+| **Minimax Speech 02 HD** | $0.05 per 1K chars | OSS | High-quality TTS |
+| **Default** | $0.05 per 1K chars | OSS | ~$0.10-0.50 per audio |
+
+#### **Face Swap & Specialized** (OSS Models via WaveSpeed)
+| Operation | Cost | Type | Notes |
+|-----------|------|------|-------|
+| **Face Swap** | $0.01-$0.03 | OSS | Basic to premium quality |
+| **Image Upscaling** | $0.01-$0.06 | OSS | 2K/4K/8K options |
+| **3D Generation** | $0.02-$0.30 | OSS | Budget to premium |
+
+**OSS Advantage**: WaveSpeed provides access to OSS models (Qwen, FLUX, Ideogram, WAN 2.5) at significantly lower costs than proprietary alternatives, enabling better value for users.
+
+---
+
+## Production-Ready Basic Tier Proposal
+
+### Revised Limits for Production Launch
+
+**Price**: $29/month ($290/year) - **KEEP CURRENT PRICING**
+
+**Rationale**: Competitive pricing point, allows for sustainable margins with proper limits.
+
+### Proposed Limits
+
+#### 1. **AI Text Generation** (Unified Limit)
+- **Current**: 10 calls/month ❌ **TOO LOW**
+- **Proposed**: **50 calls/month** ✅
+- **Rationale**: 
+  - Allows ~16-25 content generations/month (assuming 2-3 calls each)
+  - Enables users to experience Blog Writer, Story Writer, Podcast Maker, Social Writers
+  - Sustainable cost: ~$0.28/month (50 calls × $0.0056 average)
+
+#### 2. **Token Limits** (Per Provider)
+- **Current**: 20,000 tokens/provider
+- **Proposed**: **100,000 tokens/provider** ✅
+- **Rationale**:
+  - Allows ~33-50 LLM calls per provider (assuming 2K tokens/call)
+  - Provides buffer for longer content generation
+  - Aligns with unified call limit (50 calls × 2K tokens = 100K tokens)
+
+#### 3. **Search APIs**
+- **Tavily**: 200 calls/month ✅ (Keep)
+- **Serper**: 200 calls/month ✅ (Keep)
+- **Metaphor**: 100 calls/month ✅ (Keep)
+- **Firecrawl**: 100 calls/month ✅ (Keep)
+- **Exa**: 500 calls/month ✅ (Keep)
+- **Rationale**: Sufficient for research-heavy tools (Blog Writer, Podcast Maker, SEO tools)
+
+#### 4. **Image Generation** (OSS Models via WaveSpeed)
+- **Current**: 5 images/month ❌ **TOO LOW**
+- **Proposed**: **50 images/month** ✅ (INCREASED - OSS models are cheaper)
+- **Rationale**: 
+  - OSS models (Qwen Image $0.03) are cheaper than Stability ($0.04)
+  - Allows users to generate images for Story Writer, Blog Writer, Social Media
+  - Cost: ~$1.50/month (50 × $0.03 using Qwen Image OSS model)
+  - Enables visual content creation workflows
+  - **Default to Qwen Image OSS model** for cost efficiency
+
+#### 5. **Image Editing** (OSS Models via WaveSpeed)
+- **Current**: 30 edits/month
+- **Proposed**: **50 edits/month** ✅ (INCREASED - OSS models are cheaper)
+- **Rationale**: 
+  - OSS models (Qwen Edit $0.02) are cheaper than Stability ($0.04)
+  - Cost: ~$1.00/month (50 × $0.02 using Qwen Edit OSS model)
+  - Sufficient for image optimization workflows
+  - **Default to Qwen Edit OSS model** for cost efficiency
+
+#### 6. **Video Generation** (OSS Models via WaveSpeed)
+- **Current**: 20 videos/month
+- **Proposed**: **30 videos/month** ✅ (INCREASED - OSS models available)
+- **Rationale**: 
+  - OSS models (WAN 2.5 $0.25 per 5s video) provide good value
+  - Allows ~6-10 full video projects/month (assuming 3-5 scenes each)
+  - Cost: ~$7.50/month (30 × $0.25 using WAN 2.5 OSS model)
+  - Enables Video Studio, YouTube Creator, Story Writer video features
+  - **Default to WAN 2.5 OSS model** for cost efficiency
+
+#### 7. **Audio Generation** (OSS Models via WaveSpeed)
+- **Current**: 50 generations/month
+- **Proposed**: **100 generations/month** ✅ (INCREASED - OSS models are affordable)
+- **Rationale**:
+  - OSS models (Minimax Speech 02 HD) provide high quality at $0.05/1K chars
+  - Sufficient for Podcast Maker, Story Writer narration
+  - Cost: ~$10.00-$25.00/month (depending on length, assuming 2K-5K chars per audio)
+  - Enables audio content workflows
+  - **Default to Minimax Speech 02 HD OSS model**
+
+#### 8. **Monthly Cost Cap**
+- **Current**: $50.00
+- **Proposed**: **$45.00** ✅ (ADJUSTED - aligns with $40-50 target)
+- **Rationale**: 
+  - Protects against unexpected high usage
+  - Allows flexibility within limits
+  - Provides safety margin
+  - Aligns with $40-50 hard limit requirement
+
+---
+
+## Cost Analysis: Proposed Basic Tier (OSS-Focused)
+
+### Monthly Cost Breakdown (Per User) - Using OSS Models
+
+| Category | Usage | Cost per Unit (OSS) | Monthly Cost |
+|----------|-------|---------------------|--------------|
+| **LLM Calls** | 50 calls | $0.0056 avg (Gemini Flash) | **$0.28** |
+| **Search APIs** | 200 searches | $0.002 avg | **$0.40** |
+| **Image Generation** | 50 images | $0.03 (Qwen Image OSS) | **$1.50** |
+| **Image Editing** | 50 edits | $0.02 (Qwen Edit OSS) | **$1.00** |
+| **Video Generation** | 30 videos | $0.25 (WAN 2.5 OSS, ~5s) | **$7.50** |
+| **Audio Generation** | 100 audios | $0.10-$0.50 avg | **$10.00-$25.00** |
+| **Total Variable Cost** | | | **$20.68-$35.68** |
+
+### Margin Analysis (OSS-Focused)
+
+**Subscription Revenue**: $29.00/month
+**Variable Costs (OSS Models)**: $20.68-$35.68/month (depending on usage)
+**Gross Margin**: **-$6.68 to +$8.32/month**
+
+**✅ IMPROVEMENT**: OSS models reduce costs significantly:
+- Image generation: $0.03 vs $0.04 (25% savings)
+- Image editing: $0.02 vs $0.04 (50% savings)
+- Video generation: $0.25 vs $0.42 (40% savings)
+
+**Mitigation Strategy**: 
+1. **Cost cap enforcement**: Monthly cost cap of $45 prevents extreme losses
+2. **OSS model defaults**: Default to cheaper OSS models (Qwen, WAN 2.5)
+3. **Realistic usage**: Most users won't hit all limits simultaneously
+4. **Average usage assumption**: ~60-70% of limits = $12-25 cost = $4-17 margin
+5. **Hard limit protection**: $45 cap ensures we never exceed $50/user/month
+
+---
+
+## Revised Basic Tier Limits (Production-Ready, OSS-Focused)
+
+```python
+{
+    "name": "Basic",
+    "tier": SubscriptionTier.BASIC,
+    "price_monthly": 29.0,
+    "price_yearly": 290.0,
+    
+    # AI Text Generation (Unified Limit)
+    "ai_text_generation_calls_limit": 50,  # INCREASED from 10
+    
+    # Token Limits (Per Provider)
+    "gemini_tokens_limit": 100000,  # INCREASED from 20,000
+    "openai_tokens_limit": 100000,  # INCREASED from 20,000
+    "anthropic_tokens_limit": 100000,  # INCREASED from 20,000
+    "mistral_tokens_limit": 100000,  # INCREASED from 20,000
+    
+    # Search APIs
+    "tavily_calls_limit": 200,  # Keep
+    "serper_calls_limit": 200,  # Keep
+    "metaphor_calls_limit": 100,  # Keep
+    "firecrawl_calls_limit": 100,  # Keep
+    "exa_calls_limit": 500,  # Keep
+    
+    # Media Generation (OSS Models via WaveSpeed)
+    "stability_calls_limit": 50,  # INCREASED from 5 (using Qwen Image OSS $0.03)
+    "image_edit_calls_limit": 50,  # INCREASED from 30 (using Qwen Edit OSS $0.02)
+    "video_calls_limit": 30,  # INCREASED from 20 (using WAN 2.5 OSS $0.25)
+    "audio_calls_limit": 100,  # INCREASED from 50 (using Minimax Speech OSS)
+    
+    # Cost Protection
+    "monthly_cost_limit": 45.0,  # ADJUSTED from 50.0 (aligns with $40-50 target)
+    
+    # OSS Model Defaults
+    "default_image_model": "qwen-image",  # OSS model via WaveSpeed
+    "default_image_edit_model": "qwen-edit",  # OSS model via WaveSpeed
+    "default_video_model": "wan-2.5",  # OSS model via WaveSpeed
+    "default_audio_model": "minimax-speech-02-hd",  # OSS model via WaveSpeed
+    
+    # Features
+    "features": [
+        "full_content_generation",
+        "advanced_research", 
+        "basic_analytics",
+        "all_tools_access",  # All ALwrity tools accessible
+        "billing_dashboard",
+        "usage_tracking",
+        "oss_models_priority"  # NEW: OSS models prioritized for cost efficiency
+    ],
+    "description": "Perfect for individuals and small teams. Access all ALwrity features with generous limits powered by OSS AI models."
+}
+```
+
+---
+
+## Tool Usage Scenarios & Limits
+
+### Scenario 1: Blog Writer User
+- **Workflow**: 1 blog post = 3-5 LLM calls + 3-5 search calls + 1-2 images
+- **Monthly Capacity**: ~10-16 blog posts (with 50 LLM calls)
+- **Cost**: ~$0.50-$1.00 per blog post
+- **Status**: ✅ **FEASIBLE**
+
+### Scenario 2: Story Writer User
+- **Workflow**: 1 story = 2-3 LLM calls + 5-10 images + 5-10 audio + 5-10 videos
+- **Monthly Capacity**: ~16-25 stories (LLM limit) OR ~3-6 stories (image/video limits)
+- **Cost**: ~$2.00-$5.00 per story
+- **Status**: ✅ **FEASIBLE** (limited by media, not LLM)
+
+### Scenario 3: Podcast Maker User
+- **Workflow**: 1 podcast = 3-4 LLM calls + 3-5 search calls + 5-10 audio + optional 5-10 videos
+- **Monthly Capacity**: ~12-16 podcasts (LLM limit) OR ~5-10 podcasts (audio limit)
+- **Cost**: ~$1.00-$3.00 per podcast (without video)
+- **Status**: ✅ **FEASIBLE**
+
+### Scenario 4: Social Media Content Creator
+- **Workflow**: 1 post = 1-2 LLM calls + 1 image (optional)
+- **Monthly Capacity**: ~25-50 posts (LLM limit) OR ~30 posts (image limit)
+- **Cost**: ~$0.10-$0.15 per post
+- **Status**: ✅ **FEASIBLE**
+
+### Scenario 5: Video Creator (YouTube Creator)
+- **Workflow**: 1 video = 2-3 LLM calls + 5 scenes × (1 image + 1 audio + 1 video)
+- **Monthly Capacity**: ~4-5 full videos (video limit) OR ~16-25 videos (LLM limit)
+- **Cost**: ~$3.00-$5.00 per video
+- **Status**: ✅ **FEASIBLE** (limited by video limit, not LLM)
+
+---
+
+## Risk Mitigation Strategies
+
+### 1. **Cost Cap Enforcement**
+- **Monthly cost cap**: $50.00 (hard limit)
+- **Behavior**: When cap reached, all API calls blocked until next billing period
+- **Protection**: Prevents losses from extreme usage
+
+### 2. **Pre-flight Validation**
+- **Implementation**: Already in place
+- **Function**: Validates limits BEFORE making API calls
+- **Benefit**: Prevents wasted API calls on operations that would fail
+
+### 3. **Usage Monitoring & Alerts**
+- **80% Warning**: Alert users at 80% of limits
+- **100% Block**: Block operations at 100% of limits
+- **Dashboard**: Real-time usage tracking
+
+### 4. **Optimized Default Models**
+- **Strategy**: Use cost-effective models by default (Gemini Flash, GPT-4o Mini)
+- **Benefit**: Reduces costs while maintaining quality
+- **User Control**: Allow model selection for power users
+
+### 5. **Efficient API Usage**
+- **Batching**: Batch multiple operations where possible
+- **Caching**: Cache research results and common queries
+- **Optimization**: Continue optimizing tool workflows to reduce API calls
+
+---
+
+## Pricing Page Updates Required
+
+### Current Issues
+1. Pricing page shows outdated limits
+2. Missing unified `ai_text_generation_calls_limit` explanation
+3. Token limits don't match code (shows 1M/500K, code has 20K)
+4. Missing video/audio/image editing limits
+5. Missing cost transparency information
+
+### Required Updates
+
+#### Basic Tier Display
+```
+💰 Basic Plan - $29/month ($290/year)
+
+✨ All ALwrity Features Included:
+✅ Blog Writer, Story Writer, Podcast Maker
+✅ Image Generator & Editor
+✅ Video Studio & YouTube Creator
+✅ Audio Generator
+✅ All Social Media Writers
+✅ All SEO Tools & Dashboards
+✅ Content Planning & Strategy Tools
+
+📊 Usage Limits:
+• 50 AI Text Generations/month (unified across all LLM providers)
+• 100,000 tokens per provider (Gemini, OpenAI, Anthropic, Mistral)
+• 200 Research Searches/month (Tavily, Serper)
+• 500 Neural Searches/month (Exa)
+• 30 AI Images/month
+• 30 Image Edits/month
+• 20 AI Videos/month
+• 50 AI Audio Generations/month
+• $50 Monthly Cost Cap (protects you from overages)
+
+💡 Perfect for: Individuals, content creators, small teams
+```
+
+---
+
+## Implementation Checklist
+
+### Phase 1: Update Code Limits
+- [ ] Update `pricing_service.py` Basic tier limits:
+  - [ ] `ai_text_generation_calls_limit`: 10 → 50
+  - [ ] `gemini_tokens_limit`: 20,000 → 100,000
+  - [ ] `openai_tokens_limit`: 20,000 → 100,000
+  - [ ] `anthropic_tokens_limit`: 20,000 → 100,000
+  - [ ] `mistral_tokens_limit`: 20,000 → 100,000
+  - [ ] `stability_calls_limit`: 5 → 30
+- [ ] Run database migration script
+- [ ] Test limit enforcement
+
+### Phase 2: Update Pricing Page
+- [ ] Update `docs-site/docs/features/subscription/pricing.md`
+- [ ] Update frontend pricing page component
+- [ ] Add cost transparency section
+- [ ] Add tool usage examples
+- [ ] Add FAQ section
+
+### Phase 3: Update Documentation
+- [ ] Update subscription rule file (`.cursor/rules/subscription.mdc`)
+- [ ] Update API documentation
+- [ ] Create user-facing pricing guide
+
+### Phase 4: Testing
+- [ ] Test all tools with new limits
+- [ ] Verify cost calculations
+- [ ] Test limit enforcement
+- [ ] Test cost cap enforcement
+- [ ] Verify pre-flight validation
+
+---
+
+## Cost Calculation Examples
+
+### Example 1: Blog Writer - 1 Blog Post (OSS Models)
+```
+Research: 3 Exa searches = $0.015
+Outline: 1 LLM call (Gemini Flash) = $0.0056
+Content: 2 LLM calls (Gemini Flash) = $0.0112
+Image: 1 Qwen Image OSS = $0.03 (vs $0.04 Stability)
+Total: ~$0.06 per blog post (saved $0.01 with OSS)
+```
+
+### Example 2: Story Writer - 1 Story (5 scenes, OSS Models)
+```
+Outline: 1 LLM call = $0.0056
+Script: 1 LLM call = $0.0056
+Images: 5 × $0.03 (Qwen Image OSS) = $0.15 (vs $0.20)
+Audio: 5 × $0.10 = $0.50
+Videos: 5 × $0.25 (WAN 2.5 OSS) = $1.25 (vs $0.50-$2.10)
+Total: ~$1.96 per story (higher video cost, but better quality)
+```
+
+### Example 3: Podcast Maker - 1 Episode (10 min, 5 scenes, OSS Models)
+```
+Research: 3 Exa searches = $0.015
+Script: 1 LLM call = $0.0056
+Outline: 1 LLM call = $0.0056
+Audio: 5 × $0.20 (Minimax Speech OSS) = $1.00
+Video (optional): 5 × $0.25 (WAN 2.5 OSS) = $1.25
+Total: ~$1.03 per podcast (without video)
+Total: ~$2.28 per podcast (with video, OSS models)
+```
+
+### Example 4: Social Media - 10 Posts (OSS Models)
+```
+Generation: 10 × 1 LLM call = 10 calls × $0.0056 = $0.056
+Images: 10 × $0.03 (Qwen Image OSS) = $0.30 (vs $0.40)
+Total: ~$0.36 for 10 posts (saved $0.10 with OSS)
+```
+
+---
+
+## Competitive Analysis
+
+### Similar AI Content Platforms
+
+| Platform | Price | Limits | Notes |
+|----------|-------|--------|-------|
+| **Jasper** | $49/month | 50K words | Text-focused |
+| **Copy.ai** | $49/month | Unlimited words | Text-focused |
+| **Writesonic** | $19/month | 100K words | Text-focused |
+| **ALwrity Basic** | $29/month | 50 LLM calls + media | **Full platform** |
+
+**ALwrity Advantage**: 
+- Lower price point ($29 vs $49)
+- Includes video, image, audio generation (competitors don't)
+- Comprehensive tool suite (not just text)
+- Better value proposition
+
+---
+
+## Recommendations Summary
+
+### ✅ **APPROVED: Production-Ready Basic Tier (OSS-Focused)**
+
+**Price**: $29/month ($290/year) - **KEEP**
+
+**Key Changes** (OSS-Focused):
+1. ✅ **Increase AI Text Generation**: 10 → **50 calls/month**
+2. ✅ **Increase Token Limits**: 20K → **100K per provider**
+3. ✅ **Increase Image Generation**: 5 → **50 images/month** (OSS: Qwen Image $0.03)
+4. ✅ **Increase Image Editing**: 30 → **50 edits/month** (OSS: Qwen Edit $0.02)
+5. ✅ **Increase Video Generation**: 20 → **30 videos/month** (OSS: WAN 2.5 $0.25)
+6. ✅ **Increase Audio Generation**: 50 → **100 generations/month** (OSS: Minimax Speech)
+7. ✅ **Adjust Cost Cap**: $50 → **$45** (aligns with $40-50 target)
+8. ✅ **Default to OSS Models**: Qwen, WAN 2.5, Minimax Speech (cost-efficient)
+
+**Expected Outcomes**:
+- Users can experience all ALwrity features with generous limits
+- Sustainable cost structure (~$20-35/user/month average with OSS models)
+- Competitive pricing ($29 vs competitors $49+)
+- Room for margin ($4-17/user/month average)
+- Cost cap ($45) protects against losses (hard limit $40-50)
+- **OSS models provide 25-50% cost savings** vs proprietary alternatives
+
+**Risk Level**: 🟢 **LOW** (with cost cap enforcement and OSS model defaults)
+
+---
+
+## Implementation Plan
+
+### Phase 1: Update Pricing Service & Database (Priority: HIGH)
+
+#### 1.1 Update `pricing_service.py` Basic Tier Limits
+**File**: `backend/services/subscription/pricing_service.py`
+
+**Changes Required**:
+```python
+# In initialize_default_plans() method
+{
+    "name": "Basic",
+    "tier": SubscriptionTier.BASIC,
+    "price_monthly": 29.0,
+    "price_yearly": 290.0,
+    
+    # AI Text Generation (Unified Limit)
+    "ai_text_generation_calls_limit": 50,  # Changed from 10
+    
+    # Token Limits (Per Provider)
+    "gemini_tokens_limit": 100000,  # Changed from 20,000
+    "openai_tokens_limit": 100000,  # Changed from 20,000
+    "anthropic_tokens_limit": 100000,  # Changed from 20,000
+    "mistral_tokens_limit": 100000,  # Changed from 20,000
+    
+    # Search APIs (Keep existing)
+    "tavily_calls_limit": 200,
+    "serper_calls_limit": 200,
+    "metaphor_calls_limit": 100,
+    "firecrawl_calls_limit": 100,
+    "exa_calls_limit": 500,
+    
+    # Media Generation (OSS Models via WaveSpeed)
+    "stability_calls_limit": 50,  # Changed from 5 (now includes WaveSpeed OSS)
+    "image_edit_calls_limit": 50,  # Changed from 30
+    "video_calls_limit": 30,  # Changed from 20
+    "audio_calls_limit": 100,  # Changed from 50
+    
+    # Cost Protection
+    "monthly_cost_limit": 45.0,  # Changed from 50.0
+}
+```
+
+**Action Items**:
+- [ ] Update `initialize_default_plans()` method in `pricing_service.py`
+- [ ] Run database migration to update existing Basic tier subscriptions
+- [ ] Test limit enforcement with new values
+- [ ] Verify cost calculations reflect OSS model pricing
+
+#### 1.2 Update WaveSpeed Model Pricing in `pricing_service.py`
+**File**: `backend/services/subscription/pricing_service.py`
+
+**Changes Required**:
+```python
+# In initialize_default_pricing() method, update/add WaveSpeed OSS model pricing:
+
+# Image Generation (OSS Models via WaveSpeed)
+{
+    "provider": APIProvider.IMAGE,
+    "model_name": "qwen-image",
+    "cost_per_request": 0.03,  # OSS model via WaveSpeed
+    "description": "WaveSpeed Qwen Image (OSS) - Fast generation"
+},
+{
+    "provider": APIProvider.IMAGE,
+    "model_name": "ideogram-v3-turbo",
+    "cost_per_request": 0.05,  # OSS model via WaveSpeed
+    "description": "WaveSpeed Ideogram V3 Turbo (OSS) - Photorealistic"
+},
+
+# Image Editing (OSS Models via WaveSpeed)
+{
+    "provider": APIProvider.IMAGE_EDIT,
+    "model_name": "qwen-edit",
+    "cost_per_request": 0.02,  # OSS model via WaveSpeed
+    "description": "WaveSpeed Qwen Image Edit (OSS) - Budget editing"
+},
+{
+    "provider": APIProvider.IMAGE_EDIT,
+    "model_name": "qwen-edit-plus",
+    "cost_per_request": 0.02,  # OSS model via WaveSpeed
+    "description": "WaveSpeed Qwen Image Edit Plus (OSS) - Multi-image"
+},
+{
+    "provider": APIProvider.IMAGE_EDIT,
+    "model_name": "flux-kontext-pro",
+    "cost_per_request": 0.04,  # OSS model via WaveSpeed
+    "description": "WaveSpeed FLUX Kontext Pro (OSS) - Professional"
+},
+
+# Video Generation (OSS Models via WaveSpeed)
+{
+    "provider": APIProvider.VIDEO,
+    "model_name": "wan-2.5",
+    "cost_per_request": 0.25,  # OSS model via WaveSpeed (~5 seconds)
+    "description": "WaveSpeed WAN 2.5 (OSS) - Text-to-Video, Image-to-Video"
+},
+{
+    "provider": APIProvider.VIDEO,
+    "model_name": "seedance-1.5-pro",
+    "cost_per_request": 0.40,  # OSS model via WaveSpeed (~5 seconds)
+    "description": "WaveSpeed Seedance 1.5 Pro (OSS) - Longer duration"
+},
+
+# Audio Generation (OSS Models via WaveSpeed)
+{
+    "provider": APIProvider.AUDIO,
+    "model_name": "minimax-speech-02-hd",
+    "cost_per_input_token": 0.00005,  # $0.05 per 1K chars
+    "cost_per_output_token": 0.0,
+    "cost_per_request": 0.0,
+    "description": "WaveSpeed Minimax Speech 02 HD (OSS) - High-quality TTS"
+},
+```
+
+**Action Items**:
+- [ ] Add WaveSpeed OSS model pricing entries
+- [ ] Update default model selection logic to prefer OSS models
+- [ ] Test cost calculation with OSS models
+- [ ] Verify pricing accuracy against WaveSpeed API documentation
+
+#### 1.3 Update Default Model Selection Logic
+**Files**: 
+- `backend/services/llm_providers/main_image_generation.py`
+- `backend/services/image_studio/create_service.py`
+- `backend/services/image_studio/edit_service.py`
+- `backend/services/video_studio/video_service.py`
+- `backend/services/audio_generation/audio_service.py`
+
+**Changes Required**:
+- Default image generation to `qwen-image` (OSS) instead of Stability
+- Default image editing to `qwen-edit` (OSS) instead of Stability
+- Default video generation to `wan-2.5` (OSS) instead of HuggingFace
+- Default audio generation to `minimax-speech-02-hd` (OSS)
+
+**Action Items**:
+- [ ] Update `get_default_provider()` methods to prefer WaveSpeed OSS models
+- [ ] Update model selection UI to show OSS models as default/recommended
+- [ ] Add cost comparison tooltips showing OSS model savings
+- [ ] Test all tools with OSS model defaults
+
+### Phase 2: Update Frontend & Documentation (Priority: HIGH)
+
+#### 2.1 Update Pricing Page
+**File**: `docs-site/docs/features/subscription/pricing.md`
+
+**Changes Required**:
+- Update Basic tier limits to reflect new values (50 images, 50 edits, 30 videos, 100 audio)
+- Add OSS model information and cost savings messaging
+- Update cost examples to use OSS model pricing
+- Add FAQ about OSS models and cost efficiency
+
+**Action Items**:
+- [ ] Update pricing page markdown
+- [ ] Update frontend pricing component (if exists)
+- [ ] Add OSS model badges/indicators
+- [ ] Add cost comparison table (OSS vs proprietary)
+
+#### 2.2 Update Subscription Context & Components
+**Files**:
+- `frontend/src/contexts/SubscriptionContext.tsx`
+- `frontend/src/components/billing/EnhancedBillingDashboard.tsx`
+- `frontend/src/components/shared/UsageDashboard.tsx`
+
+**Changes Required**:
+- Display OSS model indicators in usage dashboard
+- Show cost savings from using OSS models
+- Update limit displays to show new Basic tier limits
+- Add tooltips explaining OSS model benefits
+
+**Action Items**:
+- [ ] Update limit displays in billing dashboard
+- [ ] Add OSS model indicators in cost breakdown
+- [ ] Update usage statistics to reflect new limits
+- [ ] Test UI with new limit values
+
+### Phase 3: Testing & Validation (Priority: CRITICAL)
+
+#### 3.1 Limit Enforcement Testing
+**Test Cases**:
+- [ ] Test 50 AI text generation calls limit
+- [ ] Test 50 image generation limit (OSS models)
+- [ ] Test 50 image editing limit (OSS models)
+- [ ] Test 30 video generation limit (OSS models)
+- [ ] Test 100 audio generation limit (OSS models)
+- [ ] Test $45 monthly cost cap enforcement
+- [ ] Test pre-flight validation with new limits
+- [ ] Test limit exceeded error messages
+
+#### 3.2 Cost Calculation Testing
+**Test Cases**:
+- [ ] Verify Qwen Image cost: $0.03 per image
+- [ ] Verify Qwen Edit cost: $0.02 per edit
+- [ ] Verify WAN 2.5 video cost: $0.25 per video
+- [ ] Verify Minimax Speech cost: $0.05 per 1K chars
+- [ ] Test cost aggregation across all operations
+- [ ] Test cost cap enforcement at $45
+- [ ] Verify cost display in billing dashboard
+
+#### 3.3 OSS Model Integration Testing
+**Test Cases**:
+- [ ] Test Qwen Image generation via WaveSpeed
+- [ ] Test Qwen Edit editing via WaveSpeed
+- [ ] Test WAN 2.5 video generation via WaveSpeed
+- [ ] Test Minimax Speech audio generation via WaveSpeed
+- [ ] Verify default model selection uses OSS models
+- [ ] Test model fallback if OSS model unavailable
+- [ ] Verify cost tracking for OSS models
+
+### Phase 4: Database Migration (Priority: HIGH)
+
+#### 4.1 Create Migration Script
+**File**: `backend/database/migrations/update_basic_tier_limits_oss.py`
+
+**Script Requirements**:
+```python
+"""
+Migration: Update Basic Tier Limits for OSS-Focused Pricing Strategy
+- Increase AI text generation: 10 → 50
+- Increase token limits: 20K → 100K per provider
+- Increase image generation: 5 → 50
+- Increase image editing: 30 → 50
+- Increase video generation: 20 → 30
+- Increase audio generation: 50 → 100
+- Adjust cost cap: $50 → $45
+"""
+
+def upgrade():
+    # Update SubscriptionPlan for Basic tier
+    # Update existing UserSubscription records
+    # Clear pricing service cache
+    pass
+
+def downgrade():
+    # Revert to previous limits if needed
+    pass
+```
+
+**Action Items**:
+- [ ] Create migration script
+- [ ] Test migration on staging database
+- [ ] Backup production database before migration
+- [ ] Run migration during maintenance window
+- [ ] Verify all subscriptions updated correctly
+
+### Phase 5: Monitoring & Adjustment (Priority: MEDIUM)
+
+#### 5.1 Set Up Monitoring
+**Metrics to Track**:
+- Average cost per user per month
+- Users hitting $45 cost cap
+- Users hitting individual limits
+- OSS model usage vs proprietary model usage
+- Cost savings from OSS models
+
+**Action Items**:
+- [ ] Set up cost monitoring dashboard
+- [ ] Create alerts for cost cap breaches
+- [ ] Track OSS model adoption rate
+- [ ] Monitor user satisfaction with limits
+
+#### 5.2 Adjustment Plan
+**Triggers for Adjustment**:
+- If average cost > $35/user: Consider reducing limits
+- If >15% users hit cost cap: Consider increasing cost cap to $50
+- If <20% users use video/audio: Consider reducing those limits
+- If OSS models unavailable: Fallback to proprietary models
+
+**Action Items**:
+- [ ] Define adjustment criteria
+- [ ] Create adjustment workflow
+- [ ] Plan communication strategy for limit changes
+
+---
+
+## Next Steps (Priority Order)
+
+1. **CRITICAL**: Update `pricing_service.py` with new Basic tier limits
+2. **CRITICAL**: Add WaveSpeed OSS model pricing to `pricing_service.py`
+3. **HIGH**: Update default model selection to prefer OSS models
+4. **HIGH**: Create and run database migration
+5. **HIGH**: Update pricing page documentation
+6. **HIGH**: Test limit enforcement and cost calculations
+7. **MEDIUM**: Update frontend components with new limits
+8. **MEDIUM**: Set up monitoring and alerts
+9. **LOW**: Add OSS model indicators to UI
+
+---
+
+## Monitoring & Adjustment Plan
+
+### Key Metrics to Track
+- Average LLM calls per user per month
+- Average media generation per user per month
+- Average cost per user per month
+- Users hitting cost cap
+- Users hitting individual limits
+
+### Adjustment Triggers
+- **If average cost > $25/user**: Consider reducing limits
+- **If >20% users hit cost cap**: Consider increasing cost cap
+- **If <10% users use video/audio**: Consider reducing those limits
+- **If churn rate high**: Consider increasing limits
+
+### Review Schedule
+- **Week 1-2**: Daily monitoring
+- **Month 1**: Weekly review
+- **Month 2-3**: Bi-weekly review
+- **Month 4+**: Monthly review
+
+---
+
+## Conclusion
+
+The proposed Basic tier limits (OSS-Focused) provide:
+- ✅ **Access to all ALwrity features** with generous limits
+- ✅ **Sustainable cost structure** using OSS models (25-50% savings)
+- ✅ **Competitive pricing** ($29 vs competitors $49+)
+- ✅ **Protection against losses** ($45 cost cap, hard limit $40-50)
+- ✅ **Room for growth** (can adjust based on usage)
+- ✅ **OSS-first strategy** (Qwen, FLUX, Ideogram, WAN 2.5, Minimax Speech)
+- ✅ **Maximum user value** while staying within cost constraints
+
+**Key Advantages of OSS-Focused Strategy**:
+1. **Cost Efficiency**: 25-50% cost savings vs proprietary models
+2. **Better Limits**: Can offer more generations due to lower costs
+3. **User Value**: More value for the same $29/month price
+4. **Sustainability**: Lower costs = better margins = sustainable business
+5. **Flexibility**: Can adjust limits based on actual usage patterns
+
+**Recommendation**: **APPROVE** for production launch with OSS-focused strategy.
+
+**Confidence Level**: 🟢 **HIGH** (with proper monitoring, cost cap enforcement, and OSS model defaults)
+
+**Risk Mitigation**:
+- $45 cost cap protects against losses (hard limit $40-50)
+- OSS model defaults ensure cost efficiency
+- Monitoring allows quick adjustment if needed
+- Realistic usage assumptions (60-70% of limits)