# Production Pricing Strategy - Basic Tier Launch (OSS-Focused) ## Executive Summary This document provides a comprehensive pricing strategy for ALwrity's production launch with **Basic Tier only**. All features and tools will be accessible to Basic tier users, requiring careful cost calculation and limit setting to ensure sustainability while providing value. **Critical Goals**: 1. **OSS-First Strategy**: Prioritize Open-Source AI models (WaveSpeed OSS models) for cost efficiency 2. **Hard Cost Cap**: $40-50 per user per month maximum (protects against losses) 3. **Maximum User Value**: Provide generous limits while staying within cost constraints 4. **Fair Pricing**: Balance between sustainability and user value (not excessive profit margins) **Strategy**: Use WaveSpeed's OSS models (Qwen, FLUX, Ideogram, WAN 2.5) which offer better pricing than proprietary alternatives, allowing us to provide more value to users while maintaining profitability. --- ## Current State Analysis ### Current Basic Tier (Code Implementation) **Price**: $29/month ($290/year) **Limits**: - **AI Text Generation**: 10 unified calls/month (across all LLM providers) - **Tokens**: 20,000 per provider (Gemini, OpenAI, Anthropic, Mistral) - **Search APIs**: 200 Tavily, 200 Serper, 100 Metaphor, 100 Firecrawl, 500 Exa - **Image Generation**: 5 Stability AI images/month - **Image Editing**: 30 AI image edits/month - **Video Generation**: 20 videos/month - **Audio Generation**: 50 TTS generations/month - **Monthly Cost Cap**: $50.00 **Problem**: 10 unified AI text generation calls is **too restrictive** for production launch where users need to experience all features. --- ## ALwrity Tools & Content Generation Analysis ### Content Generation Tools #### 1. **Text Generation Tools** (Primary LLM Usage) | Tool | API Calls per Generation | Typical Usage | Cost per Generation | |------|--------------------------|---------------|---------------------| | **Blog Writer** | 3-5 calls | 1 blog = research (1) + outline (1) + content (1-3) | $0.01 - $0.05 | | **Story Writer** | 2-3 calls | 1 story = outline (1) + script (1-2) | $0.01 - $0.03 | | **Podcast Maker** | 3-4 calls | 1 podcast = research (1) + script (1) + outline (1-2) | $0.01 - $0.04 | | **Facebook Writer** | 1-2 calls | 1 post = generation (1) + optional optimization (1) | $0.005 - $0.01 | | **LinkedIn Writer** | 1-2 calls | 1 post = generation (1) + optional optimization (1) | $0.005 - $0.01 | | **SEO Tools** | 1-3 calls | Varies by tool complexity | $0.005 - $0.02 | | **Content Planning** | 2-4 calls | Strategy generation + analysis | $0.01 - $0.03 | **Average**: ~2-3 LLM calls per content generation workflow #### 2. **Image Generation Tools** | Tool | API Calls | Cost per Generation | |------|-----------|---------------------| | **Image Generator** | 1 Stability call | $0.04 per image | | **Image Editor** | 1 Image Edit call | $0.04 per edit operation | **Current Limit**: 5 images/month (too low for production) #### 3. **Video Generation Tools** | Tool | API Calls | Cost per Video | Notes | |------|-----------|-----------------|-------| | **Video Studio** | 1 video call | $0.10 - $0.42 | Depends on model/duration | | **YouTube Creator** | 1 video call per scene | $0.10 - $0.42 per scene | 5 scenes = $0.50 - $2.10 | | **Story Writer Video** | 1 video call per scene | $0.10 - $0.42 per scene | Variable scenes | | **Podcast Maker Video** | 1 video call per scene | $0.10 - $0.42 per scene | Optional video generation | **Current Limit**: 20 videos/month (reasonable) #### 4. **Audio Generation Tools** | Tool | API Calls | Cost per Generation | Notes | |------|-----------|---------------------|-------| | **Audio Generator** | 1 audio call | $0.05 per 1,000 chars | ~$0.10 - $0.50 per audio | | **Podcast Maker TTS** | 1 audio call per scene | $0.05 per 1,000 chars | Multiple scenes | | **Story Writer Narration** | 1 audio call per scene | $0.05 per 1,000 chars | Multiple scenes | **Current Limit**: 50 audio generations/month (reasonable) --- ## API Cost Breakdown ### LLM Provider Costs (Per 1M Tokens) | Provider | Model | Input Cost | Output Cost | Typical Use | |----------|-------|------------|-------------|-------------| | **Gemini** | 2.5 Flash | $0.30 | $2.50 | Default (cost-effective) | | **Gemini** | 2.5 Pro | $1.25 | $10.00 | Premium quality | | **OpenAI** | GPT-4o Mini | $0.15 | $0.60 | Cost-effective | | **OpenAI** | GPT-4o | $2.50 | $10.00 | Premium quality | | **Anthropic** | Claude 3.5 Sonnet | $3.00 | $15.00 | Premium quality | | **HuggingFace** | GPT-OSS-120B | $1.00 | $3.00 | Alternative option | **Average Cost per LLM Call** (assuming 1K input + 2K output tokens): - Gemini Flash: ~$0.0056 per call - GPT-4o Mini: ~$0.0015 per call - Claude 3.5: ~$0.033 per call **Recommendation**: Use Gemini Flash as default for cost efficiency. ### Search API Costs | Provider | Cost per Search | Typical Usage | |----------|----------------|---------------| | **Tavily** | $0.001 | Research operations | | **Serper** | $0.001 | Research operations | | **Metaphor** | $0.003 | Research operations | | **Exa** | $0.005 | Neural search (premium) | | **Firecrawl** | $0.002 | Web page extraction | **Average**: ~$0.002 per search operation ### Media Generation Costs (OSS-Focused via WaveSpeed) #### **Image Generation** (OSS Models via WaveSpeed) | Model | Cost | Type | Notes | |------|------|------|-------| | **Qwen Image** | $0.03 per image | OSS | Fast generation, cost-effective | | **Ideogram V3 Turbo** | $0.05 per image | OSS | Photorealistic, text rendering | | **Default (Qwen)** | $0.03 per image | OSS | **Recommended for Basic tier** | #### **Image Editing** (OSS Models via WaveSpeed) | Model | Cost | Type | Use Case | |------|------|------|----------| | **Qwen Image Edit** | $0.02 per edit | OSS | Budget editing, bilingual | | **Qwen Image Edit Plus** | $0.02 per edit | OSS | Multi-image editing | | **FLUX Kontext Pro** | $0.04 per edit | OSS | Typography, professional | | **Default (Qwen Edit)** | $0.02 per edit | OSS | **Recommended for Basic tier** | #### **Video Generation** (OSS Models via WaveSpeed) | Model | Cost | Type | Duration | Notes | |------|------|------|----------|-------| | **WAN 2.5** | $0.05/sec | OSS | 5-15 sec | Text-to-Video, Image-to-Video | | **Seedance 1.5 Pro** | $0.08/sec | OSS | 10-30 sec | Longer duration | | **Kling v2.5 Turbo (5s)** | $0.21 per video | OSS | 5 sec | Image-to-Video | | **Kling v2.5 Turbo (10s)** | $0.42 per video | OSS | 10 sec | Extended duration | | **Default (WAN 2.5)** | $0.25 per video | OSS | ~5 sec | **Recommended for Basic tier** | #### **Audio Generation** (OSS Models via WaveSpeed) | Model | Cost | Type | Notes | |------|------|------|-------| | **Minimax Speech 02 HD** | $0.05 per 1K chars | OSS | High-quality TTS | | **Default** | $0.05 per 1K chars | OSS | ~$0.10-0.50 per audio | #### **Face Swap & Specialized** (OSS Models via WaveSpeed) | Operation | Cost | Type | Notes | |-----------|------|------|-------| | **Face Swap** | $0.01-$0.03 | OSS | Basic to premium quality | | **Image Upscaling** | $0.01-$0.06 | OSS | 2K/4K/8K options | | **3D Generation** | $0.02-$0.30 | OSS | Budget to premium | **OSS Advantage**: WaveSpeed provides access to OSS models (Qwen, FLUX, Ideogram, WAN 2.5) at significantly lower costs than proprietary alternatives, enabling better value for users. --- ## Production-Ready Basic Tier Proposal ### Revised Limits for Production Launch **Price**: $29/month ($290/year) - **KEEP CURRENT PRICING** **Rationale**: Competitive pricing point, allows for sustainable margins with proper limits. ### Proposed Limits #### 1. **AI Text Generation** (Unified Limit) - **Current**: 10 calls/month ❌ **TOO LOW** - **Proposed**: **50 calls/month** ✅ - **Rationale**: - Allows ~16-25 content generations/month (assuming 2-3 calls each) - Enables users to experience Blog Writer, Story Writer, Podcast Maker, Social Writers - Sustainable cost: ~$0.28/month (50 calls × $0.0056 average) #### 2. **Token Limits** (Per Provider) - **Current**: 20,000 tokens/provider - **Proposed**: **100,000 tokens/provider** ✅ - **Rationale**: - Allows ~33-50 LLM calls per provider (assuming 2K tokens/call) - Provides buffer for longer content generation - Aligns with unified call limit (50 calls × 2K tokens = 100K tokens) #### 3. **Search APIs** - **Tavily**: 200 calls/month ✅ (Keep) - **Serper**: 200 calls/month ✅ (Keep) - **Metaphor**: 100 calls/month ✅ (Keep) - **Firecrawl**: 100 calls/month ✅ (Keep) - **Exa**: 500 calls/month ✅ (Keep) - **Rationale**: Sufficient for research-heavy tools (Blog Writer, Podcast Maker, SEO tools) #### 4. **Image Generation** (OSS Models via WaveSpeed) - **Current**: 5 images/month ❌ **TOO LOW** - **Proposed**: **50 images/month** ✅ (INCREASED - OSS models are cheaper) - **Rationale**: - OSS models (Qwen Image $0.03) are cheaper than Stability ($0.04) - Allows users to generate images for Story Writer, Blog Writer, Social Media - Cost: ~$1.50/month (50 × $0.03 using Qwen Image OSS model) - Enables visual content creation workflows - **Default to Qwen Image OSS model** for cost efficiency #### 5. **Image Editing** (OSS Models via WaveSpeed) - **Current**: 30 edits/month - **Proposed**: **50 edits/month** ✅ (INCREASED - OSS models are cheaper) - **Rationale**: - OSS models (Qwen Edit $0.02) are cheaper than Stability ($0.04) - Cost: ~$1.00/month (50 × $0.02 using Qwen Edit OSS model) - Sufficient for image optimization workflows - **Default to Qwen Edit OSS model** for cost efficiency #### 6. **Video Generation** (OSS Models via WaveSpeed) - **Current**: 20 videos/month - **Proposed**: **30 videos/month** ✅ (INCREASED - OSS models available) - **Rationale**: - OSS models (WAN 2.5 $0.25 per 5s video) provide good value - Allows ~6-10 full video projects/month (assuming 3-5 scenes each) - Cost: ~$7.50/month (30 × $0.25 using WAN 2.5 OSS model) - Enables Video Studio, YouTube Creator, Story Writer video features - **Default to WAN 2.5 OSS model** for cost efficiency #### 7. **Audio Generation** (OSS Models via WaveSpeed) - **Current**: 50 generations/month - **Proposed**: **100 generations/month** ✅ (INCREASED - OSS models are affordable) - **Rationale**: - OSS models (Minimax Speech 02 HD) provide high quality at $0.05/1K chars - Sufficient for Podcast Maker, Story Writer narration - Cost: ~$10.00-$25.00/month (depending on length, assuming 2K-5K chars per audio) - Enables audio content workflows - **Default to Minimax Speech 02 HD OSS model** #### 8. **Monthly Cost Cap** - **Current**: $50.00 - **Proposed**: **$45.00** ✅ (ADJUSTED - aligns with $40-50 target) - **Rationale**: - Protects against unexpected high usage - Allows flexibility within limits - Provides safety margin - Aligns with $40-50 hard limit requirement --- ## Cost Analysis: Proposed Basic Tier (OSS-Focused) ### Monthly Cost Breakdown (Per User) - Using OSS Models | Category | Usage | Cost per Unit (OSS) | Monthly Cost | |----------|-------|---------------------|--------------| | **LLM Calls** | 50 calls | $0.0056 avg (Gemini Flash) | **$0.28** | | **Search APIs** | 200 searches | $0.002 avg | **$0.40** | | **Image Generation** | 50 images | $0.03 (Qwen Image OSS) | **$1.50** | | **Image Editing** | 50 edits | $0.02 (Qwen Edit OSS) | **$1.00** | | **Video Generation** | 30 videos | $0.25 (WAN 2.5 OSS, ~5s) | **$7.50** | | **Audio Generation** | 100 audios | $0.10-$0.50 avg | **$10.00-$25.00** | | **Total Variable Cost** | | | **$20.68-$35.68** | ### Margin Analysis (OSS-Focused) **Subscription Revenue**: $29.00/month **Variable Costs (OSS Models)**: $20.68-$35.68/month (depending on usage) **Gross Margin**: **-$6.68 to +$8.32/month** **✅ IMPROVEMENT**: OSS models reduce costs significantly: - Image generation: $0.03 vs $0.04 (25% savings) - Image editing: $0.02 vs $0.04 (50% savings) - Video generation: $0.25 vs $0.42 (40% savings) **Mitigation Strategy**: 1. **Cost cap enforcement**: Monthly cost cap of $45 prevents extreme losses 2. **OSS model defaults**: Default to cheaper OSS models (Qwen, WAN 2.5) 3. **Realistic usage**: Most users won't hit all limits simultaneously 4. **Average usage assumption**: ~60-70% of limits = $12-25 cost = $4-17 margin 5. **Hard limit protection**: $45 cap ensures we never exceed $50/user/month --- ## Revised Basic Tier Limits (Production-Ready, OSS-Focused) ```python { "name": "Basic", "tier": SubscriptionTier.BASIC, "price_monthly": 29.0, "price_yearly": 290.0, # AI Text Generation (Unified Limit) "ai_text_generation_calls_limit": 50, # INCREASED from 10 # Token Limits (Per Provider) "gemini_tokens_limit": 100000, # INCREASED from 20,000 "openai_tokens_limit": 100000, # INCREASED from 20,000 "anthropic_tokens_limit": 100000, # INCREASED from 20,000 "mistral_tokens_limit": 100000, # INCREASED from 20,000 # Search APIs "tavily_calls_limit": 200, # Keep "serper_calls_limit": 200, # Keep "metaphor_calls_limit": 100, # Keep "firecrawl_calls_limit": 100, # Keep "exa_calls_limit": 500, # Keep # Media Generation (OSS Models via WaveSpeed) "stability_calls_limit": 50, # INCREASED from 5 (using Qwen Image OSS $0.03) "image_edit_calls_limit": 50, # INCREASED from 30 (using Qwen Edit OSS $0.02) "video_calls_limit": 30, # INCREASED from 20 (using WAN 2.5 OSS $0.25) "audio_calls_limit": 100, # INCREASED from 50 (using Minimax Speech OSS) # Cost Protection "monthly_cost_limit": 45.0, # ADJUSTED from 50.0 (aligns with $40-50 target) # OSS Model Defaults "default_image_model": "qwen-image", # OSS model via WaveSpeed "default_image_edit_model": "qwen-edit", # OSS model via WaveSpeed "default_video_model": "wan-2.5", # OSS model via WaveSpeed "default_audio_model": "minimax-speech-02-hd", # OSS model via WaveSpeed # Features "features": [ "full_content_generation", "advanced_research", "basic_analytics", "all_tools_access", # All ALwrity tools accessible "billing_dashboard", "usage_tracking", "oss_models_priority" # NEW: OSS models prioritized for cost efficiency ], "description": "Perfect for individuals and small teams. Access all ALwrity features with generous limits powered by OSS AI models." } ``` --- ## Tool Usage Scenarios & Limits ### Scenario 1: Blog Writer User - **Workflow**: 1 blog post = 3-5 LLM calls + 3-5 search calls + 1-2 images - **Monthly Capacity**: ~10-16 blog posts (with 50 LLM calls) - **Cost**: ~$0.50-$1.00 per blog post - **Status**: ✅ **FEASIBLE** ### Scenario 2: Story Writer User - **Workflow**: 1 story = 2-3 LLM calls + 5-10 images + 5-10 audio + 5-10 videos - **Monthly Capacity**: ~16-25 stories (LLM limit) OR ~3-6 stories (image/video limits) - **Cost**: ~$2.00-$5.00 per story - **Status**: ✅ **FEASIBLE** (limited by media, not LLM) ### Scenario 3: Podcast Maker User - **Workflow**: 1 podcast = 3-4 LLM calls + 3-5 search calls + 5-10 audio + optional 5-10 videos - **Monthly Capacity**: ~12-16 podcasts (LLM limit) OR ~5-10 podcasts (audio limit) - **Cost**: ~$1.00-$3.00 per podcast (without video) - **Status**: ✅ **FEASIBLE** ### Scenario 4: Social Media Content Creator - **Workflow**: 1 post = 1-2 LLM calls + 1 image (optional) - **Monthly Capacity**: ~25-50 posts (LLM limit) OR ~30 posts (image limit) - **Cost**: ~$0.10-$0.15 per post - **Status**: ✅ **FEASIBLE** ### Scenario 5: Video Creator (YouTube Creator) - **Workflow**: 1 video = 2-3 LLM calls + 5 scenes × (1 image + 1 audio + 1 video) - **Monthly Capacity**: ~4-5 full videos (video limit) OR ~16-25 videos (LLM limit) - **Cost**: ~$3.00-$5.00 per video - **Status**: ✅ **FEASIBLE** (limited by video limit, not LLM) --- ## Risk Mitigation Strategies ### 1. **Cost Cap Enforcement** - **Monthly cost cap**: $50.00 (hard limit) - **Behavior**: When cap reached, all API calls blocked until next billing period - **Protection**: Prevents losses from extreme usage ### 2. **Pre-flight Validation** - **Implementation**: Already in place - **Function**: Validates limits BEFORE making API calls - **Benefit**: Prevents wasted API calls on operations that would fail ### 3. **Usage Monitoring & Alerts** - **80% Warning**: Alert users at 80% of limits - **100% Block**: Block operations at 100% of limits - **Dashboard**: Real-time usage tracking ### 4. **Optimized Default Models** - **Strategy**: Use cost-effective models by default (Gemini Flash, GPT-4o Mini) - **Benefit**: Reduces costs while maintaining quality - **User Control**: Allow model selection for power users ### 5. **Efficient API Usage** - **Batching**: Batch multiple operations where possible - **Caching**: Cache research results and common queries - **Optimization**: Continue optimizing tool workflows to reduce API calls --- ## Pricing Page Updates Required ### Current Issues 1. Pricing page shows outdated limits 2. Missing unified `ai_text_generation_calls_limit` explanation 3. Token limits don't match code (shows 1M/500K, code has 20K) 4. Missing video/audio/image editing limits 5. Missing cost transparency information ### Required Updates #### Basic Tier Display ``` 💰 Basic Plan - $29/month ($290/year) ✨ All ALwrity Features Included: ✅ Blog Writer, Story Writer, Podcast Maker ✅ Image Generator & Editor ✅ Video Studio & YouTube Creator ✅ Audio Generator ✅ All Social Media Writers ✅ All SEO Tools & Dashboards ✅ Content Planning & Strategy Tools 📊 Usage Limits: • 50 AI Text Generations/month (unified across all LLM providers) • 100,000 tokens per provider (Gemini, OpenAI, Anthropic, Mistral) • 200 Research Searches/month (Tavily, Serper) • 500 Neural Searches/month (Exa) • 30 AI Images/month • 30 Image Edits/month • 20 AI Videos/month • 50 AI Audio Generations/month • $50 Monthly Cost Cap (protects you from overages) 💡 Perfect for: Individuals, content creators, small teams ``` --- ## Implementation Checklist ### Phase 1: Update Code Limits - [ ] Update `pricing_service.py` Basic tier limits: - [ ] `ai_text_generation_calls_limit`: 10 → 50 - [ ] `gemini_tokens_limit`: 20,000 → 100,000 - [ ] `openai_tokens_limit`: 20,000 → 100,000 - [ ] `anthropic_tokens_limit`: 20,000 → 100,000 - [ ] `mistral_tokens_limit`: 20,000 → 100,000 - [ ] `stability_calls_limit`: 5 → 30 - [ ] Run database migration script - [ ] Test limit enforcement ### Phase 2: Update Pricing Page - [ ] Update `docs-site/docs/features/subscription/pricing.md` - [ ] Update frontend pricing page component - [ ] Add cost transparency section - [ ] Add tool usage examples - [ ] Add FAQ section ### Phase 3: Update Documentation - [ ] Update subscription rule file (`.cursor/rules/subscription.mdc`) - [ ] Update API documentation - [ ] Create user-facing pricing guide ### Phase 4: Testing - [ ] Test all tools with new limits - [ ] Verify cost calculations - [ ] Test limit enforcement - [ ] Test cost cap enforcement - [ ] Verify pre-flight validation --- ## Cost Calculation Examples ### Example 1: Blog Writer - 1 Blog Post (OSS Models) ``` Research: 3 Exa searches = $0.015 Outline: 1 LLM call (Gemini Flash) = $0.0056 Content: 2 LLM calls (Gemini Flash) = $0.0112 Image: 1 Qwen Image OSS = $0.03 (vs $0.04 Stability) Total: ~$0.06 per blog post (saved $0.01 with OSS) ``` ### Example 2: Story Writer - 1 Story (5 scenes, OSS Models) ``` Outline: 1 LLM call = $0.0056 Script: 1 LLM call = $0.0056 Images: 5 × $0.03 (Qwen Image OSS) = $0.15 (vs $0.20) Audio: 5 × $0.10 = $0.50 Videos: 5 × $0.25 (WAN 2.5 OSS) = $1.25 (vs $0.50-$2.10) Total: ~$1.96 per story (higher video cost, but better quality) ``` ### Example 3: Podcast Maker - 1 Episode (10 min, 5 scenes, OSS Models) ``` Research: 3 Exa searches = $0.015 Script: 1 LLM call = $0.0056 Outline: 1 LLM call = $0.0056 Audio: 5 × $0.20 (Minimax Speech OSS) = $1.00 Video (optional): 5 × $0.25 (WAN 2.5 OSS) = $1.25 Total: ~$1.03 per podcast (without video) Total: ~$2.28 per podcast (with video, OSS models) ``` ### Example 4: Social Media - 10 Posts (OSS Models) ``` Generation: 10 × 1 LLM call = 10 calls × $0.0056 = $0.056 Images: 10 × $0.03 (Qwen Image OSS) = $0.30 (vs $0.40) Total: ~$0.36 for 10 posts (saved $0.10 with OSS) ``` --- ## Competitive Analysis ### Similar AI Content Platforms | Platform | Price | Limits | Notes | |----------|-------|--------|-------| | **Jasper** | $49/month | 50K words | Text-focused | | **Copy.ai** | $49/month | Unlimited words | Text-focused | | **Writesonic** | $19/month | 100K words | Text-focused | | **ALwrity Basic** | $29/month | 50 LLM calls + media | **Full platform** | **ALwrity Advantage**: - Lower price point ($29 vs $49) - Includes video, image, audio generation (competitors don't) - Comprehensive tool suite (not just text) - Better value proposition --- ## Recommendations Summary ### ✅ **APPROVED: Production-Ready Basic Tier (OSS-Focused)** **Price**: $29/month ($290/year) - **KEEP** **Key Changes** (OSS-Focused): 1. ✅ **Increase AI Text Generation**: 10 → **50 calls/month** 2. ✅ **Increase Token Limits**: 20K → **100K per provider** 3. ✅ **Increase Image Generation**: 5 → **50 images/month** (OSS: Qwen Image $0.03) 4. ✅ **Increase Image Editing**: 30 → **50 edits/month** (OSS: Qwen Edit $0.02) 5. ✅ **Increase Video Generation**: 20 → **30 videos/month** (OSS: WAN 2.5 $0.25) 6. ✅ **Increase Audio Generation**: 50 → **100 generations/month** (OSS: Minimax Speech) 7. ✅ **Adjust Cost Cap**: $50 → **$45** (aligns with $40-50 target) 8. ✅ **Default to OSS Models**: Qwen, WAN 2.5, Minimax Speech (cost-efficient) **Expected Outcomes**: - Users can experience all ALwrity features with generous limits - Sustainable cost structure (~$20-35/user/month average with OSS models) - Competitive pricing ($29 vs competitors $49+) - Room for margin ($4-17/user/month average) - Cost cap ($45) protects against losses (hard limit $40-50) - **OSS models provide 25-50% cost savings** vs proprietary alternatives **Risk Level**: 🟢 **LOW** (with cost cap enforcement and OSS model defaults) --- ## Implementation Plan ### Phase 1: Update Pricing Service & Database (Priority: HIGH) #### 1.1 Update `pricing_service.py` Basic Tier Limits **File**: `backend/services/subscription/pricing_service.py` **Changes Required**: ```python # In initialize_default_plans() method { "name": "Basic", "tier": SubscriptionTier.BASIC, "price_monthly": 29.0, "price_yearly": 290.0, # AI Text Generation (Unified Limit) "ai_text_generation_calls_limit": 50, # Changed from 10 # Token Limits (Per Provider) "gemini_tokens_limit": 100000, # Changed from 20,000 "openai_tokens_limit": 100000, # Changed from 20,000 "anthropic_tokens_limit": 100000, # Changed from 20,000 "mistral_tokens_limit": 100000, # Changed from 20,000 # Search APIs (Keep existing) "tavily_calls_limit": 200, "serper_calls_limit": 200, "metaphor_calls_limit": 100, "firecrawl_calls_limit": 100, "exa_calls_limit": 500, # Media Generation (OSS Models via WaveSpeed) "stability_calls_limit": 50, # Changed from 5 (now includes WaveSpeed OSS) "image_edit_calls_limit": 50, # Changed from 30 "video_calls_limit": 30, # Changed from 20 "audio_calls_limit": 100, # Changed from 50 # Cost Protection "monthly_cost_limit": 45.0, # Changed from 50.0 } ``` **Action Items**: - [ ] Update `initialize_default_plans()` method in `pricing_service.py` - [ ] Run database migration to update existing Basic tier subscriptions - [ ] Test limit enforcement with new values - [ ] Verify cost calculations reflect OSS model pricing #### 1.2 Update WaveSpeed Model Pricing in `pricing_service.py` **File**: `backend/services/subscription/pricing_service.py` **Changes Required**: ```python # In initialize_default_pricing() method, update/add WaveSpeed OSS model pricing: # Image Generation (OSS Models via WaveSpeed) { "provider": APIProvider.IMAGE, "model_name": "qwen-image", "cost_per_request": 0.03, # OSS model via WaveSpeed "description": "WaveSpeed Qwen Image (OSS) - Fast generation" }, { "provider": APIProvider.IMAGE, "model_name": "ideogram-v3-turbo", "cost_per_request": 0.05, # OSS model via WaveSpeed "description": "WaveSpeed Ideogram V3 Turbo (OSS) - Photorealistic" }, # Image Editing (OSS Models via WaveSpeed) { "provider": APIProvider.IMAGE_EDIT, "model_name": "qwen-edit", "cost_per_request": 0.02, # OSS model via WaveSpeed "description": "WaveSpeed Qwen Image Edit (OSS) - Budget editing" }, { "provider": APIProvider.IMAGE_EDIT, "model_name": "qwen-edit-plus", "cost_per_request": 0.02, # OSS model via WaveSpeed "description": "WaveSpeed Qwen Image Edit Plus (OSS) - Multi-image" }, { "provider": APIProvider.IMAGE_EDIT, "model_name": "flux-kontext-pro", "cost_per_request": 0.04, # OSS model via WaveSpeed "description": "WaveSpeed FLUX Kontext Pro (OSS) - Professional" }, # Video Generation (OSS Models via WaveSpeed) { "provider": APIProvider.VIDEO, "model_name": "wan-2.5", "cost_per_request": 0.25, # OSS model via WaveSpeed (~5 seconds) "description": "WaveSpeed WAN 2.5 (OSS) - Text-to-Video, Image-to-Video" }, { "provider": APIProvider.VIDEO, "model_name": "seedance-1.5-pro", "cost_per_request": 0.40, # OSS model via WaveSpeed (~5 seconds) "description": "WaveSpeed Seedance 1.5 Pro (OSS) - Longer duration" }, # Audio Generation (OSS Models via WaveSpeed) { "provider": APIProvider.AUDIO, "model_name": "minimax-speech-02-hd", "cost_per_input_token": 0.00005, # $0.05 per 1K chars "cost_per_output_token": 0.0, "cost_per_request": 0.0, "description": "WaveSpeed Minimax Speech 02 HD (OSS) - High-quality TTS" }, ``` **Action Items**: - [ ] Add WaveSpeed OSS model pricing entries - [ ] Update default model selection logic to prefer OSS models - [ ] Test cost calculation with OSS models - [ ] Verify pricing accuracy against WaveSpeed API documentation #### 1.3 Update Default Model Selection Logic **Files**: - `backend/services/llm_providers/main_image_generation.py` - `backend/services/image_studio/create_service.py` - `backend/services/image_studio/edit_service.py` - `backend/services/video_studio/video_service.py` - `backend/services/audio_generation/audio_service.py` **Changes Required**: - Default image generation to `qwen-image` (OSS) instead of Stability - Default image editing to `qwen-edit` (OSS) instead of Stability - Default video generation to `wan-2.5` (OSS) instead of HuggingFace - Default audio generation to `minimax-speech-02-hd` (OSS) **Action Items**: - [ ] Update `get_default_provider()` methods to prefer WaveSpeed OSS models - [ ] Update model selection UI to show OSS models as default/recommended - [ ] Add cost comparison tooltips showing OSS model savings - [ ] Test all tools with OSS model defaults ### Phase 2: Update Frontend & Documentation (Priority: HIGH) #### 2.1 Update Pricing Page **File**: `docs-site/docs/features/subscription/pricing.md` **Changes Required**: - Update Basic tier limits to reflect new values (50 images, 50 edits, 30 videos, 100 audio) - Add OSS model information and cost savings messaging - Update cost examples to use OSS model pricing - Add FAQ about OSS models and cost efficiency **Action Items**: - [ ] Update pricing page markdown - [ ] Update frontend pricing component (if exists) - [ ] Add OSS model badges/indicators - [ ] Add cost comparison table (OSS vs proprietary) #### 2.2 Update Subscription Context & Components **Files**: - `frontend/src/contexts/SubscriptionContext.tsx` - `frontend/src/components/billing/EnhancedBillingDashboard.tsx` - `frontend/src/components/shared/UsageDashboard.tsx` **Changes Required**: - Display OSS model indicators in usage dashboard - Show cost savings from using OSS models - Update limit displays to show new Basic tier limits - Add tooltips explaining OSS model benefits **Action Items**: - [ ] Update limit displays in billing dashboard - [ ] Add OSS model indicators in cost breakdown - [ ] Update usage statistics to reflect new limits - [ ] Test UI with new limit values ### Phase 3: Testing & Validation (Priority: CRITICAL) #### 3.1 Limit Enforcement Testing **Test Cases**: - [ ] Test 50 AI text generation calls limit - [ ] Test 50 image generation limit (OSS models) - [ ] Test 50 image editing limit (OSS models) - [ ] Test 30 video generation limit (OSS models) - [ ] Test 100 audio generation limit (OSS models) - [ ] Test $45 monthly cost cap enforcement - [ ] Test pre-flight validation with new limits - [ ] Test limit exceeded error messages #### 3.2 Cost Calculation Testing **Test Cases**: - [ ] Verify Qwen Image cost: $0.03 per image - [ ] Verify Qwen Edit cost: $0.02 per edit - [ ] Verify WAN 2.5 video cost: $0.25 per video - [ ] Verify Minimax Speech cost: $0.05 per 1K chars - [ ] Test cost aggregation across all operations - [ ] Test cost cap enforcement at $45 - [ ] Verify cost display in billing dashboard #### 3.3 OSS Model Integration Testing **Test Cases**: - [ ] Test Qwen Image generation via WaveSpeed - [ ] Test Qwen Edit editing via WaveSpeed - [ ] Test WAN 2.5 video generation via WaveSpeed - [ ] Test Minimax Speech audio generation via WaveSpeed - [ ] Verify default model selection uses OSS models - [ ] Test model fallback if OSS model unavailable - [ ] Verify cost tracking for OSS models ### Phase 4: Database Migration (Priority: HIGH) #### 4.1 Create Migration Script **File**: `backend/database/migrations/update_basic_tier_limits_oss.py` **Script Requirements**: ```python """ Migration: Update Basic Tier Limits for OSS-Focused Pricing Strategy - Increase AI text generation: 10 → 50 - Increase token limits: 20K → 100K per provider - Increase image generation: 5 → 50 - Increase image editing: 30 → 50 - Increase video generation: 20 → 30 - Increase audio generation: 50 → 100 - Adjust cost cap: $50 → $45 """ def upgrade(): # Update SubscriptionPlan for Basic tier # Update existing UserSubscription records # Clear pricing service cache pass def downgrade(): # Revert to previous limits if needed pass ``` **Action Items**: - [ ] Create migration script - [ ] Test migration on staging database - [ ] Backup production database before migration - [ ] Run migration during maintenance window - [ ] Verify all subscriptions updated correctly ### Phase 5: Monitoring & Adjustment (Priority: MEDIUM) #### 5.1 Set Up Monitoring **Metrics to Track**: - Average cost per user per month - Users hitting $45 cost cap - Users hitting individual limits - OSS model usage vs proprietary model usage - Cost savings from OSS models **Action Items**: - [ ] Set up cost monitoring dashboard - [ ] Create alerts for cost cap breaches - [ ] Track OSS model adoption rate - [ ] Monitor user satisfaction with limits #### 5.2 Adjustment Plan **Triggers for Adjustment**: - If average cost > $35/user: Consider reducing limits - If >15% users hit cost cap: Consider increasing cost cap to $50 - If <20% users use video/audio: Consider reducing those limits - If OSS models unavailable: Fallback to proprietary models **Action Items**: - [ ] Define adjustment criteria - [ ] Create adjustment workflow - [ ] Plan communication strategy for limit changes --- ## Next Steps (Priority Order) 1. **CRITICAL**: Update `pricing_service.py` with new Basic tier limits 2. **CRITICAL**: Add WaveSpeed OSS model pricing to `pricing_service.py` 3. **HIGH**: Update default model selection to prefer OSS models 4. **HIGH**: Create and run database migration 5. **HIGH**: Update pricing page documentation 6. **HIGH**: Test limit enforcement and cost calculations 7. **MEDIUM**: Update frontend components with new limits 8. **MEDIUM**: Set up monitoring and alerts 9. **LOW**: Add OSS model indicators to UI --- ## Monitoring & Adjustment Plan ### Key Metrics to Track - Average LLM calls per user per month - Average media generation per user per month - Average cost per user per month - Users hitting cost cap - Users hitting individual limits ### Adjustment Triggers - **If average cost > $25/user**: Consider reducing limits - **If >20% users hit cost cap**: Consider increasing cost cap - **If <10% users use video/audio**: Consider reducing those limits - **If churn rate high**: Consider increasing limits ### Review Schedule - **Week 1-2**: Daily monitoring - **Month 1**: Weekly review - **Month 2-3**: Bi-weekly review - **Month 4+**: Monthly review --- ## Conclusion The proposed Basic tier limits (OSS-Focused) provide: - ✅ **Access to all ALwrity features** with generous limits - ✅ **Sustainable cost structure** using OSS models (25-50% savings) - ✅ **Competitive pricing** ($29 vs competitors $49+) - ✅ **Protection against losses** ($45 cost cap, hard limit $40-50) - ✅ **Room for growth** (can adjust based on usage) - ✅ **OSS-first strategy** (Qwen, FLUX, Ideogram, WAN 2.5, Minimax Speech) - ✅ **Maximum user value** while staying within cost constraints **Key Advantages of OSS-Focused Strategy**: 1. **Cost Efficiency**: 25-50% cost savings vs proprietary models 2. **Better Limits**: Can offer more generations due to lower costs 3. **User Value**: More value for the same $29/month price 4. **Sustainability**: Lower costs = better margins = sustainable business 5. **Flexibility**: Can adjust limits based on actual usage patterns **Recommendation**: **APPROVE** for production launch with OSS-focused strategy. **Confidence Level**: 🟢 **HIGH** (with proper monitoring, cost cap enforcement, and OSS model defaults) **Risk Mitigation**: - $45 cost cap protects against losses (hard limit $40-50) - OSS model defaults ensure cost efficiency - Monitoring allows quick adjustment if needed - Realistic usage assumptions (60-70% of limits)