Files
ALwrity/docs/Billing_Subscription/PRODUCTION_PRICING_STRATEGY.md

34 KiB
Raw Blame History

Production Pricing Strategy - Basic Tier Launch (OSS-Focused)

Executive Summary

This document provides a comprehensive pricing strategy for ALwrity's production launch with Basic Tier only. All features and tools will be accessible to Basic tier users, requiring careful cost calculation and limit setting to ensure sustainability while providing value.

Critical Goals:

  1. OSS-First Strategy: Prioritize Open-Source AI models (WaveSpeed OSS models) for cost efficiency
  2. Hard Cost Cap: $40-50 per user per month maximum (protects against losses)
  3. Maximum User Value: Provide generous limits while staying within cost constraints
  4. Fair Pricing: Balance between sustainability and user value (not excessive profit margins)

Strategy: Use WaveSpeed's OSS models (Qwen, FLUX, Ideogram, WAN 2.5) which offer better pricing than proprietary alternatives, allowing us to provide more value to users while maintaining profitability.


Current State Analysis

Current Basic Tier (Code Implementation)

Price: $29/month ($290/year)

Limits:

  • AI Text Generation: 10 unified calls/month (across all LLM providers)
  • Tokens: 20,000 per provider (Gemini, OpenAI, Anthropic, Mistral)
  • Search APIs: 200 Tavily, 200 Serper, 100 Metaphor, 100 Firecrawl, 500 Exa
  • Image Generation: 5 Stability AI images/month
  • Image Editing: 30 AI image edits/month
  • Video Generation: 20 videos/month
  • Audio Generation: 50 TTS generations/month
  • Monthly Cost Cap: $50.00

Problem: 10 unified AI text generation calls is too restrictive for production launch where users need to experience all features.


ALwrity Tools & Content Generation Analysis

Content Generation Tools

1. Text Generation Tools (Primary LLM Usage)

Tool API Calls per Generation Typical Usage Cost per Generation
Blog Writer 3-5 calls 1 blog = research (1) + outline (1) + content (1-3) $0.01 - $0.05
Story Writer 2-3 calls 1 story = outline (1) + script (1-2) $0.01 - $0.03
Podcast Maker 3-4 calls 1 podcast = research (1) + script (1) + outline (1-2) $0.01 - $0.04
Facebook Writer 1-2 calls 1 post = generation (1) + optional optimization (1) $0.005 - $0.01
LinkedIn Writer 1-2 calls 1 post = generation (1) + optional optimization (1) $0.005 - $0.01
SEO Tools 1-3 calls Varies by tool complexity $0.005 - $0.02
Content Planning 2-4 calls Strategy generation + analysis $0.01 - $0.03

Average: ~2-3 LLM calls per content generation workflow

2. Image Generation Tools

Tool API Calls Cost per Generation
Image Generator 1 Stability call $0.04 per image
Image Editor 1 Image Edit call $0.04 per edit operation

Current Limit: 5 images/month (too low for production)

3. Video Generation Tools

Tool API Calls Cost per Video Notes
Video Studio 1 video call $0.10 - $0.42 Depends on model/duration
YouTube Creator 1 video call per scene $0.10 - $0.42 per scene 5 scenes = $0.50 - $2.10
Story Writer Video 1 video call per scene $0.10 - $0.42 per scene Variable scenes
Podcast Maker Video 1 video call per scene $0.10 - $0.42 per scene Optional video generation

Current Limit: 20 videos/month (reasonable)

4. Audio Generation Tools

Tool API Calls Cost per Generation Notes
Audio Generator 1 audio call $0.05 per 1,000 chars ~$0.10 - $0.50 per audio
Podcast Maker TTS 1 audio call per scene $0.05 per 1,000 chars Multiple scenes
Story Writer Narration 1 audio call per scene $0.05 per 1,000 chars Multiple scenes

Current Limit: 50 audio generations/month (reasonable)


API Cost Breakdown

LLM Provider Costs (Per 1M Tokens)

Provider Model Input Cost Output Cost Typical Use
Gemini 2.5 Flash $0.30 $2.50 Default (cost-effective)
Gemini 2.5 Pro $1.25 $10.00 Premium quality
OpenAI GPT-4o Mini $0.15 $0.60 Cost-effective
OpenAI GPT-4o $2.50 $10.00 Premium quality
Anthropic Claude 3.5 Sonnet $3.00 $15.00 Premium quality
HuggingFace GPT-OSS-120B $1.00 $3.00 Alternative option

Average Cost per LLM Call (assuming 1K input + 2K output tokens):

  • Gemini Flash: ~$0.0056 per call
  • GPT-4o Mini: ~$0.0015 per call
  • Claude 3.5: ~$0.033 per call

Recommendation: Use Gemini Flash as default for cost efficiency.

Search API Costs

Provider Cost per Search Typical Usage
Tavily $0.001 Research operations
Serper $0.001 Research operations
Metaphor $0.003 Research operations
Exa $0.005 Neural search (premium)
Firecrawl $0.002 Web page extraction

Average: ~$0.002 per search operation

Media Generation Costs (OSS-Focused via WaveSpeed)

Image Generation (OSS Models via WaveSpeed)

Model Cost Type Notes
Qwen Image $0.03 per image OSS Fast generation, cost-effective
Ideogram V3 Turbo $0.05 per image OSS Photorealistic, text rendering
Default (Qwen) $0.03 per image OSS Recommended for Basic tier

Image Editing (OSS Models via WaveSpeed)

Model Cost Type Use Case
Qwen Image Edit $0.02 per edit OSS Budget editing, bilingual
Qwen Image Edit Plus $0.02 per edit OSS Multi-image editing
FLUX Kontext Pro $0.04 per edit OSS Typography, professional
Default (Qwen Edit) $0.02 per edit OSS Recommended for Basic tier

Video Generation (OSS Models via WaveSpeed)

Model Cost Type Duration Notes
WAN 2.5 $0.05/sec OSS 5-15 sec Text-to-Video, Image-to-Video
Seedance 1.5 Pro $0.08/sec OSS 10-30 sec Longer duration
Kling v2.5 Turbo (5s) $0.21 per video OSS 5 sec Image-to-Video
Kling v2.5 Turbo (10s) $0.42 per video OSS 10 sec Extended duration
Default (WAN 2.5) $0.25 per video OSS ~5 sec Recommended for Basic tier

Audio Generation (OSS Models via WaveSpeed)

Model Cost Type Notes
Minimax Speech 02 HD $0.05 per 1K chars OSS High-quality TTS
Default $0.05 per 1K chars OSS ~$0.10-0.50 per audio

Face Swap & Specialized (OSS Models via WaveSpeed)

Operation Cost Type Notes
Face Swap $0.01-$0.03 OSS Basic to premium quality
Image Upscaling $0.01-$0.06 OSS 2K/4K/8K options
3D Generation $0.02-$0.30 OSS Budget to premium

OSS Advantage: WaveSpeed provides access to OSS models (Qwen, FLUX, Ideogram, WAN 2.5) at significantly lower costs than proprietary alternatives, enabling better value for users.


Production-Ready Basic Tier Proposal

Revised Limits for Production Launch

Price: $29/month ($290/year) - KEEP CURRENT PRICING

Rationale: Competitive pricing point, allows for sustainable margins with proper limits.

Proposed Limits

1. AI Text Generation (Unified Limit)

  • Current: 10 calls/month TOO LOW
  • Proposed: 50 calls/month
  • Rationale:
    • Allows ~16-25 content generations/month (assuming 2-3 calls each)
    • Enables users to experience Blog Writer, Story Writer, Podcast Maker, Social Writers
    • Sustainable cost: ~$0.28/month (50 calls × $0.0056 average)

2. Token Limits (Per Provider)

  • Current: 20,000 tokens/provider
  • Proposed: 100,000 tokens/provider
  • Rationale:
    • Allows ~33-50 LLM calls per provider (assuming 2K tokens/call)
    • Provides buffer for longer content generation
    • Aligns with unified call limit (50 calls × 2K tokens = 100K tokens)

3. Search APIs

  • Tavily: 200 calls/month (Keep)
  • Serper: 200 calls/month (Keep)
  • Metaphor: 100 calls/month (Keep)
  • Firecrawl: 100 calls/month (Keep)
  • Exa: 500 calls/month (Keep)
  • Rationale: Sufficient for research-heavy tools (Blog Writer, Podcast Maker, SEO tools)

4. Image Generation (OSS Models via WaveSpeed)

  • Current: 5 images/month TOO LOW
  • Proposed: 50 images/month (INCREASED - OSS models are cheaper)
  • Rationale:
    • OSS models (Qwen Image $0.03) are cheaper than Stability ($0.04)
    • Allows users to generate images for Story Writer, Blog Writer, Social Media
    • Cost: ~$1.50/month (50 × $0.03 using Qwen Image OSS model)
    • Enables visual content creation workflows
    • Default to Qwen Image OSS model for cost efficiency

5. Image Editing (OSS Models via WaveSpeed)

  • Current: 30 edits/month
  • Proposed: 50 edits/month (INCREASED - OSS models are cheaper)
  • Rationale:
    • OSS models (Qwen Edit $0.02) are cheaper than Stability ($0.04)
    • Cost: ~$1.00/month (50 × $0.02 using Qwen Edit OSS model)
    • Sufficient for image optimization workflows
    • Default to Qwen Edit OSS model for cost efficiency

6. Video Generation (OSS Models via WaveSpeed)

  • Current: 20 videos/month
  • Proposed: 30 videos/month (INCREASED - OSS models available)
  • Rationale:
    • OSS models (WAN 2.5 $0.25 per 5s video) provide good value
    • Allows ~6-10 full video projects/month (assuming 3-5 scenes each)
    • Cost: ~$7.50/month (30 × $0.25 using WAN 2.5 OSS model)
    • Enables Video Studio, YouTube Creator, Story Writer video features
    • Default to WAN 2.5 OSS model for cost efficiency

7. Audio Generation (OSS Models via WaveSpeed)

  • Current: 50 generations/month
  • Proposed: 100 generations/month (INCREASED - OSS models are affordable)
  • Rationale:
    • OSS models (Minimax Speech 02 HD) provide high quality at $0.05/1K chars
    • Sufficient for Podcast Maker, Story Writer narration
    • Cost: ~$10.00-$25.00/month (depending on length, assuming 2K-5K chars per audio)
    • Enables audio content workflows
    • Default to Minimax Speech 02 HD OSS model

8. Monthly Cost Cap

  • Current: $50.00
  • Proposed: $45.00 (ADJUSTED - aligns with $40-50 target)
  • Rationale:
    • Protects against unexpected high usage
    • Allows flexibility within limits
    • Provides safety margin
    • Aligns with $40-50 hard limit requirement

Cost Analysis: Proposed Basic Tier (OSS-Focused)

Monthly Cost Breakdown (Per User) - Using OSS Models

Category Usage Cost per Unit (OSS) Monthly Cost
LLM Calls 50 calls $0.0056 avg (Gemini Flash) $0.28
Search APIs 200 searches $0.002 avg $0.40
Image Generation 50 images $0.03 (Qwen Image OSS) $1.50
Image Editing 50 edits $0.02 (Qwen Edit OSS) $1.00
Video Generation 30 videos $0.25 (WAN 2.5 OSS, ~5s) $7.50
Audio Generation 100 audios $0.10-$0.50 avg $10.00-$25.00
Total Variable Cost $20.68-$35.68

Margin Analysis (OSS-Focused)

Subscription Revenue: $29.00/month Variable Costs (OSS Models): $20.68-$35.68/month (depending on usage) Gross Margin: -$6.68 to +$8.32/month

IMPROVEMENT: OSS models reduce costs significantly:

  • Image generation: $0.03 vs $0.04 (25% savings)
  • Image editing: $0.02 vs $0.04 (50% savings)
  • Video generation: $0.25 vs $0.42 (40% savings)

Mitigation Strategy:

  1. Cost cap enforcement: Monthly cost cap of $45 prevents extreme losses
  2. OSS model defaults: Default to cheaper OSS models (Qwen, WAN 2.5)
  3. Realistic usage: Most users won't hit all limits simultaneously
  4. Average usage assumption: ~60-70% of limits = $12-25 cost = $4-17 margin
  5. Hard limit protection: $45 cap ensures we never exceed $50/user/month

Revised Basic Tier Limits (Production-Ready, OSS-Focused)

{
    "name": "Basic",
    "tier": SubscriptionTier.BASIC,
    "price_monthly": 29.0,
    "price_yearly": 290.0,
    
    # AI Text Generation (Unified Limit)
    "ai_text_generation_calls_limit": 50,  # INCREASED from 10
    
    # Token Limits (Per Provider)
    "gemini_tokens_limit": 100000,  # INCREASED from 20,000
    "openai_tokens_limit": 100000,  # INCREASED from 20,000
    "anthropic_tokens_limit": 100000,  # INCREASED from 20,000
    "mistral_tokens_limit": 100000,  # INCREASED from 20,000
    
    # Search APIs
    "tavily_calls_limit": 200,  # Keep
    "serper_calls_limit": 200,  # Keep
    "metaphor_calls_limit": 100,  # Keep
    "firecrawl_calls_limit": 100,  # Keep
    "exa_calls_limit": 500,  # Keep
    
    # Media Generation (OSS Models via WaveSpeed)
    "stability_calls_limit": 50,  # INCREASED from 5 (using Qwen Image OSS $0.03)
    "image_edit_calls_limit": 50,  # INCREASED from 30 (using Qwen Edit OSS $0.02)
    "video_calls_limit": 30,  # INCREASED from 20 (using WAN 2.5 OSS $0.25)
    "audio_calls_limit": 100,  # INCREASED from 50 (using Minimax Speech OSS)
    
    # Cost Protection
    "monthly_cost_limit": 45.0,  # ADJUSTED from 50.0 (aligns with $40-50 target)
    
    # OSS Model Defaults
    "default_image_model": "qwen-image",  # OSS model via WaveSpeed
    "default_image_edit_model": "qwen-edit",  # OSS model via WaveSpeed
    "default_video_model": "wan-2.5",  # OSS model via WaveSpeed
    "default_audio_model": "minimax-speech-02-hd",  # OSS model via WaveSpeed
    
    # Features
    "features": [
        "full_content_generation",
        "advanced_research", 
        "basic_analytics",
        "all_tools_access",  # All ALwrity tools accessible
        "billing_dashboard",
        "usage_tracking",
        "oss_models_priority"  # NEW: OSS models prioritized for cost efficiency
    ],
    "description": "Perfect for individuals and small teams. Access all ALwrity features with generous limits powered by OSS AI models."
}

Tool Usage Scenarios & Limits

Scenario 1: Blog Writer User

  • Workflow: 1 blog post = 3-5 LLM calls + 3-5 search calls + 1-2 images
  • Monthly Capacity: ~10-16 blog posts (with 50 LLM calls)
  • Cost: ~$0.50-$1.00 per blog post
  • Status: FEASIBLE

Scenario 2: Story Writer User

  • Workflow: 1 story = 2-3 LLM calls + 5-10 images + 5-10 audio + 5-10 videos
  • Monthly Capacity: ~16-25 stories (LLM limit) OR ~3-6 stories (image/video limits)
  • Cost: ~$2.00-$5.00 per story
  • Status: FEASIBLE (limited by media, not LLM)

Scenario 3: Podcast Maker User

  • Workflow: 1 podcast = 3-4 LLM calls + 3-5 search calls + 5-10 audio + optional 5-10 videos
  • Monthly Capacity: ~12-16 podcasts (LLM limit) OR ~5-10 podcasts (audio limit)
  • Cost: ~$1.00-$3.00 per podcast (without video)
  • Status: FEASIBLE

Scenario 4: Social Media Content Creator

  • Workflow: 1 post = 1-2 LLM calls + 1 image (optional)
  • Monthly Capacity: ~25-50 posts (LLM limit) OR ~30 posts (image limit)
  • Cost: ~$0.10-$0.15 per post
  • Status: FEASIBLE

Scenario 5: Video Creator (YouTube Creator)

  • Workflow: 1 video = 2-3 LLM calls + 5 scenes × (1 image + 1 audio + 1 video)
  • Monthly Capacity: ~4-5 full videos (video limit) OR ~16-25 videos (LLM limit)
  • Cost: ~$3.00-$5.00 per video
  • Status: FEASIBLE (limited by video limit, not LLM)

Risk Mitigation Strategies

1. Cost Cap Enforcement

  • Monthly cost cap: $50.00 (hard limit)
  • Behavior: When cap reached, all API calls blocked until next billing period
  • Protection: Prevents losses from extreme usage

2. Pre-flight Validation

  • Implementation: Already in place
  • Function: Validates limits BEFORE making API calls
  • Benefit: Prevents wasted API calls on operations that would fail

3. Usage Monitoring & Alerts

  • 80% Warning: Alert users at 80% of limits
  • 100% Block: Block operations at 100% of limits
  • Dashboard: Real-time usage tracking

4. Optimized Default Models

  • Strategy: Use cost-effective models by default (Gemini Flash, GPT-4o Mini)
  • Benefit: Reduces costs while maintaining quality
  • User Control: Allow model selection for power users

5. Efficient API Usage

  • Batching: Batch multiple operations where possible
  • Caching: Cache research results and common queries
  • Optimization: Continue optimizing tool workflows to reduce API calls

Pricing Page Updates Required

Current Issues

  1. Pricing page shows outdated limits
  2. Missing unified ai_text_generation_calls_limit explanation
  3. Token limits don't match code (shows 1M/500K, code has 20K)
  4. Missing video/audio/image editing limits
  5. Missing cost transparency information

Required Updates

Basic Tier Display

💰 Basic Plan - $29/month ($290/year)

✨ All ALwrity Features Included:
✅ Blog Writer, Story Writer, Podcast Maker
✅ Image Generator & Editor
✅ Video Studio & YouTube Creator
✅ Audio Generator
✅ All Social Media Writers
✅ All SEO Tools & Dashboards
✅ Content Planning & Strategy Tools

📊 Usage Limits:
• 50 AI Text Generations/month (unified across all LLM providers)
• 100,000 tokens per provider (Gemini, OpenAI, Anthropic, Mistral)
• 200 Research Searches/month (Tavily, Serper)
• 500 Neural Searches/month (Exa)
• 30 AI Images/month
• 30 Image Edits/month
• 20 AI Videos/month
• 50 AI Audio Generations/month
• $50 Monthly Cost Cap (protects you from overages)

💡 Perfect for: Individuals, content creators, small teams

Implementation Checklist

Phase 1: Update Code Limits

  • Update pricing_service.py Basic tier limits:
    • ai_text_generation_calls_limit: 10 → 50
    • gemini_tokens_limit: 20,000 → 100,000
    • openai_tokens_limit: 20,000 → 100,000
    • anthropic_tokens_limit: 20,000 → 100,000
    • mistral_tokens_limit: 20,000 → 100,000
    • stability_calls_limit: 5 → 30
  • Run database migration script
  • Test limit enforcement

Phase 2: Update Pricing Page

  • Update docs-site/docs/features/subscription/pricing.md
  • Update frontend pricing page component
  • Add cost transparency section
  • Add tool usage examples
  • Add FAQ section

Phase 3: Update Documentation

  • Update subscription rule file (.cursor/rules/subscription.mdc)
  • Update API documentation
  • Create user-facing pricing guide

Phase 4: Testing

  • Test all tools with new limits
  • Verify cost calculations
  • Test limit enforcement
  • Test cost cap enforcement
  • Verify pre-flight validation

Cost Calculation Examples

Example 1: Blog Writer - 1 Blog Post (OSS Models)

Research: 3 Exa searches = $0.015
Outline: 1 LLM call (Gemini Flash) = $0.0056
Content: 2 LLM calls (Gemini Flash) = $0.0112
Image: 1 Qwen Image OSS = $0.03 (vs $0.04 Stability)
Total: ~$0.06 per blog post (saved $0.01 with OSS)

Example 2: Story Writer - 1 Story (5 scenes, OSS Models)

Outline: 1 LLM call = $0.0056
Script: 1 LLM call = $0.0056
Images: 5 × $0.03 (Qwen Image OSS) = $0.15 (vs $0.20)
Audio: 5 × $0.10 = $0.50
Videos: 5 × $0.25 (WAN 2.5 OSS) = $1.25 (vs $0.50-$2.10)
Total: ~$1.96 per story (higher video cost, but better quality)

Example 3: Podcast Maker - 1 Episode (10 min, 5 scenes, OSS Models)

Research: 3 Exa searches = $0.015
Script: 1 LLM call = $0.0056
Outline: 1 LLM call = $0.0056
Audio: 5 × $0.20 (Minimax Speech OSS) = $1.00
Video (optional): 5 × $0.25 (WAN 2.5 OSS) = $1.25
Total: ~$1.03 per podcast (without video)
Total: ~$2.28 per podcast (with video, OSS models)

Example 4: Social Media - 10 Posts (OSS Models)

Generation: 10 × 1 LLM call = 10 calls × $0.0056 = $0.056
Images: 10 × $0.03 (Qwen Image OSS) = $0.30 (vs $0.40)
Total: ~$0.36 for 10 posts (saved $0.10 with OSS)

Competitive Analysis

Similar AI Content Platforms

Platform Price Limits Notes
Jasper $49/month 50K words Text-focused
Copy.ai $49/month Unlimited words Text-focused
Writesonic $19/month 100K words Text-focused
ALwrity Basic $29/month 50 LLM calls + media Full platform

ALwrity Advantage:

  • Lower price point ($29 vs $49)
  • Includes video, image, audio generation (competitors don't)
  • Comprehensive tool suite (not just text)
  • Better value proposition

Recommendations Summary

APPROVED: Production-Ready Basic Tier (OSS-Focused)

Price: $29/month ($290/year) - KEEP

Key Changes (OSS-Focused):

  1. Increase AI Text Generation: 10 → 50 calls/month
  2. Increase Token Limits: 20K → 100K per provider
  3. Increase Image Generation: 5 → 50 images/month (OSS: Qwen Image $0.03)
  4. Increase Image Editing: 30 → 50 edits/month (OSS: Qwen Edit $0.02)
  5. Increase Video Generation: 20 → 30 videos/month (OSS: WAN 2.5 $0.25)
  6. Increase Audio Generation: 50 → 100 generations/month (OSS: Minimax Speech)
  7. Adjust Cost Cap: $50 → $45 (aligns with $40-50 target)
  8. Default to OSS Models: Qwen, WAN 2.5, Minimax Speech (cost-efficient)

Expected Outcomes:

  • Users can experience all ALwrity features with generous limits
  • Sustainable cost structure (~$20-35/user/month average with OSS models)
  • Competitive pricing ($29 vs competitors $49+)
  • Room for margin ($4-17/user/month average)
  • Cost cap ($45) protects against losses (hard limit $40-50)
  • OSS models provide 25-50% cost savings vs proprietary alternatives

Risk Level: 🟢 LOW (with cost cap enforcement and OSS model defaults)


Implementation Plan

Phase 1: Update Pricing Service & Database (Priority: HIGH)

1.1 Update pricing_service.py Basic Tier Limits

File: backend/services/subscription/pricing_service.py

Changes Required:

# In initialize_default_plans() method
{
    "name": "Basic",
    "tier": SubscriptionTier.BASIC,
    "price_monthly": 29.0,
    "price_yearly": 290.0,
    
    # AI Text Generation (Unified Limit)
    "ai_text_generation_calls_limit": 50,  # Changed from 10
    
    # Token Limits (Per Provider)
    "gemini_tokens_limit": 100000,  # Changed from 20,000
    "openai_tokens_limit": 100000,  # Changed from 20,000
    "anthropic_tokens_limit": 100000,  # Changed from 20,000
    "mistral_tokens_limit": 100000,  # Changed from 20,000
    
    # Search APIs (Keep existing)
    "tavily_calls_limit": 200,
    "serper_calls_limit": 200,
    "metaphor_calls_limit": 100,
    "firecrawl_calls_limit": 100,
    "exa_calls_limit": 500,
    
    # Media Generation (OSS Models via WaveSpeed)
    "stability_calls_limit": 50,  # Changed from 5 (now includes WaveSpeed OSS)
    "image_edit_calls_limit": 50,  # Changed from 30
    "video_calls_limit": 30,  # Changed from 20
    "audio_calls_limit": 100,  # Changed from 50
    
    # Cost Protection
    "monthly_cost_limit": 45.0,  # Changed from 50.0
}

Action Items:

  • Update initialize_default_plans() method in pricing_service.py
  • Run database migration to update existing Basic tier subscriptions
  • Test limit enforcement with new values
  • Verify cost calculations reflect OSS model pricing

1.2 Update WaveSpeed Model Pricing in pricing_service.py

File: backend/services/subscription/pricing_service.py

Changes Required:

# In initialize_default_pricing() method, update/add WaveSpeed OSS model pricing:

# Image Generation (OSS Models via WaveSpeed)
{
    "provider": APIProvider.IMAGE,
    "model_name": "qwen-image",
    "cost_per_request": 0.03,  # OSS model via WaveSpeed
    "description": "WaveSpeed Qwen Image (OSS) - Fast generation"
},
{
    "provider": APIProvider.IMAGE,
    "model_name": "ideogram-v3-turbo",
    "cost_per_request": 0.05,  # OSS model via WaveSpeed
    "description": "WaveSpeed Ideogram V3 Turbo (OSS) - Photorealistic"
},

# Image Editing (OSS Models via WaveSpeed)
{
    "provider": APIProvider.IMAGE_EDIT,
    "model_name": "qwen-edit",
    "cost_per_request": 0.02,  # OSS model via WaveSpeed
    "description": "WaveSpeed Qwen Image Edit (OSS) - Budget editing"
},
{
    "provider": APIProvider.IMAGE_EDIT,
    "model_name": "qwen-edit-plus",
    "cost_per_request": 0.02,  # OSS model via WaveSpeed
    "description": "WaveSpeed Qwen Image Edit Plus (OSS) - Multi-image"
},
{
    "provider": APIProvider.IMAGE_EDIT,
    "model_name": "flux-kontext-pro",
    "cost_per_request": 0.04,  # OSS model via WaveSpeed
    "description": "WaveSpeed FLUX Kontext Pro (OSS) - Professional"
},

# Video Generation (OSS Models via WaveSpeed)
{
    "provider": APIProvider.VIDEO,
    "model_name": "wan-2.5",
    "cost_per_request": 0.25,  # OSS model via WaveSpeed (~5 seconds)
    "description": "WaveSpeed WAN 2.5 (OSS) - Text-to-Video, Image-to-Video"
},
{
    "provider": APIProvider.VIDEO,
    "model_name": "seedance-1.5-pro",
    "cost_per_request": 0.40,  # OSS model via WaveSpeed (~5 seconds)
    "description": "WaveSpeed Seedance 1.5 Pro (OSS) - Longer duration"
},

# Audio Generation (OSS Models via WaveSpeed)
{
    "provider": APIProvider.AUDIO,
    "model_name": "minimax-speech-02-hd",
    "cost_per_input_token": 0.00005,  # $0.05 per 1K chars
    "cost_per_output_token": 0.0,
    "cost_per_request": 0.0,
    "description": "WaveSpeed Minimax Speech 02 HD (OSS) - High-quality TTS"
},

Action Items:

  • Add WaveSpeed OSS model pricing entries
  • Update default model selection logic to prefer OSS models
  • Test cost calculation with OSS models
  • Verify pricing accuracy against WaveSpeed API documentation

1.3 Update Default Model Selection Logic

Files:

  • backend/services/llm_providers/main_image_generation.py
  • backend/services/image_studio/create_service.py
  • backend/services/image_studio/edit_service.py
  • backend/services/video_studio/video_service.py
  • backend/services/audio_generation/audio_service.py

Changes Required:

  • Default image generation to qwen-image (OSS) instead of Stability
  • Default image editing to qwen-edit (OSS) instead of Stability
  • Default video generation to wan-2.5 (OSS) instead of HuggingFace
  • Default audio generation to minimax-speech-02-hd (OSS)

Action Items:

  • Update get_default_provider() methods to prefer WaveSpeed OSS models
  • Update model selection UI to show OSS models as default/recommended
  • Add cost comparison tooltips showing OSS model savings
  • Test all tools with OSS model defaults

Phase 2: Update Frontend & Documentation (Priority: HIGH)

2.1 Update Pricing Page

File: docs-site/docs/features/subscription/pricing.md

Changes Required:

  • Update Basic tier limits to reflect new values (50 images, 50 edits, 30 videos, 100 audio)
  • Add OSS model information and cost savings messaging
  • Update cost examples to use OSS model pricing
  • Add FAQ about OSS models and cost efficiency

Action Items:

  • Update pricing page markdown
  • Update frontend pricing component (if exists)
  • Add OSS model badges/indicators
  • Add cost comparison table (OSS vs proprietary)

2.2 Update Subscription Context & Components

Files:

  • frontend/src/contexts/SubscriptionContext.tsx
  • frontend/src/components/billing/EnhancedBillingDashboard.tsx
  • frontend/src/components/shared/UsageDashboard.tsx

Changes Required:

  • Display OSS model indicators in usage dashboard
  • Show cost savings from using OSS models
  • Update limit displays to show new Basic tier limits
  • Add tooltips explaining OSS model benefits

Action Items:

  • Update limit displays in billing dashboard
  • Add OSS model indicators in cost breakdown
  • Update usage statistics to reflect new limits
  • Test UI with new limit values

Phase 3: Testing & Validation (Priority: CRITICAL)

3.1 Limit Enforcement Testing

Test Cases:

  • Test 50 AI text generation calls limit
  • Test 50 image generation limit (OSS models)
  • Test 50 image editing limit (OSS models)
  • Test 30 video generation limit (OSS models)
  • Test 100 audio generation limit (OSS models)
  • Test $45 monthly cost cap enforcement
  • Test pre-flight validation with new limits
  • Test limit exceeded error messages

3.2 Cost Calculation Testing

Test Cases:

  • Verify Qwen Image cost: $0.03 per image
  • Verify Qwen Edit cost: $0.02 per edit
  • Verify WAN 2.5 video cost: $0.25 per video
  • Verify Minimax Speech cost: $0.05 per 1K chars
  • Test cost aggregation across all operations
  • Test cost cap enforcement at $45
  • Verify cost display in billing dashboard

3.3 OSS Model Integration Testing

Test Cases:

  • Test Qwen Image generation via WaveSpeed
  • Test Qwen Edit editing via WaveSpeed
  • Test WAN 2.5 video generation via WaveSpeed
  • Test Minimax Speech audio generation via WaveSpeed
  • Verify default model selection uses OSS models
  • Test model fallback if OSS model unavailable
  • Verify cost tracking for OSS models

Phase 4: Database Migration (Priority: HIGH)

4.1 Create Migration Script

File: backend/database/migrations/update_basic_tier_limits_oss.py

Script Requirements:

"""
Migration: Update Basic Tier Limits for OSS-Focused Pricing Strategy
- Increase AI text generation: 10 → 50
- Increase token limits: 20K → 100K per provider
- Increase image generation: 5 → 50
- Increase image editing: 30 → 50
- Increase video generation: 20 → 30
- Increase audio generation: 50 → 100
- Adjust cost cap: $50 → $45
"""

def upgrade():
    # Update SubscriptionPlan for Basic tier
    # Update existing UserSubscription records
    # Clear pricing service cache
    pass

def downgrade():
    # Revert to previous limits if needed
    pass

Action Items:

  • Create migration script
  • Test migration on staging database
  • Backup production database before migration
  • Run migration during maintenance window
  • Verify all subscriptions updated correctly

Phase 5: Monitoring & Adjustment (Priority: MEDIUM)

5.1 Set Up Monitoring

Metrics to Track:

  • Average cost per user per month
  • Users hitting $45 cost cap
  • Users hitting individual limits
  • OSS model usage vs proprietary model usage
  • Cost savings from OSS models

Action Items:

  • Set up cost monitoring dashboard
  • Create alerts for cost cap breaches
  • Track OSS model adoption rate
  • Monitor user satisfaction with limits

5.2 Adjustment Plan

Triggers for Adjustment:

  • If average cost > $35/user: Consider reducing limits
  • If >15% users hit cost cap: Consider increasing cost cap to $50
  • If <20% users use video/audio: Consider reducing those limits
  • If OSS models unavailable: Fallback to proprietary models

Action Items:

  • Define adjustment criteria
  • Create adjustment workflow
  • Plan communication strategy for limit changes

Next Steps (Priority Order)

  1. CRITICAL: Update pricing_service.py with new Basic tier limits
  2. CRITICAL: Add WaveSpeed OSS model pricing to pricing_service.py
  3. HIGH: Update default model selection to prefer OSS models
  4. HIGH: Create and run database migration
  5. HIGH: Update pricing page documentation
  6. HIGH: Test limit enforcement and cost calculations
  7. MEDIUM: Update frontend components with new limits
  8. MEDIUM: Set up monitoring and alerts
  9. LOW: Add OSS model indicators to UI

Monitoring & Adjustment Plan

Key Metrics to Track

  • Average LLM calls per user per month
  • Average media generation per user per month
  • Average cost per user per month
  • Users hitting cost cap
  • Users hitting individual limits

Adjustment Triggers

  • If average cost > $25/user: Consider reducing limits
  • If >20% users hit cost cap: Consider increasing cost cap
  • If <10% users use video/audio: Consider reducing those limits
  • If churn rate high: Consider increasing limits

Review Schedule

  • Week 1-2: Daily monitoring
  • Month 1: Weekly review
  • Month 2-3: Bi-weekly review
  • Month 4+: Monthly review

Conclusion

The proposed Basic tier limits (OSS-Focused) provide:

  • Access to all ALwrity features with generous limits
  • Sustainable cost structure using OSS models (25-50% savings)
  • Competitive pricing ($29 vs competitors $49+)
  • Protection against losses ($45 cost cap, hard limit $40-50)
  • Room for growth (can adjust based on usage)
  • OSS-first strategy (Qwen, FLUX, Ideogram, WAN 2.5, Minimax Speech)
  • Maximum user value while staying within cost constraints

Key Advantages of OSS-Focused Strategy:

  1. Cost Efficiency: 25-50% cost savings vs proprietary models
  2. Better Limits: Can offer more generations due to lower costs
  3. User Value: More value for the same $29/month price
  4. Sustainability: Lower costs = better margins = sustainable business
  5. Flexibility: Can adjust limits based on actual usage patterns

Recommendation: APPROVE for production launch with OSS-focused strategy.

Confidence Level: 🟢 HIGH (with proper monitoring, cost cap enforcement, and OSS model defaults)

Risk Mitigation:

  • $45 cost cap protects against losses (hard limit $40-50)
  • OSS model defaults ensure cost efficiency
  • Monitoring allows quick adjustment if needed
  • Realistic usage assumptions (60-70% of limits)