AI Analysis and Content Strategy fixes. Enhanced Strategy Routes refactoring.
This commit is contained in:
899
docs/Billing_Subscription/PRODUCTION_PRICING_STRATEGY.md
Normal file
899
docs/Billing_Subscription/PRODUCTION_PRICING_STRATEGY.md
Normal file
@@ -0,0 +1,899 @@
|
||||
# Production Pricing Strategy - Basic Tier Launch (OSS-Focused)
|
||||
|
||||
## Executive Summary
|
||||
|
||||
This document provides a comprehensive pricing strategy for ALwrity's production launch with **Basic Tier only**. All features and tools will be accessible to Basic tier users, requiring careful cost calculation and limit setting to ensure sustainability while providing value.
|
||||
|
||||
**Critical Goals**:
|
||||
1. **OSS-First Strategy**: Prioritize Open-Source AI models (WaveSpeed OSS models) for cost efficiency
|
||||
2. **Hard Cost Cap**: $40-50 per user per month maximum (protects against losses)
|
||||
3. **Maximum User Value**: Provide generous limits while staying within cost constraints
|
||||
4. **Fair Pricing**: Balance between sustainability and user value (not excessive profit margins)
|
||||
|
||||
**Strategy**: Use WaveSpeed's OSS models (Qwen, FLUX, Ideogram, WAN 2.5) which offer better pricing than proprietary alternatives, allowing us to provide more value to users while maintaining profitability.
|
||||
|
||||
---
|
||||
|
||||
## Current State Analysis
|
||||
|
||||
### Current Basic Tier (Code Implementation)
|
||||
|
||||
**Price**: $29/month ($290/year)
|
||||
|
||||
**Limits**:
|
||||
- **AI Text Generation**: 10 unified calls/month (across all LLM providers)
|
||||
- **Tokens**: 20,000 per provider (Gemini, OpenAI, Anthropic, Mistral)
|
||||
- **Search APIs**: 200 Tavily, 200 Serper, 100 Metaphor, 100 Firecrawl, 500 Exa
|
||||
- **Image Generation**: 5 Stability AI images/month
|
||||
- **Image Editing**: 30 AI image edits/month
|
||||
- **Video Generation**: 20 videos/month
|
||||
- **Audio Generation**: 50 TTS generations/month
|
||||
- **Monthly Cost Cap**: $50.00
|
||||
|
||||
**Problem**: 10 unified AI text generation calls is **too restrictive** for production launch where users need to experience all features.
|
||||
|
||||
---
|
||||
|
||||
## ALwrity Tools & Content Generation Analysis
|
||||
|
||||
### Content Generation Tools
|
||||
|
||||
#### 1. **Text Generation Tools** (Primary LLM Usage)
|
||||
|
||||
| Tool | API Calls per Generation | Typical Usage | Cost per Generation |
|
||||
|------|--------------------------|---------------|---------------------|
|
||||
| **Blog Writer** | 3-5 calls | 1 blog = research (1) + outline (1) + content (1-3) | $0.01 - $0.05 |
|
||||
| **Story Writer** | 2-3 calls | 1 story = outline (1) + script (1-2) | $0.01 - $0.03 |
|
||||
| **Podcast Maker** | 3-4 calls | 1 podcast = research (1) + script (1) + outline (1-2) | $0.01 - $0.04 |
|
||||
| **Facebook Writer** | 1-2 calls | 1 post = generation (1) + optional optimization (1) | $0.005 - $0.01 |
|
||||
| **LinkedIn Writer** | 1-2 calls | 1 post = generation (1) + optional optimization (1) | $0.005 - $0.01 |
|
||||
| **SEO Tools** | 1-3 calls | Varies by tool complexity | $0.005 - $0.02 |
|
||||
| **Content Planning** | 2-4 calls | Strategy generation + analysis | $0.01 - $0.03 |
|
||||
|
||||
**Average**: ~2-3 LLM calls per content generation workflow
|
||||
|
||||
#### 2. **Image Generation Tools**
|
||||
|
||||
| Tool | API Calls | Cost per Generation |
|
||||
|------|-----------|---------------------|
|
||||
| **Image Generator** | 1 Stability call | $0.04 per image |
|
||||
| **Image Editor** | 1 Image Edit call | $0.04 per edit operation |
|
||||
|
||||
**Current Limit**: 5 images/month (too low for production)
|
||||
|
||||
#### 3. **Video Generation Tools**
|
||||
|
||||
| Tool | API Calls | Cost per Video | Notes |
|
||||
|------|-----------|-----------------|-------|
|
||||
| **Video Studio** | 1 video call | $0.10 - $0.42 | Depends on model/duration |
|
||||
| **YouTube Creator** | 1 video call per scene | $0.10 - $0.42 per scene | 5 scenes = $0.50 - $2.10 |
|
||||
| **Story Writer Video** | 1 video call per scene | $0.10 - $0.42 per scene | Variable scenes |
|
||||
| **Podcast Maker Video** | 1 video call per scene | $0.10 - $0.42 per scene | Optional video generation |
|
||||
|
||||
**Current Limit**: 20 videos/month (reasonable)
|
||||
|
||||
#### 4. **Audio Generation Tools**
|
||||
|
||||
| Tool | API Calls | Cost per Generation | Notes |
|
||||
|------|-----------|---------------------|-------|
|
||||
| **Audio Generator** | 1 audio call | $0.05 per 1,000 chars | ~$0.10 - $0.50 per audio |
|
||||
| **Podcast Maker TTS** | 1 audio call per scene | $0.05 per 1,000 chars | Multiple scenes |
|
||||
| **Story Writer Narration** | 1 audio call per scene | $0.05 per 1,000 chars | Multiple scenes |
|
||||
|
||||
**Current Limit**: 50 audio generations/month (reasonable)
|
||||
|
||||
---
|
||||
|
||||
## API Cost Breakdown
|
||||
|
||||
### LLM Provider Costs (Per 1M Tokens)
|
||||
|
||||
| Provider | Model | Input Cost | Output Cost | Typical Use |
|
||||
|----------|-------|------------|-------------|-------------|
|
||||
| **Gemini** | 2.5 Flash | $0.30 | $2.50 | Default (cost-effective) |
|
||||
| **Gemini** | 2.5 Pro | $1.25 | $10.00 | Premium quality |
|
||||
| **OpenAI** | GPT-4o Mini | $0.15 | $0.60 | Cost-effective |
|
||||
| **OpenAI** | GPT-4o | $2.50 | $10.00 | Premium quality |
|
||||
| **Anthropic** | Claude 3.5 Sonnet | $3.00 | $15.00 | Premium quality |
|
||||
| **HuggingFace** | GPT-OSS-120B | $1.00 | $3.00 | Alternative option |
|
||||
|
||||
**Average Cost per LLM Call** (assuming 1K input + 2K output tokens):
|
||||
- Gemini Flash: ~$0.0056 per call
|
||||
- GPT-4o Mini: ~$0.0015 per call
|
||||
- Claude 3.5: ~$0.033 per call
|
||||
|
||||
**Recommendation**: Use Gemini Flash as default for cost efficiency.
|
||||
|
||||
### Search API Costs
|
||||
|
||||
| Provider | Cost per Search | Typical Usage |
|
||||
|----------|----------------|---------------|
|
||||
| **Tavily** | $0.001 | Research operations |
|
||||
| **Serper** | $0.001 | Research operations |
|
||||
| **Metaphor** | $0.003 | Research operations |
|
||||
| **Exa** | $0.005 | Neural search (premium) |
|
||||
| **Firecrawl** | $0.002 | Web page extraction |
|
||||
|
||||
**Average**: ~$0.002 per search operation
|
||||
|
||||
### Media Generation Costs (OSS-Focused via WaveSpeed)
|
||||
|
||||
#### **Image Generation** (OSS Models via WaveSpeed)
|
||||
| Model | Cost | Type | Notes |
|
||||
|------|------|------|-------|
|
||||
| **Qwen Image** | $0.03 per image | OSS | Fast generation, cost-effective |
|
||||
| **Ideogram V3 Turbo** | $0.05 per image | OSS | Photorealistic, text rendering |
|
||||
| **Default (Qwen)** | $0.03 per image | OSS | **Recommended for Basic tier** |
|
||||
|
||||
#### **Image Editing** (OSS Models via WaveSpeed)
|
||||
| Model | Cost | Type | Use Case |
|
||||
|------|------|------|----------|
|
||||
| **Qwen Image Edit** | $0.02 per edit | OSS | Budget editing, bilingual |
|
||||
| **Qwen Image Edit Plus** | $0.02 per edit | OSS | Multi-image editing |
|
||||
| **FLUX Kontext Pro** | $0.04 per edit | OSS | Typography, professional |
|
||||
| **Default (Qwen Edit)** | $0.02 per edit | OSS | **Recommended for Basic tier** |
|
||||
|
||||
#### **Video Generation** (OSS Models via WaveSpeed)
|
||||
| Model | Cost | Type | Duration | Notes |
|
||||
|------|------|------|----------|-------|
|
||||
| **WAN 2.5** | $0.05/sec | OSS | 5-15 sec | Text-to-Video, Image-to-Video |
|
||||
| **Seedance 1.5 Pro** | $0.08/sec | OSS | 10-30 sec | Longer duration |
|
||||
| **Kling v2.5 Turbo (5s)** | $0.21 per video | OSS | 5 sec | Image-to-Video |
|
||||
| **Kling v2.5 Turbo (10s)** | $0.42 per video | OSS | 10 sec | Extended duration |
|
||||
| **Default (WAN 2.5)** | $0.25 per video | OSS | ~5 sec | **Recommended for Basic tier** |
|
||||
|
||||
#### **Audio Generation** (OSS Models via WaveSpeed)
|
||||
| Model | Cost | Type | Notes |
|
||||
|------|------|------|-------|
|
||||
| **Minimax Speech 02 HD** | $0.05 per 1K chars | OSS | High-quality TTS |
|
||||
| **Default** | $0.05 per 1K chars | OSS | ~$0.10-0.50 per audio |
|
||||
|
||||
#### **Face Swap & Specialized** (OSS Models via WaveSpeed)
|
||||
| Operation | Cost | Type | Notes |
|
||||
|-----------|------|------|-------|
|
||||
| **Face Swap** | $0.01-$0.03 | OSS | Basic to premium quality |
|
||||
| **Image Upscaling** | $0.01-$0.06 | OSS | 2K/4K/8K options |
|
||||
| **3D Generation** | $0.02-$0.30 | OSS | Budget to premium |
|
||||
|
||||
**OSS Advantage**: WaveSpeed provides access to OSS models (Qwen, FLUX, Ideogram, WAN 2.5) at significantly lower costs than proprietary alternatives, enabling better value for users.
|
||||
|
||||
---
|
||||
|
||||
## Production-Ready Basic Tier Proposal
|
||||
|
||||
### Revised Limits for Production Launch
|
||||
|
||||
**Price**: $29/month ($290/year) - **KEEP CURRENT PRICING**
|
||||
|
||||
**Rationale**: Competitive pricing point, allows for sustainable margins with proper limits.
|
||||
|
||||
### Proposed Limits
|
||||
|
||||
#### 1. **AI Text Generation** (Unified Limit)
|
||||
- **Current**: 10 calls/month ❌ **TOO LOW**
|
||||
- **Proposed**: **50 calls/month** ✅
|
||||
- **Rationale**:
|
||||
- Allows ~16-25 content generations/month (assuming 2-3 calls each)
|
||||
- Enables users to experience Blog Writer, Story Writer, Podcast Maker, Social Writers
|
||||
- Sustainable cost: ~$0.28/month (50 calls × $0.0056 average)
|
||||
|
||||
#### 2. **Token Limits** (Per Provider)
|
||||
- **Current**: 20,000 tokens/provider
|
||||
- **Proposed**: **100,000 tokens/provider** ✅
|
||||
- **Rationale**:
|
||||
- Allows ~33-50 LLM calls per provider (assuming 2K tokens/call)
|
||||
- Provides buffer for longer content generation
|
||||
- Aligns with unified call limit (50 calls × 2K tokens = 100K tokens)
|
||||
|
||||
#### 3. **Search APIs**
|
||||
- **Tavily**: 200 calls/month ✅ (Keep)
|
||||
- **Serper**: 200 calls/month ✅ (Keep)
|
||||
- **Metaphor**: 100 calls/month ✅ (Keep)
|
||||
- **Firecrawl**: 100 calls/month ✅ (Keep)
|
||||
- **Exa**: 500 calls/month ✅ (Keep)
|
||||
- **Rationale**: Sufficient for research-heavy tools (Blog Writer, Podcast Maker, SEO tools)
|
||||
|
||||
#### 4. **Image Generation** (OSS Models via WaveSpeed)
|
||||
- **Current**: 5 images/month ❌ **TOO LOW**
|
||||
- **Proposed**: **50 images/month** ✅ (INCREASED - OSS models are cheaper)
|
||||
- **Rationale**:
|
||||
- OSS models (Qwen Image $0.03) are cheaper than Stability ($0.04)
|
||||
- Allows users to generate images for Story Writer, Blog Writer, Social Media
|
||||
- Cost: ~$1.50/month (50 × $0.03 using Qwen Image OSS model)
|
||||
- Enables visual content creation workflows
|
||||
- **Default to Qwen Image OSS model** for cost efficiency
|
||||
|
||||
#### 5. **Image Editing** (OSS Models via WaveSpeed)
|
||||
- **Current**: 30 edits/month
|
||||
- **Proposed**: **50 edits/month** ✅ (INCREASED - OSS models are cheaper)
|
||||
- **Rationale**:
|
||||
- OSS models (Qwen Edit $0.02) are cheaper than Stability ($0.04)
|
||||
- Cost: ~$1.00/month (50 × $0.02 using Qwen Edit OSS model)
|
||||
- Sufficient for image optimization workflows
|
||||
- **Default to Qwen Edit OSS model** for cost efficiency
|
||||
|
||||
#### 6. **Video Generation** (OSS Models via WaveSpeed)
|
||||
- **Current**: 20 videos/month
|
||||
- **Proposed**: **30 videos/month** ✅ (INCREASED - OSS models available)
|
||||
- **Rationale**:
|
||||
- OSS models (WAN 2.5 $0.25 per 5s video) provide good value
|
||||
- Allows ~6-10 full video projects/month (assuming 3-5 scenes each)
|
||||
- Cost: ~$7.50/month (30 × $0.25 using WAN 2.5 OSS model)
|
||||
- Enables Video Studio, YouTube Creator, Story Writer video features
|
||||
- **Default to WAN 2.5 OSS model** for cost efficiency
|
||||
|
||||
#### 7. **Audio Generation** (OSS Models via WaveSpeed)
|
||||
- **Current**: 50 generations/month
|
||||
- **Proposed**: **100 generations/month** ✅ (INCREASED - OSS models are affordable)
|
||||
- **Rationale**:
|
||||
- OSS models (Minimax Speech 02 HD) provide high quality at $0.05/1K chars
|
||||
- Sufficient for Podcast Maker, Story Writer narration
|
||||
- Cost: ~$10.00-$25.00/month (depending on length, assuming 2K-5K chars per audio)
|
||||
- Enables audio content workflows
|
||||
- **Default to Minimax Speech 02 HD OSS model**
|
||||
|
||||
#### 8. **Monthly Cost Cap**
|
||||
- **Current**: $50.00
|
||||
- **Proposed**: **$45.00** ✅ (ADJUSTED - aligns with $40-50 target)
|
||||
- **Rationale**:
|
||||
- Protects against unexpected high usage
|
||||
- Allows flexibility within limits
|
||||
- Provides safety margin
|
||||
- Aligns with $40-50 hard limit requirement
|
||||
|
||||
---
|
||||
|
||||
## Cost Analysis: Proposed Basic Tier (OSS-Focused)
|
||||
|
||||
### Monthly Cost Breakdown (Per User) - Using OSS Models
|
||||
|
||||
| Category | Usage | Cost per Unit (OSS) | Monthly Cost |
|
||||
|----------|-------|---------------------|--------------|
|
||||
| **LLM Calls** | 50 calls | $0.0056 avg (Gemini Flash) | **$0.28** |
|
||||
| **Search APIs** | 200 searches | $0.002 avg | **$0.40** |
|
||||
| **Image Generation** | 50 images | $0.03 (Qwen Image OSS) | **$1.50** |
|
||||
| **Image Editing** | 50 edits | $0.02 (Qwen Edit OSS) | **$1.00** |
|
||||
| **Video Generation** | 30 videos | $0.25 (WAN 2.5 OSS, ~5s) | **$7.50** |
|
||||
| **Audio Generation** | 100 audios | $0.10-$0.50 avg | **$10.00-$25.00** |
|
||||
| **Total Variable Cost** | | | **$20.68-$35.68** |
|
||||
|
||||
### Margin Analysis (OSS-Focused)
|
||||
|
||||
**Subscription Revenue**: $29.00/month
|
||||
**Variable Costs (OSS Models)**: $20.68-$35.68/month (depending on usage)
|
||||
**Gross Margin**: **-$6.68 to +$8.32/month**
|
||||
|
||||
**✅ IMPROVEMENT**: OSS models reduce costs significantly:
|
||||
- Image generation: $0.03 vs $0.04 (25% savings)
|
||||
- Image editing: $0.02 vs $0.04 (50% savings)
|
||||
- Video generation: $0.25 vs $0.42 (40% savings)
|
||||
|
||||
**Mitigation Strategy**:
|
||||
1. **Cost cap enforcement**: Monthly cost cap of $45 prevents extreme losses
|
||||
2. **OSS model defaults**: Default to cheaper OSS models (Qwen, WAN 2.5)
|
||||
3. **Realistic usage**: Most users won't hit all limits simultaneously
|
||||
4. **Average usage assumption**: ~60-70% of limits = $12-25 cost = $4-17 margin
|
||||
5. **Hard limit protection**: $45 cap ensures we never exceed $50/user/month
|
||||
|
||||
---
|
||||
|
||||
## Revised Basic Tier Limits (Production-Ready, OSS-Focused)
|
||||
|
||||
```python
|
||||
{
|
||||
"name": "Basic",
|
||||
"tier": SubscriptionTier.BASIC,
|
||||
"price_monthly": 29.0,
|
||||
"price_yearly": 290.0,
|
||||
|
||||
# AI Text Generation (Unified Limit)
|
||||
"ai_text_generation_calls_limit": 50, # INCREASED from 10
|
||||
|
||||
# Token Limits (Per Provider)
|
||||
"gemini_tokens_limit": 100000, # INCREASED from 20,000
|
||||
"openai_tokens_limit": 100000, # INCREASED from 20,000
|
||||
"anthropic_tokens_limit": 100000, # INCREASED from 20,000
|
||||
"mistral_tokens_limit": 100000, # INCREASED from 20,000
|
||||
|
||||
# Search APIs
|
||||
"tavily_calls_limit": 200, # Keep
|
||||
"serper_calls_limit": 200, # Keep
|
||||
"metaphor_calls_limit": 100, # Keep
|
||||
"firecrawl_calls_limit": 100, # Keep
|
||||
"exa_calls_limit": 500, # Keep
|
||||
|
||||
# Media Generation (OSS Models via WaveSpeed)
|
||||
"stability_calls_limit": 50, # INCREASED from 5 (using Qwen Image OSS $0.03)
|
||||
"image_edit_calls_limit": 50, # INCREASED from 30 (using Qwen Edit OSS $0.02)
|
||||
"video_calls_limit": 30, # INCREASED from 20 (using WAN 2.5 OSS $0.25)
|
||||
"audio_calls_limit": 100, # INCREASED from 50 (using Minimax Speech OSS)
|
||||
|
||||
# Cost Protection
|
||||
"monthly_cost_limit": 45.0, # ADJUSTED from 50.0 (aligns with $40-50 target)
|
||||
|
||||
# OSS Model Defaults
|
||||
"default_image_model": "qwen-image", # OSS model via WaveSpeed
|
||||
"default_image_edit_model": "qwen-edit", # OSS model via WaveSpeed
|
||||
"default_video_model": "wan-2.5", # OSS model via WaveSpeed
|
||||
"default_audio_model": "minimax-speech-02-hd", # OSS model via WaveSpeed
|
||||
|
||||
# Features
|
||||
"features": [
|
||||
"full_content_generation",
|
||||
"advanced_research",
|
||||
"basic_analytics",
|
||||
"all_tools_access", # All ALwrity tools accessible
|
||||
"billing_dashboard",
|
||||
"usage_tracking",
|
||||
"oss_models_priority" # NEW: OSS models prioritized for cost efficiency
|
||||
],
|
||||
"description": "Perfect for individuals and small teams. Access all ALwrity features with generous limits powered by OSS AI models."
|
||||
}
|
||||
```
|
||||
|
||||
---
|
||||
|
||||
## Tool Usage Scenarios & Limits
|
||||
|
||||
### Scenario 1: Blog Writer User
|
||||
- **Workflow**: 1 blog post = 3-5 LLM calls + 3-5 search calls + 1-2 images
|
||||
- **Monthly Capacity**: ~10-16 blog posts (with 50 LLM calls)
|
||||
- **Cost**: ~$0.50-$1.00 per blog post
|
||||
- **Status**: ✅ **FEASIBLE**
|
||||
|
||||
### Scenario 2: Story Writer User
|
||||
- **Workflow**: 1 story = 2-3 LLM calls + 5-10 images + 5-10 audio + 5-10 videos
|
||||
- **Monthly Capacity**: ~16-25 stories (LLM limit) OR ~3-6 stories (image/video limits)
|
||||
- **Cost**: ~$2.00-$5.00 per story
|
||||
- **Status**: ✅ **FEASIBLE** (limited by media, not LLM)
|
||||
|
||||
### Scenario 3: Podcast Maker User
|
||||
- **Workflow**: 1 podcast = 3-4 LLM calls + 3-5 search calls + 5-10 audio + optional 5-10 videos
|
||||
- **Monthly Capacity**: ~12-16 podcasts (LLM limit) OR ~5-10 podcasts (audio limit)
|
||||
- **Cost**: ~$1.00-$3.00 per podcast (without video)
|
||||
- **Status**: ✅ **FEASIBLE**
|
||||
|
||||
### Scenario 4: Social Media Content Creator
|
||||
- **Workflow**: 1 post = 1-2 LLM calls + 1 image (optional)
|
||||
- **Monthly Capacity**: ~25-50 posts (LLM limit) OR ~30 posts (image limit)
|
||||
- **Cost**: ~$0.10-$0.15 per post
|
||||
- **Status**: ✅ **FEASIBLE**
|
||||
|
||||
### Scenario 5: Video Creator (YouTube Creator)
|
||||
- **Workflow**: 1 video = 2-3 LLM calls + 5 scenes × (1 image + 1 audio + 1 video)
|
||||
- **Monthly Capacity**: ~4-5 full videos (video limit) OR ~16-25 videos (LLM limit)
|
||||
- **Cost**: ~$3.00-$5.00 per video
|
||||
- **Status**: ✅ **FEASIBLE** (limited by video limit, not LLM)
|
||||
|
||||
---
|
||||
|
||||
## Risk Mitigation Strategies
|
||||
|
||||
### 1. **Cost Cap Enforcement**
|
||||
- **Monthly cost cap**: $50.00 (hard limit)
|
||||
- **Behavior**: When cap reached, all API calls blocked until next billing period
|
||||
- **Protection**: Prevents losses from extreme usage
|
||||
|
||||
### 2. **Pre-flight Validation**
|
||||
- **Implementation**: Already in place
|
||||
- **Function**: Validates limits BEFORE making API calls
|
||||
- **Benefit**: Prevents wasted API calls on operations that would fail
|
||||
|
||||
### 3. **Usage Monitoring & Alerts**
|
||||
- **80% Warning**: Alert users at 80% of limits
|
||||
- **100% Block**: Block operations at 100% of limits
|
||||
- **Dashboard**: Real-time usage tracking
|
||||
|
||||
### 4. **Optimized Default Models**
|
||||
- **Strategy**: Use cost-effective models by default (Gemini Flash, GPT-4o Mini)
|
||||
- **Benefit**: Reduces costs while maintaining quality
|
||||
- **User Control**: Allow model selection for power users
|
||||
|
||||
### 5. **Efficient API Usage**
|
||||
- **Batching**: Batch multiple operations where possible
|
||||
- **Caching**: Cache research results and common queries
|
||||
- **Optimization**: Continue optimizing tool workflows to reduce API calls
|
||||
|
||||
---
|
||||
|
||||
## Pricing Page Updates Required
|
||||
|
||||
### Current Issues
|
||||
1. Pricing page shows outdated limits
|
||||
2. Missing unified `ai_text_generation_calls_limit` explanation
|
||||
3. Token limits don't match code (shows 1M/500K, code has 20K)
|
||||
4. Missing video/audio/image editing limits
|
||||
5. Missing cost transparency information
|
||||
|
||||
### Required Updates
|
||||
|
||||
#### Basic Tier Display
|
||||
```
|
||||
💰 Basic Plan - $29/month ($290/year)
|
||||
|
||||
✨ All ALwrity Features Included:
|
||||
✅ Blog Writer, Story Writer, Podcast Maker
|
||||
✅ Image Generator & Editor
|
||||
✅ Video Studio & YouTube Creator
|
||||
✅ Audio Generator
|
||||
✅ All Social Media Writers
|
||||
✅ All SEO Tools & Dashboards
|
||||
✅ Content Planning & Strategy Tools
|
||||
|
||||
📊 Usage Limits:
|
||||
• 50 AI Text Generations/month (unified across all LLM providers)
|
||||
• 100,000 tokens per provider (Gemini, OpenAI, Anthropic, Mistral)
|
||||
• 200 Research Searches/month (Tavily, Serper)
|
||||
• 500 Neural Searches/month (Exa)
|
||||
• 30 AI Images/month
|
||||
• 30 Image Edits/month
|
||||
• 20 AI Videos/month
|
||||
• 50 AI Audio Generations/month
|
||||
• $50 Monthly Cost Cap (protects you from overages)
|
||||
|
||||
💡 Perfect for: Individuals, content creators, small teams
|
||||
```
|
||||
|
||||
---
|
||||
|
||||
## Implementation Checklist
|
||||
|
||||
### Phase 1: Update Code Limits
|
||||
- [ ] Update `pricing_service.py` Basic tier limits:
|
||||
- [ ] `ai_text_generation_calls_limit`: 10 → 50
|
||||
- [ ] `gemini_tokens_limit`: 20,000 → 100,000
|
||||
- [ ] `openai_tokens_limit`: 20,000 → 100,000
|
||||
- [ ] `anthropic_tokens_limit`: 20,000 → 100,000
|
||||
- [ ] `mistral_tokens_limit`: 20,000 → 100,000
|
||||
- [ ] `stability_calls_limit`: 5 → 30
|
||||
- [ ] Run database migration script
|
||||
- [ ] Test limit enforcement
|
||||
|
||||
### Phase 2: Update Pricing Page
|
||||
- [ ] Update `docs-site/docs/features/subscription/pricing.md`
|
||||
- [ ] Update frontend pricing page component
|
||||
- [ ] Add cost transparency section
|
||||
- [ ] Add tool usage examples
|
||||
- [ ] Add FAQ section
|
||||
|
||||
### Phase 3: Update Documentation
|
||||
- [ ] Update subscription rule file (`.cursor/rules/subscription.mdc`)
|
||||
- [ ] Update API documentation
|
||||
- [ ] Create user-facing pricing guide
|
||||
|
||||
### Phase 4: Testing
|
||||
- [ ] Test all tools with new limits
|
||||
- [ ] Verify cost calculations
|
||||
- [ ] Test limit enforcement
|
||||
- [ ] Test cost cap enforcement
|
||||
- [ ] Verify pre-flight validation
|
||||
|
||||
---
|
||||
|
||||
## Cost Calculation Examples
|
||||
|
||||
### Example 1: Blog Writer - 1 Blog Post (OSS Models)
|
||||
```
|
||||
Research: 3 Exa searches = $0.015
|
||||
Outline: 1 LLM call (Gemini Flash) = $0.0056
|
||||
Content: 2 LLM calls (Gemini Flash) = $0.0112
|
||||
Image: 1 Qwen Image OSS = $0.03 (vs $0.04 Stability)
|
||||
Total: ~$0.06 per blog post (saved $0.01 with OSS)
|
||||
```
|
||||
|
||||
### Example 2: Story Writer - 1 Story (5 scenes, OSS Models)
|
||||
```
|
||||
Outline: 1 LLM call = $0.0056
|
||||
Script: 1 LLM call = $0.0056
|
||||
Images: 5 × $0.03 (Qwen Image OSS) = $0.15 (vs $0.20)
|
||||
Audio: 5 × $0.10 = $0.50
|
||||
Videos: 5 × $0.25 (WAN 2.5 OSS) = $1.25 (vs $0.50-$2.10)
|
||||
Total: ~$1.96 per story (higher video cost, but better quality)
|
||||
```
|
||||
|
||||
### Example 3: Podcast Maker - 1 Episode (10 min, 5 scenes, OSS Models)
|
||||
```
|
||||
Research: 3 Exa searches = $0.015
|
||||
Script: 1 LLM call = $0.0056
|
||||
Outline: 1 LLM call = $0.0056
|
||||
Audio: 5 × $0.20 (Minimax Speech OSS) = $1.00
|
||||
Video (optional): 5 × $0.25 (WAN 2.5 OSS) = $1.25
|
||||
Total: ~$1.03 per podcast (without video)
|
||||
Total: ~$2.28 per podcast (with video, OSS models)
|
||||
```
|
||||
|
||||
### Example 4: Social Media - 10 Posts (OSS Models)
|
||||
```
|
||||
Generation: 10 × 1 LLM call = 10 calls × $0.0056 = $0.056
|
||||
Images: 10 × $0.03 (Qwen Image OSS) = $0.30 (vs $0.40)
|
||||
Total: ~$0.36 for 10 posts (saved $0.10 with OSS)
|
||||
```
|
||||
|
||||
---
|
||||
|
||||
## Competitive Analysis
|
||||
|
||||
### Similar AI Content Platforms
|
||||
|
||||
| Platform | Price | Limits | Notes |
|
||||
|----------|-------|--------|-------|
|
||||
| **Jasper** | $49/month | 50K words | Text-focused |
|
||||
| **Copy.ai** | $49/month | Unlimited words | Text-focused |
|
||||
| **Writesonic** | $19/month | 100K words | Text-focused |
|
||||
| **ALwrity Basic** | $29/month | 50 LLM calls + media | **Full platform** |
|
||||
|
||||
**ALwrity Advantage**:
|
||||
- Lower price point ($29 vs $49)
|
||||
- Includes video, image, audio generation (competitors don't)
|
||||
- Comprehensive tool suite (not just text)
|
||||
- Better value proposition
|
||||
|
||||
---
|
||||
|
||||
## Recommendations Summary
|
||||
|
||||
### ✅ **APPROVED: Production-Ready Basic Tier (OSS-Focused)**
|
||||
|
||||
**Price**: $29/month ($290/year) - **KEEP**
|
||||
|
||||
**Key Changes** (OSS-Focused):
|
||||
1. ✅ **Increase AI Text Generation**: 10 → **50 calls/month**
|
||||
2. ✅ **Increase Token Limits**: 20K → **100K per provider**
|
||||
3. ✅ **Increase Image Generation**: 5 → **50 images/month** (OSS: Qwen Image $0.03)
|
||||
4. ✅ **Increase Image Editing**: 30 → **50 edits/month** (OSS: Qwen Edit $0.02)
|
||||
5. ✅ **Increase Video Generation**: 20 → **30 videos/month** (OSS: WAN 2.5 $0.25)
|
||||
6. ✅ **Increase Audio Generation**: 50 → **100 generations/month** (OSS: Minimax Speech)
|
||||
7. ✅ **Adjust Cost Cap**: $50 → **$45** (aligns with $40-50 target)
|
||||
8. ✅ **Default to OSS Models**: Qwen, WAN 2.5, Minimax Speech (cost-efficient)
|
||||
|
||||
**Expected Outcomes**:
|
||||
- Users can experience all ALwrity features with generous limits
|
||||
- Sustainable cost structure (~$20-35/user/month average with OSS models)
|
||||
- Competitive pricing ($29 vs competitors $49+)
|
||||
- Room for margin ($4-17/user/month average)
|
||||
- Cost cap ($45) protects against losses (hard limit $40-50)
|
||||
- **OSS models provide 25-50% cost savings** vs proprietary alternatives
|
||||
|
||||
**Risk Level**: 🟢 **LOW** (with cost cap enforcement and OSS model defaults)
|
||||
|
||||
---
|
||||
|
||||
## Implementation Plan
|
||||
|
||||
### Phase 1: Update Pricing Service & Database (Priority: HIGH)
|
||||
|
||||
#### 1.1 Update `pricing_service.py` Basic Tier Limits
|
||||
**File**: `backend/services/subscription/pricing_service.py`
|
||||
|
||||
**Changes Required**:
|
||||
```python
|
||||
# In initialize_default_plans() method
|
||||
{
|
||||
"name": "Basic",
|
||||
"tier": SubscriptionTier.BASIC,
|
||||
"price_monthly": 29.0,
|
||||
"price_yearly": 290.0,
|
||||
|
||||
# AI Text Generation (Unified Limit)
|
||||
"ai_text_generation_calls_limit": 50, # Changed from 10
|
||||
|
||||
# Token Limits (Per Provider)
|
||||
"gemini_tokens_limit": 100000, # Changed from 20,000
|
||||
"openai_tokens_limit": 100000, # Changed from 20,000
|
||||
"anthropic_tokens_limit": 100000, # Changed from 20,000
|
||||
"mistral_tokens_limit": 100000, # Changed from 20,000
|
||||
|
||||
# Search APIs (Keep existing)
|
||||
"tavily_calls_limit": 200,
|
||||
"serper_calls_limit": 200,
|
||||
"metaphor_calls_limit": 100,
|
||||
"firecrawl_calls_limit": 100,
|
||||
"exa_calls_limit": 500,
|
||||
|
||||
# Media Generation (OSS Models via WaveSpeed)
|
||||
"stability_calls_limit": 50, # Changed from 5 (now includes WaveSpeed OSS)
|
||||
"image_edit_calls_limit": 50, # Changed from 30
|
||||
"video_calls_limit": 30, # Changed from 20
|
||||
"audio_calls_limit": 100, # Changed from 50
|
||||
|
||||
# Cost Protection
|
||||
"monthly_cost_limit": 45.0, # Changed from 50.0
|
||||
}
|
||||
```
|
||||
|
||||
**Action Items**:
|
||||
- [ ] Update `initialize_default_plans()` method in `pricing_service.py`
|
||||
- [ ] Run database migration to update existing Basic tier subscriptions
|
||||
- [ ] Test limit enforcement with new values
|
||||
- [ ] Verify cost calculations reflect OSS model pricing
|
||||
|
||||
#### 1.2 Update WaveSpeed Model Pricing in `pricing_service.py`
|
||||
**File**: `backend/services/subscription/pricing_service.py`
|
||||
|
||||
**Changes Required**:
|
||||
```python
|
||||
# In initialize_default_pricing() method, update/add WaveSpeed OSS model pricing:
|
||||
|
||||
# Image Generation (OSS Models via WaveSpeed)
|
||||
{
|
||||
"provider": APIProvider.IMAGE,
|
||||
"model_name": "qwen-image",
|
||||
"cost_per_request": 0.03, # OSS model via WaveSpeed
|
||||
"description": "WaveSpeed Qwen Image (OSS) - Fast generation"
|
||||
},
|
||||
{
|
||||
"provider": APIProvider.IMAGE,
|
||||
"model_name": "ideogram-v3-turbo",
|
||||
"cost_per_request": 0.05, # OSS model via WaveSpeed
|
||||
"description": "WaveSpeed Ideogram V3 Turbo (OSS) - Photorealistic"
|
||||
},
|
||||
|
||||
# Image Editing (OSS Models via WaveSpeed)
|
||||
{
|
||||
"provider": APIProvider.IMAGE_EDIT,
|
||||
"model_name": "qwen-edit",
|
||||
"cost_per_request": 0.02, # OSS model via WaveSpeed
|
||||
"description": "WaveSpeed Qwen Image Edit (OSS) - Budget editing"
|
||||
},
|
||||
{
|
||||
"provider": APIProvider.IMAGE_EDIT,
|
||||
"model_name": "qwen-edit-plus",
|
||||
"cost_per_request": 0.02, # OSS model via WaveSpeed
|
||||
"description": "WaveSpeed Qwen Image Edit Plus (OSS) - Multi-image"
|
||||
},
|
||||
{
|
||||
"provider": APIProvider.IMAGE_EDIT,
|
||||
"model_name": "flux-kontext-pro",
|
||||
"cost_per_request": 0.04, # OSS model via WaveSpeed
|
||||
"description": "WaveSpeed FLUX Kontext Pro (OSS) - Professional"
|
||||
},
|
||||
|
||||
# Video Generation (OSS Models via WaveSpeed)
|
||||
{
|
||||
"provider": APIProvider.VIDEO,
|
||||
"model_name": "wan-2.5",
|
||||
"cost_per_request": 0.25, # OSS model via WaveSpeed (~5 seconds)
|
||||
"description": "WaveSpeed WAN 2.5 (OSS) - Text-to-Video, Image-to-Video"
|
||||
},
|
||||
{
|
||||
"provider": APIProvider.VIDEO,
|
||||
"model_name": "seedance-1.5-pro",
|
||||
"cost_per_request": 0.40, # OSS model via WaveSpeed (~5 seconds)
|
||||
"description": "WaveSpeed Seedance 1.5 Pro (OSS) - Longer duration"
|
||||
},
|
||||
|
||||
# Audio Generation (OSS Models via WaveSpeed)
|
||||
{
|
||||
"provider": APIProvider.AUDIO,
|
||||
"model_name": "minimax-speech-02-hd",
|
||||
"cost_per_input_token": 0.00005, # $0.05 per 1K chars
|
||||
"cost_per_output_token": 0.0,
|
||||
"cost_per_request": 0.0,
|
||||
"description": "WaveSpeed Minimax Speech 02 HD (OSS) - High-quality TTS"
|
||||
},
|
||||
```
|
||||
|
||||
**Action Items**:
|
||||
- [ ] Add WaveSpeed OSS model pricing entries
|
||||
- [ ] Update default model selection logic to prefer OSS models
|
||||
- [ ] Test cost calculation with OSS models
|
||||
- [ ] Verify pricing accuracy against WaveSpeed API documentation
|
||||
|
||||
#### 1.3 Update Default Model Selection Logic
|
||||
**Files**:
|
||||
- `backend/services/llm_providers/main_image_generation.py`
|
||||
- `backend/services/image_studio/create_service.py`
|
||||
- `backend/services/image_studio/edit_service.py`
|
||||
- `backend/services/video_studio/video_service.py`
|
||||
- `backend/services/audio_generation/audio_service.py`
|
||||
|
||||
**Changes Required**:
|
||||
- Default image generation to `qwen-image` (OSS) instead of Stability
|
||||
- Default image editing to `qwen-edit` (OSS) instead of Stability
|
||||
- Default video generation to `wan-2.5` (OSS) instead of HuggingFace
|
||||
- Default audio generation to `minimax-speech-02-hd` (OSS)
|
||||
|
||||
**Action Items**:
|
||||
- [ ] Update `get_default_provider()` methods to prefer WaveSpeed OSS models
|
||||
- [ ] Update model selection UI to show OSS models as default/recommended
|
||||
- [ ] Add cost comparison tooltips showing OSS model savings
|
||||
- [ ] Test all tools with OSS model defaults
|
||||
|
||||
### Phase 2: Update Frontend & Documentation (Priority: HIGH)
|
||||
|
||||
#### 2.1 Update Pricing Page
|
||||
**File**: `docs-site/docs/features/subscription/pricing.md`
|
||||
|
||||
**Changes Required**:
|
||||
- Update Basic tier limits to reflect new values (50 images, 50 edits, 30 videos, 100 audio)
|
||||
- Add OSS model information and cost savings messaging
|
||||
- Update cost examples to use OSS model pricing
|
||||
- Add FAQ about OSS models and cost efficiency
|
||||
|
||||
**Action Items**:
|
||||
- [ ] Update pricing page markdown
|
||||
- [ ] Update frontend pricing component (if exists)
|
||||
- [ ] Add OSS model badges/indicators
|
||||
- [ ] Add cost comparison table (OSS vs proprietary)
|
||||
|
||||
#### 2.2 Update Subscription Context & Components
|
||||
**Files**:
|
||||
- `frontend/src/contexts/SubscriptionContext.tsx`
|
||||
- `frontend/src/components/billing/EnhancedBillingDashboard.tsx`
|
||||
- `frontend/src/components/shared/UsageDashboard.tsx`
|
||||
|
||||
**Changes Required**:
|
||||
- Display OSS model indicators in usage dashboard
|
||||
- Show cost savings from using OSS models
|
||||
- Update limit displays to show new Basic tier limits
|
||||
- Add tooltips explaining OSS model benefits
|
||||
|
||||
**Action Items**:
|
||||
- [ ] Update limit displays in billing dashboard
|
||||
- [ ] Add OSS model indicators in cost breakdown
|
||||
- [ ] Update usage statistics to reflect new limits
|
||||
- [ ] Test UI with new limit values
|
||||
|
||||
### Phase 3: Testing & Validation (Priority: CRITICAL)
|
||||
|
||||
#### 3.1 Limit Enforcement Testing
|
||||
**Test Cases**:
|
||||
- [ ] Test 50 AI text generation calls limit
|
||||
- [ ] Test 50 image generation limit (OSS models)
|
||||
- [ ] Test 50 image editing limit (OSS models)
|
||||
- [ ] Test 30 video generation limit (OSS models)
|
||||
- [ ] Test 100 audio generation limit (OSS models)
|
||||
- [ ] Test $45 monthly cost cap enforcement
|
||||
- [ ] Test pre-flight validation with new limits
|
||||
- [ ] Test limit exceeded error messages
|
||||
|
||||
#### 3.2 Cost Calculation Testing
|
||||
**Test Cases**:
|
||||
- [ ] Verify Qwen Image cost: $0.03 per image
|
||||
- [ ] Verify Qwen Edit cost: $0.02 per edit
|
||||
- [ ] Verify WAN 2.5 video cost: $0.25 per video
|
||||
- [ ] Verify Minimax Speech cost: $0.05 per 1K chars
|
||||
- [ ] Test cost aggregation across all operations
|
||||
- [ ] Test cost cap enforcement at $45
|
||||
- [ ] Verify cost display in billing dashboard
|
||||
|
||||
#### 3.3 OSS Model Integration Testing
|
||||
**Test Cases**:
|
||||
- [ ] Test Qwen Image generation via WaveSpeed
|
||||
- [ ] Test Qwen Edit editing via WaveSpeed
|
||||
- [ ] Test WAN 2.5 video generation via WaveSpeed
|
||||
- [ ] Test Minimax Speech audio generation via WaveSpeed
|
||||
- [ ] Verify default model selection uses OSS models
|
||||
- [ ] Test model fallback if OSS model unavailable
|
||||
- [ ] Verify cost tracking for OSS models
|
||||
|
||||
### Phase 4: Database Migration (Priority: HIGH)
|
||||
|
||||
#### 4.1 Create Migration Script
|
||||
**File**: `backend/database/migrations/update_basic_tier_limits_oss.py`
|
||||
|
||||
**Script Requirements**:
|
||||
```python
|
||||
"""
|
||||
Migration: Update Basic Tier Limits for OSS-Focused Pricing Strategy
|
||||
- Increase AI text generation: 10 → 50
|
||||
- Increase token limits: 20K → 100K per provider
|
||||
- Increase image generation: 5 → 50
|
||||
- Increase image editing: 30 → 50
|
||||
- Increase video generation: 20 → 30
|
||||
- Increase audio generation: 50 → 100
|
||||
- Adjust cost cap: $50 → $45
|
||||
"""
|
||||
|
||||
def upgrade():
|
||||
# Update SubscriptionPlan for Basic tier
|
||||
# Update existing UserSubscription records
|
||||
# Clear pricing service cache
|
||||
pass
|
||||
|
||||
def downgrade():
|
||||
# Revert to previous limits if needed
|
||||
pass
|
||||
```
|
||||
|
||||
**Action Items**:
|
||||
- [ ] Create migration script
|
||||
- [ ] Test migration on staging database
|
||||
- [ ] Backup production database before migration
|
||||
- [ ] Run migration during maintenance window
|
||||
- [ ] Verify all subscriptions updated correctly
|
||||
|
||||
### Phase 5: Monitoring & Adjustment (Priority: MEDIUM)
|
||||
|
||||
#### 5.1 Set Up Monitoring
|
||||
**Metrics to Track**:
|
||||
- Average cost per user per month
|
||||
- Users hitting $45 cost cap
|
||||
- Users hitting individual limits
|
||||
- OSS model usage vs proprietary model usage
|
||||
- Cost savings from OSS models
|
||||
|
||||
**Action Items**:
|
||||
- [ ] Set up cost monitoring dashboard
|
||||
- [ ] Create alerts for cost cap breaches
|
||||
- [ ] Track OSS model adoption rate
|
||||
- [ ] Monitor user satisfaction with limits
|
||||
|
||||
#### 5.2 Adjustment Plan
|
||||
**Triggers for Adjustment**:
|
||||
- If average cost > $35/user: Consider reducing limits
|
||||
- If >15% users hit cost cap: Consider increasing cost cap to $50
|
||||
- If <20% users use video/audio: Consider reducing those limits
|
||||
- If OSS models unavailable: Fallback to proprietary models
|
||||
|
||||
**Action Items**:
|
||||
- [ ] Define adjustment criteria
|
||||
- [ ] Create adjustment workflow
|
||||
- [ ] Plan communication strategy for limit changes
|
||||
|
||||
---
|
||||
|
||||
## Next Steps (Priority Order)
|
||||
|
||||
1. **CRITICAL**: Update `pricing_service.py` with new Basic tier limits
|
||||
2. **CRITICAL**: Add WaveSpeed OSS model pricing to `pricing_service.py`
|
||||
3. **HIGH**: Update default model selection to prefer OSS models
|
||||
4. **HIGH**: Create and run database migration
|
||||
5. **HIGH**: Update pricing page documentation
|
||||
6. **HIGH**: Test limit enforcement and cost calculations
|
||||
7. **MEDIUM**: Update frontend components with new limits
|
||||
8. **MEDIUM**: Set up monitoring and alerts
|
||||
9. **LOW**: Add OSS model indicators to UI
|
||||
|
||||
---
|
||||
|
||||
## Monitoring & Adjustment Plan
|
||||
|
||||
### Key Metrics to Track
|
||||
- Average LLM calls per user per month
|
||||
- Average media generation per user per month
|
||||
- Average cost per user per month
|
||||
- Users hitting cost cap
|
||||
- Users hitting individual limits
|
||||
|
||||
### Adjustment Triggers
|
||||
- **If average cost > $25/user**: Consider reducing limits
|
||||
- **If >20% users hit cost cap**: Consider increasing cost cap
|
||||
- **If <10% users use video/audio**: Consider reducing those limits
|
||||
- **If churn rate high**: Consider increasing limits
|
||||
|
||||
### Review Schedule
|
||||
- **Week 1-2**: Daily monitoring
|
||||
- **Month 1**: Weekly review
|
||||
- **Month 2-3**: Bi-weekly review
|
||||
- **Month 4+**: Monthly review
|
||||
|
||||
---
|
||||
|
||||
## Conclusion
|
||||
|
||||
The proposed Basic tier limits (OSS-Focused) provide:
|
||||
- ✅ **Access to all ALwrity features** with generous limits
|
||||
- ✅ **Sustainable cost structure** using OSS models (25-50% savings)
|
||||
- ✅ **Competitive pricing** ($29 vs competitors $49+)
|
||||
- ✅ **Protection against losses** ($45 cost cap, hard limit $40-50)
|
||||
- ✅ **Room for growth** (can adjust based on usage)
|
||||
- ✅ **OSS-first strategy** (Qwen, FLUX, Ideogram, WAN 2.5, Minimax Speech)
|
||||
- ✅ **Maximum user value** while staying within cost constraints
|
||||
|
||||
**Key Advantages of OSS-Focused Strategy**:
|
||||
1. **Cost Efficiency**: 25-50% cost savings vs proprietary models
|
||||
2. **Better Limits**: Can offer more generations due to lower costs
|
||||
3. **User Value**: More value for the same $29/month price
|
||||
4. **Sustainability**: Lower costs = better margins = sustainable business
|
||||
5. **Flexibility**: Can adjust limits based on actual usage patterns
|
||||
|
||||
**Recommendation**: **APPROVE** for production launch with OSS-focused strategy.
|
||||
|
||||
**Confidence Level**: 🟢 **HIGH** (with proper monitoring, cost cap enforcement, and OSS model defaults)
|
||||
|
||||
**Risk Mitigation**:
|
||||
- $45 cost cap protects against losses (hard limit $40-50)
|
||||
- OSS model defaults ensure cost efficiency
|
||||
- Monitoring allows quick adjustment if needed
|
||||
- Realistic usage assumptions (60-70% of limits)
|
||||
Reference in New Issue
Block a user