Files
ALwrity/docs/image studio/IMAGE_STUDIO_EDITING_RECOMMENDED_MODELS.md

7.0 KiB
Raw Permalink Blame History

Image Studio Editing - Recommended Additional Models

Date: Current Session
Status: Ready for Documentation
Current Progress: 3 of 14 models integrated (21%)


Currently Integrated (3/14)

  1. Qwen Image Edit Plus ($0.02) - Budget, multi-image, ControlNet
  2. Google Nano Banana Pro Edit Ultra ($0.15-0.18) - Premium, 4K/8K, multilingual
  3. Bytedance Seedream V4.5 Edit ($0.04) - Mid-tier, reference-faithful, 4K

Priority 1: High-Value, Cost-Effective Models

1. Qwen Image Edit (Basic Version)

  • Why: Budget alternative to Qwen Edit Plus, simpler use cases
  • Cost: ~$0.02 (estimated)
  • Use Case: Basic editing when Plus features aren't needed
  • Docs Needed: Model path, exact cost, max resolution, capabilities

2. Alibaba WAN 2.5 Image Edit

  • Why: Structure-preserving edits, good balance of cost/quality
  • Cost: ~$0.035 (from enhancement proposal)
  • Use Case: Quick adjustments, cost-effective professional editing
  • Docs Needed: Model path, exact cost, API parameters, capabilities

3. Step1X Edit

  • Why: Simple, straightforward editing for quick modifications
  • Cost: ~$0.03 (from enhancement proposal)
  • Use Case: Quick edits, precise modifications
  • Docs Needed: Model path, exact cost, API parameters

Priority 2: Premium Quality Models

4. FLUX Kontext Pro

  • Why: Improved prompt adherence, typography generation
  • Cost: ~$0.04 (from enhancement proposal)
  • Use Case: Typography-heavy edits, consistent results
  • Docs Needed: Model path, exact cost, typography capabilities, API params

5. FLUX Kontext Max

  • Why: Premium quality, high-fidelity transformations
  • Cost: ~$0.08 (from enhancement proposal)
  • Use Case: Professional retouching, style transformations
  • Docs Needed: Model path, exact cost, quality tiers, API params

6. FLUX Kontext Pro Multi

  • Why: Multi-image editing with FLUX quality
  • Cost: ~$0.04-0.08 (estimated)
  • Use Case: Batch editing with consistent style
  • Docs Needed: Model path, cost, multi-image support, API params

Priority 3: Specialized Models

7. SeedEdit V3 (Bytedance)

  • Why: Prompt-guided editing, identity preservation
  • Cost: ~$0.027 (from enhancement proposal)
  • Use Case: Portrait edits, e-commerce variants
  • Docs Needed: Model path, exact cost, identity preservation features

8. HiDream E1 Full

  • Why: Identity-preserving edits, wardrobe/accessory changes
  • Cost: ~$0.024 (from enhancement proposal)
  • Use Case: Fashion edits, character consistency
  • Docs Needed: Model path, exact cost, identity preservation features

9. Ideogram Character

  • Why: Character consistency, outfit/appearance changes
  • Cost: ~$0.10-0.20 (from enhancement proposal)
  • Use Case: Character-focused editing, consistent character work
  • Docs Needed: Model path, exact cost, character consistency features

Priority 4: Advanced/Specialized

10. OpenAI GPT Image 1

  • Why: Quality tiers, mask support, style transfers
  • Cost: ~$0.011-$0.250 (varies by tier)
  • Use Case: Style transfers, creative transformations
  • Docs Needed: Model path, cost tiers, quality options, API params

11. Z-Image Turbo Inpaint

  • Why: Fast inpainting, specialized for object removal
  • Cost: Unknown (need docs)
  • Use Case: Quick object removal, inpainting
  • Docs Needed: Model path, cost, speed, capabilities

12. Image Zoom-Out

  • Why: Specialized outpainting/zoom-out functionality
  • Cost: Unknown (need docs)
  • Use Case: Extending images, outpainting
  • Docs Needed: Model path, cost, zoom-out capabilities

📊 Model Comparison Matrix

Model Cost Tier Max Res Multi-Image Special Features
Qwen Edit Plus $0.02 Budget 1536×1536 (3) ControlNet, Bilingual
Nano Banana Pro $0.15-0.18 Premium 8192×8192 (14) 4K/8K, Multilingual
Seedream V4.5 $0.04 Mid 4096×4096 (10) Reference-faithful
Qwen Edit ~$0.02 Budget ? Basic editing
WAN 2.5 Edit ~$0.035 Mid ? Structure-preserving
Step1X Edit ~$0.03 Budget ? Simple, precise
FLUX Kontext Pro ~$0.04 Mid ? Typography
FLUX Kontext Max ~$0.08 Premium ? High-fidelity
SeedEdit V3 ~$0.027 Mid ? Identity preservation
HiDream E1 ~$0.024 Mid ? Identity preservation
Ideogram Character ~$0.10-0.20 Premium ? Character consistency

Phase 1: Complete Budget Tier (Next 2-3 models)

  1. Qwen Image Edit (basic) - Complete Qwen family
  2. Step1X Edit - Simple, cost-effective option
  3. WAN 2.5 Edit - Good mid-tier option

Result: 6 models total, covering budget to mid-tier

Phase 2: Add Premium Options (Next 2-3 models)

  1. FLUX Kontext Pro - Typography focus
  2. FLUX Kontext Max - Premium quality
  3. SeedEdit V3 - Identity preservation

Result: 9 models total, covering all tiers

Phase 3: Specialized Models (Remaining)

  1. HiDream E1 Full - Fashion/character
  2. Ideogram Character - Character consistency
  3. FLUX Kontext Pro Multi - Multi-image FLUX
  4. OpenAI GPT Image 1 - Quality tiers
  5. Z-Image Turbo Inpaint - Fast inpainting
  6. Image Zoom-Out - Specialized outpainting

Result: 14 models total, comprehensive coverage


📋 Documentation Requirements

For each model, please provide:

  1. Model Information:

    • Model ID (e.g., "qwen-edit")
    • Model path/endpoint (e.g., "wavespeed-ai/qwen-image/edit")
    • Display name
  2. Pricing:

    • Cost per edit (exact amount)
    • Any tiered pricing (e.g., 4K vs 8K)
  3. Technical Specs:

    • Max resolution (width × height)
    • Supported operations/capabilities
    • Multi-image support (max number)
  4. API Parameters:

    • Required parameters
    • Optional parameters
    • Parameter format (size vs aspect_ratio/resolution)
    • Special parameters (e.g., seed, guidance_scale)
  5. Special Features:

    • Identity preservation
    • Typography support
    • ControlNet support
    • Multi-language support
    • Character consistency

💡 Quick Wins

If you want to prioritize based on user value:

  1. Qwen Image Edit (basic) - Complete the Qwen family, budget option
  2. WAN 2.5 Edit - Good balance, structure-preserving
  3. FLUX Kontext Pro - Typography is a unique feature
  4. SeedEdit V3 - Identity preservation is valuable for portraits

These 4 models would give us 7 total, covering:

  • Budget tier: Qwen Edit, Qwen Edit Plus, Step1X
  • Mid tier: Seedream V4.5, WAN 2.5, FLUX Kontext Pro
  • Premium tier: Nano Banana Pro, SeedEdit V3

Ready to integrate once documentation is provided