524 lines
17 KiB
Markdown
524 lines
17 KiB
Markdown
# Competitor Analysis & Sitemap Analysis Plan for Onboarding Step 4
|
||
|
||
## Overview
|
||
|
||
This document outlines the implementation plan for Phase 1 of Step 4 onboarding, focusing on competitor analysis using the Exa API and enhanced sitemap analysis. This approach provides comprehensive competitive intelligence while optimizing API usage and costs.
|
||
|
||
---
|
||
|
||
## 1. Exa API Integration for Competitor Discovery
|
||
|
||
### 1.1 Exa API Analysis
|
||
|
||
Based on the [Exa API documentation](https://docs.exa.ai/reference/find-similar-links), the `findSimilar` endpoint is perfectly suited for competitor discovery:
|
||
|
||
#### Key Features for Competitor Analysis
|
||
- **Neural Search**: Uses AI to find semantically similar content (up to 100 results)
|
||
- **Content Analysis**: Provides summaries, highlights, and full text
|
||
- **Domain Filtering**: Can include/exclude specific domains
|
||
- **Date Filtering**: Filter by published/crawl dates
|
||
- **Cost Effective**: $0.005 for 1-25 results, $0.025 for 26-100 results
|
||
|
||
#### Optimal API Configuration for Competitor Discovery
|
||
```json
|
||
{
|
||
"url": "https://user-website.com",
|
||
"numResults": 25,
|
||
"contents": {
|
||
"text": true,
|
||
"summary": {
|
||
"query": "Business model, target audience, content strategy"
|
||
},
|
||
"highlights": {
|
||
"numSentences": 2,
|
||
"highlightsPerUrl": 3,
|
||
"query": "Unique value proposition, competitive advantages"
|
||
}
|
||
},
|
||
"context": true,
|
||
"moderation": true
|
||
}
|
||
```
|
||
|
||
### 1.2 Competitor Discovery Strategy
|
||
|
||
#### Phase 1: Initial Competitor Discovery
|
||
```python
|
||
async def discover_competitors(user_url: str, industry: str = None) -> Dict[str, Any]:
|
||
"""
|
||
Discover competitors using Exa API findSimilar endpoint
|
||
"""
|
||
# Primary competitor search
|
||
primary_competitors = await exa.find_similar_and_contents(
|
||
url=user_url,
|
||
num_results=15,
|
||
contents={
|
||
"text": True,
|
||
"summary": {
|
||
"query": f"Business model, target audience, content strategy in {industry or 'this industry'}"
|
||
},
|
||
"highlights": {
|
||
"numSentences": 2,
|
||
"highlightsPerUrl": 3,
|
||
"query": "Unique value proposition, competitive advantages, market position"
|
||
}
|
||
},
|
||
context=True,
|
||
moderation=True
|
||
)
|
||
|
||
# Enhanced competitor search with domain filtering
|
||
enhanced_competitors = await exa.find_similar_and_contents(
|
||
url=user_url,
|
||
num_results=10,
|
||
exclude_domains=[extract_domain(user_url)], # Exclude user's domain
|
||
contents={
|
||
"text": True,
|
||
"summary": {
|
||
"query": "Content strategy, SEO approach, marketing tactics"
|
||
}
|
||
}
|
||
)
|
||
|
||
return {
|
||
"primary_competitors": primary_competitors,
|
||
"enhanced_competitors": enhanced_competitors,
|
||
"total_competitors": len(primary_competitors.results) + len(enhanced_competitors.results)
|
||
}
|
||
```
|
||
|
||
#### Phase 2: Competitor Analysis Enhancement
|
||
```python
|
||
async def analyze_competitor_content(competitor_urls: List[str]) -> Dict[str, Any]:
|
||
"""
|
||
Deep dive analysis of discovered competitors
|
||
"""
|
||
competitor_analyses = []
|
||
|
||
for competitor_url in competitor_urls[:10]: # Limit to top 10 competitors
|
||
# Get competitor's sitemap for structure analysis
|
||
sitemap_analysis = await analyze_sitemap(f"{competitor_url}/sitemap.xml")
|
||
|
||
# Get competitor's content strategy insights
|
||
content_analysis = await exa.find_similar_and_contents(
|
||
url=competitor_url,
|
||
num_results=5,
|
||
contents={
|
||
"text": True,
|
||
"summary": {
|
||
"query": "Content strategy, target keywords, audience engagement"
|
||
}
|
||
}
|
||
)
|
||
|
||
competitor_analyses.append({
|
||
"url": competitor_url,
|
||
"sitemap_analysis": sitemap_analysis,
|
||
"content_insights": content_analysis,
|
||
"competitive_score": calculate_competitive_score(sitemap_analysis, content_analysis)
|
||
})
|
||
|
||
return competitor_analyses
|
||
```
|
||
|
||
---
|
||
|
||
## 2. Enhanced Sitemap Analysis Integration
|
||
|
||
### 2.1 Current Sitemap Service Enhancement
|
||
|
||
The existing `SitemapService` will be enhanced to support competitive benchmarking:
|
||
|
||
#### Enhanced Sitemap Analysis with Competitive Context
|
||
```python
|
||
async def analyze_sitemap_with_competitive_context(
|
||
user_sitemap_url: str,
|
||
competitor_data: Dict[str, Any],
|
||
industry: str = None
|
||
) -> Dict[str, Any]:
|
||
"""
|
||
Enhanced sitemap analysis with competitive benchmarking
|
||
"""
|
||
# Get user's sitemap analysis
|
||
user_analysis = await sitemap_service.analyze_sitemap(
|
||
user_sitemap_url,
|
||
analyze_content_trends=True,
|
||
analyze_publishing_patterns=True
|
||
)
|
||
|
||
# Extract competitive benchmarks
|
||
competitor_benchmarks = extract_competitive_benchmarks(competitor_data)
|
||
|
||
# Generate AI insights with competitive context
|
||
competitive_insights = await generate_competitive_sitemap_insights(
|
||
user_analysis, competitor_benchmarks, industry
|
||
)
|
||
|
||
return {
|
||
"user_sitemap_analysis": user_analysis,
|
||
"competitive_benchmarks": competitor_benchmarks,
|
||
"competitive_insights": competitive_insights,
|
||
"market_positioning": calculate_market_positioning(user_analysis, competitor_benchmarks)
|
||
}
|
||
```
|
||
|
||
### 2.2 Competitive Benchmarking Metrics
|
||
|
||
#### Key Metrics for Competitive Analysis
|
||
```json
|
||
{
|
||
"competitive_benchmarks": {
|
||
"content_volume": {
|
||
"user_total_urls": 1250,
|
||
"competitor_average": 2100,
|
||
"market_leader": 4500,
|
||
"user_position": "below_average",
|
||
"opportunity_score": 75
|
||
},
|
||
"publishing_velocity": {
|
||
"user_velocity": 2.5,
|
||
"competitor_average": 3.8,
|
||
"market_leader": 6.2,
|
||
"user_position": "below_average",
|
||
"opportunity_score": 80
|
||
},
|
||
"content_structure": {
|
||
"user_categories": ["blog", "products", "resources"],
|
||
"competitor_categories": ["blog", "products", "resources", "case_studies", "guides"],
|
||
"missing_categories": ["case_studies", "guides"],
|
||
"opportunity_score": 85
|
||
},
|
||
"seo_optimization": {
|
||
"user_structure_quality": "good",
|
||
"competitor_average": "excellent",
|
||
"optimization_gaps": ["priority_values", "changefreq_optimization"],
|
||
"opportunity_score": 70
|
||
}
|
||
}
|
||
}
|
||
```
|
||
|
||
---
|
||
|
||
## 3. AI Insights Generation Strategy
|
||
|
||
### 3.1 Competitor Analysis AI Prompts
|
||
|
||
#### Primary Competitor Analysis Prompt
|
||
```python
|
||
COMPETITOR_ANALYSIS_PROMPT = """
|
||
Analyze these competitors discovered for the user's website: {user_url}
|
||
|
||
User Website Context:
|
||
- Industry: {industry}
|
||
- Current Content Strategy: {user_content_strategy}
|
||
- Target Audience: {user_target_audience}
|
||
|
||
Competitor Data:
|
||
{competitor_data}
|
||
|
||
Provide strategic insights on:
|
||
|
||
1. **Market Position Assessment**:
|
||
- Where does the user stand vs competitors?
|
||
- What are the user's competitive advantages?
|
||
- What are the main competitive gaps?
|
||
|
||
2. **Content Strategy Opportunities**:
|
||
- What content categories are competitors using that the user isn't?
|
||
- What content gaps present the biggest opportunities?
|
||
- What content strategies are working for competitors?
|
||
|
||
3. **Competitive Advantages**:
|
||
- What unique strengths does the user have?
|
||
- How can the user differentiate from competitors?
|
||
- What market positioning opportunities exist?
|
||
|
||
4. **Strategic Recommendations**:
|
||
- Top 5 actionable steps to improve competitive position
|
||
- Content priorities for the next 3 months
|
||
- Quick wins vs long-term strategic moves
|
||
|
||
Focus on actionable insights that help content creators and digital marketers make informed decisions.
|
||
"""
|
||
```
|
||
|
||
#### Enhanced Sitemap Analysis Prompt
|
||
```python
|
||
COMPETITIVE_SITEMAP_PROMPT = """
|
||
Analyze this sitemap data with competitive context:
|
||
|
||
User Sitemap Analysis:
|
||
{user_sitemap_data}
|
||
|
||
Competitive Benchmarks:
|
||
{competitive_benchmarks}
|
||
|
||
Industry Context: {industry}
|
||
|
||
Provide insights on:
|
||
|
||
1. **Content Volume Positioning**:
|
||
- How does the user's content volume compare to competitors?
|
||
- What content expansion opportunities exist?
|
||
- What content categories should be prioritized?
|
||
|
||
2. **Publishing Strategy Optimization**:
|
||
- How does the user's publishing frequency compare?
|
||
- What publishing patterns work best for competitors?
|
||
- What publishing schedule would be optimal?
|
||
|
||
3. **Site Structure Competitive Analysis**:
|
||
- How does the user's site organization compare?
|
||
- What structural improvements would help competitiveness?
|
||
- What SEO structure optimizations are needed?
|
||
|
||
4. **Content Gap Identification**:
|
||
- What content categories are competitors using that the user isn't?
|
||
- What content depth opportunities exist?
|
||
- What content types should be prioritized?
|
||
|
||
5. **Strategic Content Recommendations**:
|
||
- Top 10 content ideas based on competitive analysis
|
||
- Content calendar recommendations
|
||
- Content strategy priorities for next 6 months
|
||
|
||
Provide specific, actionable recommendations with business impact estimates.
|
||
"""
|
||
```
|
||
|
||
### 3.2 AI Insights Output Structure
|
||
|
||
#### Expected AI Insights Format
|
||
```json
|
||
{
|
||
"competitive_analysis": {
|
||
"market_position": "above_average",
|
||
"competitive_advantages": [
|
||
"Strong technical content depth",
|
||
"Regular publishing consistency",
|
||
"Good site organization"
|
||
],
|
||
"competitive_gaps": [
|
||
"Missing case studies content",
|
||
"Limited video content",
|
||
"No product comparison pages"
|
||
],
|
||
"market_opportunities": [
|
||
{
|
||
"opportunity": "Case studies content",
|
||
"priority": "high",
|
||
"effort": "medium",
|
||
"impact": "high",
|
||
"competitor_examples": ["competitor1.com/case-studies"]
|
||
}
|
||
]
|
||
},
|
||
"content_strategy_recommendations": {
|
||
"immediate_priorities": [
|
||
"Create case studies section",
|
||
"Develop product comparison pages",
|
||
"Increase publishing frequency to 3 posts/week"
|
||
],
|
||
"content_expansion": [
|
||
"Video content library",
|
||
"Industry insights section",
|
||
"Customer success stories"
|
||
],
|
||
"publishing_optimization": {
|
||
"recommended_frequency": "3 posts/week",
|
||
"optimal_schedule": "Tuesday, Thursday, Saturday",
|
||
"content_mix": "70% blog posts, 20% case studies, 10% videos"
|
||
}
|
||
},
|
||
"competitive_positioning": {
|
||
"unique_value_proposition": "Technical expertise with practical application",
|
||
"differentiation_strategy": "Focus on actionable insights over theory",
|
||
"market_positioning": "Premium technical content provider"
|
||
}
|
||
}
|
||
```
|
||
|
||
---
|
||
|
||
## 4. Implementation Roadmap
|
||
|
||
### 4.1 Phase 1: Core Implementation (Week 1)
|
||
|
||
#### Day 1-2: Exa API Integration
|
||
- [ ] Create Exa API service wrapper
|
||
- [ ] Implement competitor discovery endpoint
|
||
- [ ] Add error handling and rate limiting
|
||
- [ ] Create competitor data models
|
||
|
||
#### Day 3-4: Enhanced Sitemap Analysis
|
||
- [ ] Enhance existing sitemap service for competitive analysis
|
||
- [ ] Add competitive benchmarking metrics
|
||
- [ ] Implement market positioning calculations
|
||
- [ ] Create competitive insights generation
|
||
|
||
#### Day 5: AI Integration
|
||
- [ ] Implement competitive analysis AI prompts
|
||
- [ ] Create enhanced sitemap analysis prompts
|
||
- [ ] Add insights parsing and structuring
|
||
- [ ] Implement result aggregation
|
||
|
||
### 4.2 Phase 2: Frontend Integration (Week 2)
|
||
|
||
#### Day 1-2: API Endpoints
|
||
- [ ] Create Step 4 onboarding endpoints
|
||
- [ ] Implement competitor analysis endpoint
|
||
- [ ] Add enhanced sitemap analysis endpoint
|
||
- [ ] Create unified analysis results endpoint
|
||
|
||
#### Day 3-4: Frontend Components
|
||
- [ ] Create competitor analysis display component
|
||
- [ ] Build enhanced sitemap analysis UI
|
||
- [ ] Implement competitive insights visualization
|
||
- [ ] Add progress tracking and real-time updates
|
||
|
||
#### Day 5: Integration Testing
|
||
- [ ] End-to-end testing of competitor discovery
|
||
- [ ] Test sitemap analysis with competitive context
|
||
- [ ] Validate AI insights accuracy
|
||
- [ ] Performance optimization
|
||
|
||
### 4.3 Phase 3: Optimization & Enhancement (Week 3)
|
||
|
||
#### Day 1-2: Performance Optimization
|
||
- [ ] Implement parallel processing for competitor analysis
|
||
- [ ] Add caching for repeated analyses
|
||
- [ ] Optimize API call efficiency
|
||
- [ ] Add result pagination
|
||
|
||
#### Day 3-4: Advanced Features
|
||
- [ ] Add competitor monitoring capabilities
|
||
- [ ] Implement trend analysis
|
||
- [ ] Create competitive alerts system
|
||
- [ ] Add export functionality
|
||
|
||
#### Day 5: Documentation & Testing
|
||
- [ ] Complete API documentation
|
||
- [ ] Create user guides
|
||
- [ ] Comprehensive testing
|
||
- [ ] Performance benchmarking
|
||
|
||
---
|
||
|
||
## 5. Expected Outputs and Value
|
||
|
||
### 5.1 Competitor Analysis Outputs
|
||
|
||
#### Data Points Provided
|
||
- **Competitor URLs**: 15-25 relevant competitors discovered
|
||
- **Competitive Positioning**: Market position vs competitors
|
||
- **Content Gap Analysis**: Missing content opportunities
|
||
- **Competitive Advantages**: User's unique strengths
|
||
- **Strategic Recommendations**: Actionable next steps
|
||
|
||
#### Business Value
|
||
- **Market Intelligence**: Understanding competitive landscape
|
||
- **Content Strategy**: Data-driven content decisions
|
||
- **Competitive Positioning**: Clear differentiation strategy
|
||
- **Opportunity Identification**: High-impact content opportunities
|
||
|
||
### 5.2 Enhanced Sitemap Analysis Outputs
|
||
|
||
#### Data Points Provided
|
||
- **Competitive Benchmarks**: Performance vs market leaders
|
||
- **Content Volume Analysis**: Publishing frequency comparison
|
||
- **Structure Optimization**: Site organization improvements
|
||
- **SEO Opportunities**: Technical optimization recommendations
|
||
|
||
#### Business Value
|
||
- **Performance Benchmarking**: Know where you stand
|
||
- **Optimization Priorities**: Focus on high-impact improvements
|
||
- **Content Strategy**: Data-driven publishing decisions
|
||
- **Technical SEO**: Competitive technical optimization
|
||
|
||
### 5.3 Combined Strategic Value
|
||
|
||
#### For Content Creators
|
||
- Clear understanding of competitive landscape
|
||
- Data-driven content strategy recommendations
|
||
- Specific content opportunities to pursue
|
||
- Competitive positioning guidance
|
||
|
||
#### For Digital Marketers
|
||
- Market intelligence and competitive insights
|
||
- Performance benchmarking against competitors
|
||
- Strategic recommendations with business impact
|
||
- Actionable optimization priorities
|
||
|
||
#### For Business Owners
|
||
- Competitive market position assessment
|
||
- Strategic content and marketing direction
|
||
- ROI-focused recommendations
|
||
- Long-term competitive advantage planning
|
||
|
||
---
|
||
|
||
## 6. Cost Analysis and Optimization
|
||
|
||
### 6.1 Exa API Costs
|
||
|
||
#### Per Analysis Session
|
||
- **Competitor Discovery**: 25 results × $0.005 = $0.125
|
||
- **Enhanced Analysis**: 10 results × $0.005 = $0.05
|
||
- **Content Analysis**: 50 results × $0.001 = $0.05
|
||
- **Total per Session**: ~$0.225
|
||
|
||
#### Monthly Projections (100 users)
|
||
- **100 users × 4 analyses/month**: 400 sessions
|
||
- **400 sessions × $0.225**: $90/month
|
||
- **Cost per user per analysis**: $0.225
|
||
|
||
### 6.2 Optimization Strategies
|
||
|
||
#### Cost Reduction
|
||
- **Caching**: Store competitor results for 30 days
|
||
- **Batch Processing**: Analyze multiple competitors together
|
||
- **Smart Filtering**: Only analyze top competitors
|
||
- **Result Pagination**: Load more results on demand
|
||
|
||
#### Value Maximization
|
||
- **Rich Insights**: Comprehensive competitive intelligence
|
||
- **Actionable Recommendations**: Specific next steps
|
||
- **Business Impact**: ROI-focused insights
|
||
- **User Experience**: Intuitive, professional interface
|
||
|
||
---
|
||
|
||
## 7. Success Metrics
|
||
|
||
### 7.1 Technical Metrics
|
||
- **Analysis Completion Rate**: >95%
|
||
- **Average Analysis Time**: <2 minutes
|
||
- **API Success Rate**: >98%
|
||
- **Data Accuracy**: >90% user satisfaction
|
||
|
||
### 7.2 Business Metrics
|
||
- **User Engagement**: >4.5/5 rating for insights quality
|
||
- **Actionability**: >80% of users implement recommendations
|
||
- **Competitive Intelligence Value**: Measurable business impact
|
||
- **Content Strategy Improvement**: Quantifiable results
|
||
|
||
### 7.3 User Experience Metrics
|
||
- **Onboarding Completion**: >85% complete Step 4
|
||
- **Insights Relevance**: >90% find insights actionable
|
||
- **Competitive Understanding**: >80% better understand market position
|
||
- **Strategic Direction**: >75% have clearer content strategy
|
||
|
||
---
|
||
|
||
## Conclusion
|
||
|
||
This Phase 1 implementation provides a solid foundation for competitive analysis in Step 4 onboarding. By combining Exa API's powerful competitor discovery with enhanced sitemap analysis, users will receive:
|
||
|
||
- **Comprehensive Competitive Intelligence**: Understanding of market position and opportunities
|
||
- **Data-Driven Content Strategy**: Specific recommendations for content development
|
||
- **Strategic Business Insights**: Actionable recommendations for competitive advantage
|
||
- **Professional-Grade Analysis**: Enterprise-level competitive intelligence
|
||
|
||
The implementation is cost-effective, scalable, and provides immediate value to users while setting the foundation for more advanced competitive analysis features in future phases.
|