ajaysi 6fd9a4e354 ALwrity HALLUCINATION DETECTOR AND ASSISTIVE WRITING

2025-09-08 21:14:27 +05:30

Hallucination Detector Setup Guide

This guide explains how to set up and configure ALwrity's hallucination detector, which fact-checks text by extracting claims with OpenAI and searching for supporting evidence with Exa.ai.

📋 Overview

The hallucination detector allows users to:

Select text in the LinkedIn editor
Check facts using AI-powered claim extraction and verification
View confidence scores and source attribution
Get detailed analysis of factual accuracy

🔧 Backend Setup

1. Environment Variables

Add the following environment variables to your .env file:

# Exa.ai API Key for Hallucination Detection
EXA_API_KEY=your_exa_api_key_here

# OpenAI API Key for claim extraction and verification
OPENAI_API_KEY=your_openai_api_key_here
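
Before starting the backend, it can help to confirm that both keys are actually visible to the process. The following is a minimal sketch, not part of ALwrity: `missing_keys` and `REQUIRED_KEYS` are hypothetical names, and it assumes the .env file has already been loaded into the environment (this guide does not say how the backend loads it).

```python
import os

# Variable names are taken from the .env listing above.
# The helper itself is hypothetical and only illustrates the check.
REQUIRED_KEYS = ("EXA_API_KEY", "OPENAI_API_KEY")


def missing_keys(env=None):
    """Return the required variables that are unset or blank in `env`."""
    env = os.environ if env is None else env
    return [name for name in REQUIRED_KEYS if not (env.get(name) or "").strip()]
```

Calling `missing_keys()` with no argument inspects the current process environment; an empty list means both keys are set.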

2. Get Exa.ai API Key

Visit Exa.ai
Sign up for an account
Navigate to your API dashboard
Generate an API key
Add the key to your .env file

3. Install Dependencies

The hallucination detector depends on the openai and requests Python packages. Both are already listed in requirements.txt; to install them on their own, run:

pip install openai requests

4. Start the Backend

cd backend
python start_alwrity_backend.py

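Once the server is running, the hallucination detector API exposes these endpoints (the examples in this guide assume the backend listens on http://localhost:8000):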

POST /api/hallucination-detector/detect - Main fact-checking endpoint
POST /api/hallucination-detector/extract-claims - Extract claims only
POST /api/hallucination-detector/verify-claim - Verify single claim
GET /api/hallucination-detector/health - Health check
GET /api/hallucination-detector/demo - Demo information

🎨 Frontend Setup

1. Environment Variables

Add the following to your frontend .env file:

# Backend API URL
REACT_APP_API_URL=http://localhost:8000

2. Start the Frontend

cd frontend
npm start

🚀 Usage

1. In LinkedIn Editor

Generate or paste content in the LinkedIn editor
Select any text (minimum 10 characters)
Click "🔍 Check Facts" in the selection menu
View the fact-checking results with:
- Overall confidence score
- Individual claim assessments
- Supporting/refuting sources
- Detailed reasoning

2. API Usage

Detect Hallucinations

curl -X POST "http://localhost:8000/api/hallucination-detector/detect" \
  -H "Content-Type: application/json" \
  -d '{
    "text": "The Eiffel Tower is located in Paris and was built in 1889.",
    "include_sources": true,
    "max_claims": 5
  }'

Extract Claims Only

curl -X POST "http://localhost:8000/api/hallucination-detector/extract-claims" \
  -H "Content-Type: application/json" \
  -d '{
    "text": "Our company increased sales by 25% last quarter.",
    "max_claims": 10
  }'

Verify Single Claim

curl -X POST "http://localhost:8000/api/hallucination-detector/verify-claim" \
  -H "Content-Type: application/json" \
  -d '{
    "claim": "The Eiffel Tower is in Paris",
    "include_sources": true
  }'
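
The curl calls above can also be issued from Python. The sketch below uses only the standard library and mirrors the URLs and payload fields shown in the curl examples; `build_request` and `post_json` are hypothetical helper names, and the shape of the JSON reply is not documented in this guide, so the code returns it undecoded beyond JSON parsing.

```python
import json
import urllib.request

# Base URL taken from the curl examples above.
BASE_URL = "http://localhost:8000/api/hallucination-detector"


def build_request(endpoint, payload):
    """Build (but do not send) a JSON POST request for one detector endpoint."""
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        f"{BASE_URL}/{endpoint}",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def post_json(endpoint, payload, timeout=30):
    """Send the request and parse the JSON reply. Requires a running backend."""
    request = build_request(endpoint, payload)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

For example, `post_json("verify-claim", {"claim": "The Eiffel Tower is in Paris", "include_sources": True})` corresponds to the last curl call.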

🔍 How It Works

Three-Step Process

Claim Extraction: Uses OpenAI to identify verifiable statements from text
Evidence Search: Uses Exa.ai to find relevant sources for each claim
Claim Verification: Uses OpenAI to assess whether sources support or refute claims
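
The three steps above can be sketched as a small pipeline. This is an illustration of the flow only, not the code in backend/services/hallucination_detector.py: `check_text`, `ClaimResult`, and the three callables are hypothetical, and the OpenAI and Exa.ai calls are passed in as plain functions so the sketch runs without network access.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class ClaimResult:
    claim: str
    assessment: str
    sources: List[str] = field(default_factory=list)


def check_text(
    text: str,
    extract: Callable[[str], List[str]],      # step 1: claim extraction
    search: Callable[[str], List[str]],       # step 2: evidence search
    verify: Callable[[str, List[str]], str],  # step 3: claim verification
    max_claims: int = 10,
) -> List[ClaimResult]:
    """Run extraction, search, and verification for each claim in order."""
    results = []
    for claim in extract(text)[:max_claims]:
        sources = search(claim)
        assessment = verify(claim, sources)
        results.append(ClaimResult(claim, assessment, sources))
    return results
```

Passing the services in as functions keeps each step replaceable, which is also convenient for testing with stubs.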

Assessment Types

Supported: Claim is backed by credible sources
Refuted: Claim is contradicted by credible sources
Insufficient Information: Not enough evidence to make a determination

Confidence Scores

0.8-1.0: High confidence (green)
0.6-0.8: Medium confidence (orange)
0.0-0.6: Low confidence (red)
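
The bands above share their boundary values (0.6 and 0.8 each appear in two ranges). The sketch below assumes a boundary score belongs to the higher band; that choice and the function name `confidence_band` are assumptions for illustration, since this guide does not state how ALwrity resolves the overlap.

```python
def confidence_band(score):
    """Map a confidence score in [0.0, 1.0] to a (label, color) pair.

    Assumption: a score exactly on a boundary falls into the higher band.
    """
    if not 0.0 <= score <= 1.0:
        raise ValueError("score must be between 0.0 and 1.0")
    if score >= 0.8:
        return ("high", "green")
    if score >= 0.6:
        return ("medium", "orange")
    return ("low", "red")
```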

🛠️ Configuration Options

Backend Configuration

In backend/services/hallucination_detector.py:

# Adjust claim extraction parameters
max_claims = 10  # Maximum claims to extract
min_claim_length = 10  # Minimum claim length

# Adjust Exa.ai search parameters
num_results = 5  # Number of sources to retrieve
use_autoprompt = True  # Use Exa's autoprompt feature

Frontend Configuration

In frontend/src/services/hallucinationDetectorService.ts:

// Adjust API timeout
const timeout = 30000; // 30 seconds

// Adjust request parameters
const defaultMaxClaims = 10;
const defaultIncludeSources = true;

🐛 Troubleshooting

Common Issues

"EXA_API_KEY not found"
- Ensure the API key is set in your .env file
- Restart the backend server after adding the key
"OpenAI API key not found"
- Ensure the OpenAI API key is set in your .env file
- Verify that the account behind the key has sufficient credits
"No sources found"
- Check your Exa.ai API key and account status
- Verify internet connectivity
- Check Exa.ai service status
Frontend connection errors
- Ensure the backend is running on the correct port
- Check CORS configuration
- Verify the API URL in frontend environment variables

Fallback Behavior

The system includes fallback mechanisms:

If Exa.ai is unavailable, mock sources are used
If OpenAI is unavailable, simple keyword matching is used
If both APIs fail, the system returns a safe error response
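
As a rough illustration of the keyword-matching fallback, the sketch below tries a model-based verifier first and falls back to word overlap if it raises. The guide does not describe ALwrity's actual matching logic, so `keyword_overlap`, `verify_with_fallback`, the 0.5 threshold, and the returned labels are all assumptions.

```python
import re


def keyword_overlap(claim, source_text):
    """Fraction of the claim's words (3+ characters) that appear in the source."""
    words = set(re.findall(r"[a-z0-9]{3,}", claim.lower()))
    if not words:
        return 0.0
    found = set(re.findall(r"[a-z0-9]{3,}", source_text.lower()))
    return len(words & found) / len(words)


def verify_with_fallback(claim, source_text, llm_verify, threshold=0.5):
    """Return (assessment, used_fallback).

    `llm_verify` stands in for the OpenAI-based verifier; if it raises,
    a simple keyword overlap decides the assessment instead.
    """
    try:
        return llm_verify(claim, source_text), False
    except Exception:
        score = keyword_overlap(claim, source_text)
        label = "supported" if score >= threshold else "insufficient_information"
        return label, True
```

Returning the `used_fallback` flag alongside the assessment lets the caller log fallback activations or lower the displayed confidence.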

📊 Monitoring

Health Check

curl http://localhost:8000/api/hallucination-detector/health

Response:

{
  "status": "healthy",
  "version": "1.0.0",
  "exa_api_available": true,
  "openai_api_available": true,
  "timestamp": "2024-01-01T12:00:00"
}
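
A monitoring script could read this response and report which upstream services are down. The sketch below relies only on the two availability fields shown in the sample response; the function name `unavailable_services` is hypothetical.

```python
import json


def unavailable_services(health_json):
    """Return the names of upstream services the health check reports as down."""
    health = json.loads(health_json)
    flags = {"exa": "exa_api_available", "openai": "openai_api_available"}
    # A missing field is treated as unavailable.
    return [name for name, key in flags.items() if not health.get(key, False)]
```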

Logs

Check backend logs for:

API call success/failure
Processing times
Error messages
Fallback activations

🔒 Security Considerations

API Keys: Store securely and never commit to version control
Rate Limiting: Respect API rate limits for Exa.ai and OpenAI
Data Privacy: Text sent to the Exa.ai and OpenAI APIs may be logged by those providers
Input Validation: All user input is validated before processing

📈 Performance Optimization

Caching: Consider implementing result caching for repeated queries
Batch Processing: Process multiple claims in parallel
Source Limiting: Limit the number of sources retrieved per claim
Timeout Management: Set appropriate timeouts for API calls
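
The caching and parallel-processing suggestions can be combined in a few lines. This is a sketch under stated assumptions, not ALwrity code: `make_cached` and `verify_all` are hypothetical names, the cache is in-memory and per-process, and `verify` stands for any function that takes a claim string and returns a result.

```python
from concurrent.futures import ThreadPoolExecutor
from functools import lru_cache


def make_cached(verify, maxsize=256):
    """Wrap a verify function so repeated claims reuse the earlier result."""
    return lru_cache(maxsize=maxsize)(verify)


def verify_all(claims, verify, max_workers=4):
    """Verify claims in parallel threads, preserving the input order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(verify, claims))
```

Threads suit this workload because each verification mostly waits on network calls. Keep `max_workers` low enough to stay within the Exa.ai and OpenAI rate limits.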

🚀 Future Enhancements

Potential improvements:

Integration with additional fact-checking APIs
Custom claim extraction models
Source credibility scoring
Historical fact-checking database
Real-time fact-checking during content generation
