Files

Kunthawat Greethong 9be686f587 Auto-sync from website-creator

2026-03-08 23:03:19 +07:00

1.6 KiB

Raw Blame History

name, description

name	description
image-analyze	Analyze images using vision AI when the current model doesn't support image input. Use this skill when you need to understand, describe, or extract information from images.

Image Analyze

Analyze images with vision AI via python3 scripts/analyze_image.py <image_path> [prompt].

Commands

Command	Args	Description
`analyze`	`<image_path> [prompt]`	Analyze image with optional custom prompt

Options

Option	Default	Description
`--max-tokens`	1024	Maximum tokens in response
`--temperature`	0.7	Response creativity (0-2)
`--model`	moonshotai/Kimi-K2.5-TEE	Vision model to use

Examples

# Basic analysis
python3 scripts/analyze_image.py photo.jpg

# With custom prompt
python3 scripts/analyze_image.py diagram.png "Extract all text and explain the workflow"

# Detailed analysis
python3 scripts/analyze_image.py screenshot.png "Describe all UI elements and their positions"

# OCR-like extraction
python3 scripts/analyze_image.py document.jpg "Transcribe all text exactly as shown"

Workflow

Provide image path (PNG, JPG, JPEG, GIF, WEBP, BMP)
Optionally provide custom analysis prompt
Script converts image to base64 and sends to vision API
Returns detailed analysis text

Output Format

Success: Analysis text directly
Error: Error: message (to stderr)

Notes

Requires CHUTES_API_TOKEN in environment
Uses Kimi-K2.5-TEE vision model via Chutes AI
Supports common image formats
Best for: image description, OCR, UI analysis, diagram interpretation

1.6 KiB Raw Blame History