1.0 KiB
1.0 KiB
AGENTS.md
CLI tool and HTTP API for AI image/video generation via Gemini Web.
What this tool does
Phantom Canvas wraps Gemini Web as a programmable CLI and HTTP API. It launches Chrome via CDP, automates Gemini's web UI, and exposes generation capabilities for AI agents, scripts, and applications.
How to use
bun add -g phantom-canvas # or: npm install -g phantom-canvas
phantom-canvas generate "your prompt" -o output.png --headed # first time: login in Chrome
phantom-canvas generate "your prompt" -o output.png # after that: headless
See SKILL.md for complete agent instructions.
Architecture
index.ts— CLI entry point (chrome / generate / serve)lib/browser.ts— Browser automation (Chrome CDP + Playwright)lib/tasks.ts— Async task queuedist/index.js— Compiled Node.js bundle
Session
Chrome stores login in ~/.phantom-canvas/chrome-profile/. First time requires --headed to login interactively. After that, login persists and headless mode works.