Files
pi-skill/skills/phantom-canvas/references/AGENTS.md
2026-05-26 12:43:48 +07:00

1.0 KiB

AGENTS.md

CLI tool and HTTP API for AI image/video generation via Gemini Web.

What this tool does

Phantom Canvas wraps Gemini Web as a programmable CLI and HTTP API. It launches Chrome via CDP, automates Gemini's web UI, and exposes generation capabilities for AI agents, scripts, and applications.

How to use

bun add -g phantom-canvas       # or: npm install -g phantom-canvas
phantom-canvas generate "your prompt" -o output.png --headed  # first time: login in Chrome
phantom-canvas generate "your prompt" -o output.png           # after that: headless

See SKILL.md for complete agent instructions.

Architecture

  • index.ts — CLI entry point (chrome / generate / serve)
  • lib/browser.ts — Browser automation (Chrome CDP + Playwright)
  • lib/tasks.ts — Async task queue
  • dist/index.js — Compiled Node.js bundle

Session

Chrome stores login in ~/.phantom-canvas/chrome-profile/. First time requires --headed to login interactively. After that, login persists and headless mode works.