Files
pi-skill/skills/slack-web/SKILL.md
2026-05-25 16:41:08 +07:00

219 lines
5.7 KiB
Markdown

---
name: slack-web
description: >
Slack web automation via agent-browser. Use when the user asks to check Slack,
read messages, send messages, reply in threads, search Slack, check notifications,
react to messages, send DMs, or any Slack communication task. Triggers include
"check my Slack", "send a message on Slack", "reply to that Slack thread",
"search Slack for...", "what's new on Slack", "read my DMs", "react to that message",
"check Slack notifications", "message [person] on Slack".
allowed-tools: Bash(agent-browser:*)
---
# Slack Web Automation
Interact with Slack through its web interface using `agent-browser`.
## Core Workflow
Every Slack interaction follows this pattern:
1. **Restore session**: `agent-browser state load ~/.slack-auth-state.json`
2. **Open Slack**: `agent-browser --headed open "https://app.slack.com"`
3. **Snapshot**: `agent-browser snapshot -i -c` (get refs)
4. **Interact**: Use refs to click, type, navigate
5. **Re-snapshot**: After every action — Slack's DOM changes constantly
```bash
# Restore and open
agent-browser state load ~/.slack-auth-state.json
agent-browser --headed open "https://app.slack.com"
agent-browser wait --load networkidle
agent-browser set viewport 1920 1080
# Verify logged in (URL should contain /client/)
agent-browser get url
# Navigate with quick switcher
agent-browser press "Meta+k"
agent-browser wait 500
agent-browser snapshot -i
agent-browser fill @eXX "general"
agent-browser wait 1000
agent-browser snapshot -i
agent-browser click @eXX
agent-browser wait --load networkidle
# Read messages
agent-browser snapshot -c
```
## Authentication
### First-Time Login (Manual)
```bash
agent-browser --headed open "https://app.slack.com"
# PAUSE — tell user to log in manually
# After login confirmed:
agent-browser get url # verify /client/ in URL
agent-browser state save ~/.slack-auth-state.json
```
### Restore Session
```bash
agent-browser state load ~/.slack-auth-state.json
agent-browser --headed open "https://app.slack.com"
agent-browser wait --load networkidle
agent-browser get url
# If URL contains "signin" → session expired, user must re-login
```
## Actions
### Read Messages
```bash
# Navigate to channel (quick switcher is fastest)
agent-browser press "Meta+k"
agent-browser wait 500
agent-browser snapshot -i
agent-browser fill @eXX "channel-name"
agent-browser wait 1000
agent-browser snapshot -i
agent-browser click @eXX
agent-browser wait --load networkidle
# Read
agent-browser snapshot -c
```
### Send a Message
```bash
# In the target channel, find compose box
agent-browser snapshot -i
# Look for textbox with "Message #channel" placeholder
agent-browser click @eXX # compose area
agent-browser type @eXX "Your message"
agent-browser press "Enter"
agent-browser wait 1000
agent-browser snapshot -c # verify sent
```
Use `type` not `fill` — Slack's compose box is contenteditable.
### Reply to Thread
```bash
agent-browser snapshot -c
agent-browser hover @eXX # hover message to show toolbar
agent-browser snapshot -i
agent-browser click @eXX # reply/thread button
agent-browser wait --load networkidle
agent-browser snapshot -i
agent-browser click @eXX # thread compose box
agent-browser type @eXX "Thread reply"
agent-browser press "Enter"
```
### Send DM
```bash
agent-browser press "Meta+k"
agent-browser wait 500
agent-browser snapshot -i
agent-browser fill @eXX "Person Name"
agent-browser wait 1000
agent-browser snapshot -i
agent-browser click @eXX # person result
agent-browser wait --load networkidle
agent-browser snapshot -i
agent-browser click @eXX # compose box
agent-browser type @eXX "Your DM"
agent-browser press "Enter"
```
### Search
```bash
agent-browser press "Meta+g" # Slack search
agent-browser wait 500
agent-browser snapshot -i
agent-browser fill @eXX "search query"
agent-browser press "Enter"
agent-browser wait --load networkidle
agent-browser snapshot -c
```
### React with Emoji
```bash
agent-browser snapshot -c
agent-browser hover @eXX # hover over message
agent-browser snapshot -i
agent-browser click @eXX # emoji/reaction button (smiley icon)
agent-browser wait 500
agent-browser snapshot -i
agent-browser fill @eXX "thumbsup" # emoji search
agent-browser wait 500
agent-browser snapshot -i
agent-browser click @eXX # emoji result
```
### Check Notifications
```bash
# Click "Activity" in sidebar or navigate:
agent-browser snapshot -i -c
agent-browser click @eXX # Activity link
agent-browser wait --load networkidle
agent-browser snapshot -c
```
### Check Unreads
```bash
agent-browser snapshot -i -c
# Look for bold channel names or "Unreads" section
agent-browser click @eXX # Unreads link
agent-browser wait --load networkidle
agent-browser snapshot -c
```
## Critical Rules
1. **ALWAYS re-snapshot** after every action — refs invalidate constantly
2. **Use `--headed`** mode so user can see and intervene
3. **Set viewport 1920x1080** — Slack hides features at small widths
4. **Use `Meta+k`** (quick switcher) as primary navigation
5. **Use `type` not `fill`** for compose boxes (contenteditable)
6. **Wait after actions**: `agent-browser wait --load networkidle`
7. **Save auth state often**: `agent-browser state save ~/.slack-auth-state.json`
## Semantic Locator Fallbacks
When refs don't match:
```bash
agent-browser find text "general" click
agent-browser find role textbox fill "Hello"
agent-browser find placeholder "Message #general" type "Hello"
agent-browser find role button click --name "Send"
```
## Debugging
```bash
agent-browser screenshot /tmp/slack-debug.png
agent-browser get url
agent-browser snapshot # full unfiltered tree
```
## Deep-Dive References
| Reference | When to Use |
|-----------|-------------|
| [references/workflows.md](references/workflows.md) | Full workflow examples |