Data Exfiltration from Slack AI via indirect prompt injection
Nature of the Slack AI vulnerability
- Attack relies on indirect prompt injection via content in public channels or uploaded documents.
- Slack AI searches both:
- All public channels (including ones the victim has never joined, even single‑user channels).
- The victim’s private channels and DMs.
- Malicious instructions cause Slack AI to generate Markdown links that:
- Look legitimate (“reauthenticate”, etc.) but
- Embed the victim’s private data (API keys, secrets, internal sentiment, etc.) in the URL/query or potentially subdomain.
- If the user clicks, the secret is sent to the attacker’s server; with link previews or image tags, exfil can become zero‑click.
Permissions, access, and phishing vs. “real” exfiltration
- Multiple commenters stress: channel permissions are not bypassed. AI only uses data the victim is allowed to see.
- The vulnerability is that AI recombines and formats data into a new, exfil‑ready artifact (a link) that never existed before.
- Some see it as AI‑assisted phishing / social engineering rather than classic unauthorized access.
- Others argue it’s closer to XSS/HTML injection for LLM UIs and should be treated as a serious web‑security issue.
How serious is this in practice?
- Skeptical view:
- Attacker must already be in the workspace (though not necessarily same company).
- Attack chain is complex and success probability low; simpler social engineering may be more effective.
- Slack’s existing search behavior (public+private) and user misuse of Slack for secrets are bigger issues.
- Concerned view:
- Many workspaces include external guests or broad communities, so “malicious insider” isn’t far‑fetched.
- AI‑generated links from a trusted, company‑branded assistant are harder to spot than obvious phish.
- Potential for data poisoning and subtle leakage of strategic info (e.g., executive sentiment, unreleased docs).
- Slack’s response is seen by some as downplaying an OWASP‑class bug without a quick fix.
Broader LLM security implications
- Prompt injection is described as fundamentally unsolved; LLMs can’t reliably distinguish system instructions from user content.
- Attempts to defend using another LLM or “guardrail” products are widely criticized as flawed or giving false confidence.
- Best current advice discussed: limit blast radius—strict data access controls (e.g., RLS/RAG scoping), sanitize outputs (strip links/images), avoid over‑privileged agents.
- Many commenters worry companies are “YOLO‑ing” LLMs into products, repeating decades‑old security mistakes.