This is the engineering blog of Slack, the popular workplace messaging platform. It covers how Slack’s team tackles real-world challenges like migrating core infrastructure, building AI agents for security investigations, and scaling observability. Anyone curious about how a large SaaS company operates under the hood will find these posts insightful.
SlackEngineering at Slack
This is the engineering blog of Slack, the popular workplace messaging platform. It covers how Slack’s team tackles real-world challenges like migrating core infrastructure, building AI agents for security investigations, and scaling observability. Anyone curious about how a large SaaS company operates under the hood will find these posts insightful.
“Slack’s engineering blog: scaling reliability, security, and AI agents.”
Read this when you want to see how a mature SaaS platform modernizes infrastructure, adopts AI agents, and improves developer experience at scale.
Skip it if you need beginner tutorials, frontend-focused content, or posts on consumer-scale systems.
Compared to the Uber Engineering Blog, this one is more focused on security modernization and AI agent workflows rather than broad distributed systems.
What this is
As told for the tourist
Start Here
A recommended reading path through the code
Start Here
A recommended reading path through the code
- 01
- 02
- 03
- 04
- 05
- 06
- 07
What's inside
6 sections of the codebase
Posting History
Activity over time
The Archive
Every post, searchable and filtered
From SSH to REST: A Security-Driven Modernization of Slack’s EMR Data Pipelines
8mSlack modernized its data pipelines by replacing 700+ SSH-based operators with a REST-based approach for improved security and scalability.
Managing context in long-run agentic applications
10mThis post discusses challenges and mechanisms for maintaining alignment and coherent reasoning in long-running agentic systems.
From Custom to Open: Scalable Network Probing and HTTP/3 Readiness with Prometheus
7mSlack describes migrating from custom network probing tools to a scalable Prometheus-based solution for HTTP/3 readiness.
How Slack Rebuilt Notifications 📣
9mSlack rebuilt its notification system from the ground up to reduce overwhelm and improve user experience.
Streamlining Security Investigations with Agents
6mSlack's Security Engineering team uses agent-based systems to streamline security investigations from billions of daily events.
Android VPAT journey
5mThis post details Slack's journey to create an Android Voluntary Product Accessibility Template (VPAT) to document accessibility standards.
Build better software to build software better
7mSlack's build pipeline team reduced build times from 60 minutes to improve developer velocity and feedback loops.
Advancing Our Chef Infrastructure: Safety Without Disruption
6mSlack advanced its Chef infrastructure to a multi-stack model, focusing on safety and minimal disruption during migration.
Export & Share
Take the field notes with you