This is the engineering blog of Lyft, a major ride-hailing and logistics company. It covers how they build and scale machine learning systems, run experiments, and improve rider and driver experiences. If you're curious about how a real-world marketplace uses data science and AI, this blog is a great read.
EngLyft Engineering - Medium
This is the engineering blog of Lyft, a major ride-hailing and logistics company. It covers how they build and scale machine learning systems, run experiments, and improve rider and driver experiences. If you're curious about how a real-world marketplace uses data science and AI, this blog is a great read.
“Lyft's engineering blog: scaling marketplace ML and experimentation.”
Read this when you want to understand how a mature marketplace platform applies causal inference, ML infrastructure, and product design at scale.
Skip it if you need deep dives into distributed systems, networking, or security.
Compared to Uber Engineering, this one is more focused on causal inference and experimentation than on core infrastructure.
What this is
As told for the tourist
Start Here
A recommended reading path through the code
Start Here
A recommended reading path through the code
- 01
- 02
- 03
- 04
- 05
- 06
- 07
What's inside
7 sections of the codebase
Posting History
Activity over time
The Archive
Every post, searchable and filtered
How We Built a Smarter Pickup Experience for Gated Communities
6mDescribes how Lyft improved the pickup experience for riders in gated communities by addressing communication and navigation challenges.
Predicting Rider Conversion in Sparse Data Environments with Bayesian Trees
8mExplains the use of Bayesian trees to predict rider conversion in sparse data environments at Lyft.
Beyond A/B Testing: Using Surrogacy and Region-Splits to Measure Long-Term Effects in Marketplaces
10mDiscusses using surrogacy and region-splits as alternatives to A/B testing for measuring long-term marketplace effects.
Scaling Localization with AI at Lyft
7mDetails how Lyft scaled its localization efforts by integrating AI to replace or augment human translation.
Trusting the Untestable: Validation and Diagnostics for the Doubly Robust Models
9mCovers validation and diagnostic techniques for doubly robust models used in causal inference beyond randomized experiments.
Lyft’s Feature Store: Architecture, Optimization, and Evolution
11mDescribes the architecture, optimization, and evolution of Lyft's Feature Store for machine learning.
From Python3.8 to Python3.10: Our Journey Through a Memory Leak
8mChronicles the journey of upgrading from Python 3.8 to 3.10 and debugging a memory leak encountered along the way.
LyftLearn Evolution: Rethinking ML Platform Architecture
12mExplains the evolution of Lyft's ML platform architecture, LyftLearn, to support thousands of production models.
My Starter Project on the Lyft Rider Data Science Team
5mA personal account of a starter project on the Lyft Rider Data Science team using the Rider Experience Score tool.
Migrating Lyft’s Android Codebase to Kotlin
7mDescribes the multi-year migration of Lyft's Android codebase from Java to Kotlin across multiple apps.
Export & Share
Take the field notes with you