
Curated Engineering Blogs from Top Tech Companies

A curated collection of 300+ engineering blog articles from top tech companies. Learn how the best engineering teams solve real-world problems at scale.

Agoda

BlogYearRead
How Agoda manages 450 million+ property images2025Link
How Agoda built its property bot to cut response time from 8 hours to seconds2024Link
How Agoda migrated its GraphQL Monolith API service to Microservices2024Link
How Agoda Solved Retry Storms to Boost System Reliability2024Link
How Agoda Designs and Maintains a High-Performing Data Pipeline2023Link
How Agoda manages 1.8 trillion Events per day on Kafka2023Link
How Agoda indexes hundreds of millions of series in a time-series database2022Link

Airbnb

BlogYearRead
Embedding-Based Retrieval for Airbnb Search2025Link
How Airbnb improved page performance using HTTP Streaming2023Link
Airbnb’s Data Framework for faster and more reliable read-heavy workloads2023Link
Avoiding Double Payments in a Distributed Payments System2019Link

Amazon Science

BlogYearRead
Training code generation models to debug their own outputs2025Link
The technology behind Amazon’s GenAI-powered shopping assistant, Rufus2024Link
Ensuring that customers don't miss out on trending products2023Link
From structured search to learning-to-rank-and-retrieve2023Link
Invalidating robotic ad clicks in real time2023Link
Using large language models (LLMs) to synthesize training data2023Link
Lessons learned from 10 years of DynamoDB2022Link
Using graph neural networks to recommend related products2022Link

Atlassian

BlogYearRead
How Atlassian Scaled and Enhanced Throughput in the Jira Export Service2025Link
How one of Atlassian’s critical services consistently achieves above 99.9999% availability2022Link
How Atlassian made Git push over HTTPS faster for Bitbucket Cloud2022Link
How Atlassian Revamped Confluence Cloud Search2021Link
Caching JQL search in Jira Cloud2021Link
Scaling, rearchitecting, and decomposing Confluence Cloud2020Link
Scaling Bitbucket’s Database2020Link
Atlassian's journey scaling low latency, multi-region services on AWS2019Link

Auth0

BlogYearRead
Build an AI Assistant with LangGraph, Vercel, and Next.js2025Link
Building a Secure RAG with Python, LangChain, and OpenFGA2025Link
Identity Challenges for AI-Powered Applications2024Link

Booking.com

BlogYearRead
Anomaly Detection in Time Series Using Statistical Analysis2025Link
How Booking Cut 20% of the Cloud Cost with a Single Code Change2025Link
The Engineering Behind Booking.com's High-Performance Ranking Platform2024Link
How Booking.com Leverages graph technology for real-time Fraud Detection and Prevention2024Link
How Booking.com Predicts cancellations with survival modeling2024Link

Canva

BlogYearRead
Canva’s continuous data platform2025Link
How Canva's drawing tool works2024Link
How Canva collects 25 billion events per day2024Link
Canva's scalable and reliable content usage counting service2024Link
How Canva saves millions annually in Amazon S3 costs2023Link
How Canva scaled media uploads from Zero to 50 Million per day2022Link
Canva's fast and scalable reverse image search2022Link
How Canva enables real-time collaboration with RSocket2021Link

Coinbase

BlogYearRead
How Coinbase Optimizes Network Requests2024Link
Accelerating Deep Learning Adoption at Coinbase2024Link
Lessons from launching Enterprise-grade GenAI solutions at Coinbase2024Link
How Coinbase Uses ML to Predict Traffic and Scale Databases2024Link
Detecting Fraudulent Transactions at Coinbase2023Link
Building a notification platform at Coinbase2022Link

Discord

BlogYearRead
How Discord Reduced Websocket Traffic by 40%2024Link
How Discord Stores Trillions of Messages2023Link
Pushing Discord’s Limits with a Million+ Online Users in a Single Server2023Link
How Discord uses ML to Build a Delightful Notification Experience2022Link
How Discord Creates Insights from Trillions of Data Points2021Link

DoorDash

BlogYearRead
How DoorDash Uses LLMs to transcribe restaurant menu photos2025Link
How DoorDash leverages LLMs for better search retrieval2024Link
Building DoorDash’s product knowledge graph with large language models2024Link
DoorDash’s in-house search engine2024Link
DoorDash's write-heavy scalable and reliable inventory platform2023Link
DoorDash's scalable real time event processing with Kafka and Flink2022Link
DoorDash’s Lessons on Improving Performance on High-Traffic Web Pages2022Link
How DoorDash Applied Client-Side Caching to Improve Feature Store Performance by 70%2022Link
Building a Unified Chat Experience at DoorDash2022Link

Dropbox

BlogYearRead
How Dropbox evolved its infrastructure through the messaging system model2025Link
Dropbox's scalable, consistent, metadata caching solution2024Link
Bringing AI-powered answers and summaries to file previews on the web2024Link
Dropbox's ML-powered file organization2023Link
How Dropbox uses ML to identify date formats in file names2023Link
How Dropbox optimizes payments with machine learning2021Link

eBay

BlogYearRead
How eBay Exports Billion-Scale Graphs on Transactional Graph Databases2023Link
eBay's Personalized User-Based Ranking Model for Recommendations2023Link
How Multimodal Embeddings Elevate eBay's Product Recommendations2023Link
eBay’s Blazingly Fast Billion-Scale Vector Similarity Engine2023Link
How eBay Handles Real-Time Push Notifications at Scale2022Link
Building a Deep Learning Based Retrieval System for Personalized Recommendations2022Link
How eBay Loads and Updates Over Ten-Billion-Vertex Graphs2021Link
eBay's real-time and performant index service for its large-scale, in-house database platform2021Link

Expedia

BlogYearRead
Inside Expedia’s Migration to ScyllaDB for Change Data Capture2024Link
How Expedia Built its Core Machine Learning Platform2024Link
How Expedia Group ranks website search results2024Link
How Expedia Built a Tool to Query Near Real-Time Streaming Data2023Link
Configuration Management at Expedia Group2023Link

Facebook

BlogYearRead
Indexing code at scale with Glean - Meta’s open source system2024Link
Inside Facebook’s video delivery system2024Link
Meta's Sequence learning Model for personalized ads recommendations2024Link
How Meta animates AI-generated images at scale2024Link
How Meta trains large language models at scale2024Link
Building Meta’s GenAI Infrastructure2024Link
RoCE networks for distributed AI training at scale2024Link
How Meta built the infrastructure for Threads2023Link
Building end-to-end security for Messenger2023Link
Modernizing Meta’s data platform2023Link
How Precision Time Protocol is being deployed at Meta2022Link
Scaling data ingestion for machine learning training at Meta2022Link
Meta’s cloud gaming infrastructure2022Link
Cache made consistent - How Meta handles cache invalidation2022Link
A highly available, strongly consistent storage service using chain replication2022Link
Making a distributed priority queue disaster-ready2022Link
How we built a general purpose key value store for Facebook with ZippyDB2021Link
Fully Sharded Data Parallel: faster AI training with fewer GPUs2021Link
How Facebook encodes your videos2021Link
Scaling a distributed priority queue at Meta2021Link
How machine learning powers Facebook’s News Feed ranking algorithm2021Link
How Meta scaled Live streaming for millions of viewers simultaneously2020Link

Figma

BlogYearRead
The infrastructure behind AI search in Figma2024Link
Speeding up file load times at Figma2024Link
Figma's LiveGraph: a real-time data system at scale2024Link
How Figma horizontally scaled Postgres to unlock nearly infinite scalability2024Link
How Figma improved performance and load time with incremental frame loading2024Link
How Figma reduced potential instability by scaling to multiple databases2023Link
The hidden challenges of autosave2020Link
Figma's deep search to find the right files even faster2020Link

Flipkart

BlogYearRead
Flipkart's MySQL Highly Available Setup2023Link
Running a multi-region Zookeeper at Flipkart2021Link
Memory Tuning a High Throughput Microservice2021Link
Building Flipkart's Personalized Search Autosuggestion2021Link
Predicting your next query even before you type!2021Link
How Flipkart Adapted Search to Indian Phonetics2020Link

GitHub

BlogYearRead
How we improved push processing on GitHub2024Link
How GitHub uses merge queue to ship hundreds of changes every day2024Link
How GitHub Docs’ new search works2023Link
The technology behind GitHub’s new code search2023Link
Scaling Git’s garbage collection2022Link
Improve Git monorepo performance with a file system monitor2022Link
Partitioning GitHub’s relational databases to handle scale2021Link

GoDaddy

BlogYearRead
How A/B Testing Transformed Product Development at GoDaddy2025Link
AI-Powered Social Media Posts2025Link
Generative AI Domain Search2024Link
How LLMs Are Enhancing GoDaddy’s CMS Experience2024Link
API Gateway at GoDaddy2023Link
GoDaddy's Search Data Infrastructure to find domain names2022Link

Google Research

BlogYearRead
Load balancing with random job arrivals2025Link
Transformers in music recommendation2024Link
Scaling multimodal understanding to long videos2023Link
Answering billions of reporting queries each day with low latency2023Link
Grammar checking at Google Search scale2023Link
World scale inverse reinforcement learning in Google Maps2023Link
Resolving code review comments with ML2023Link

Grab

BlogYearRead
Grab AI Gateway: Connecting Grabbers to multiple GenAI providers2025Link
Leveraging RAG-powered LLMs for analytical tasks2024Link
ML Model serving platform at Grab2024Link
LLM-powered data classification for data entities at scale2024Link
Enabling near real-time data analytics on the data lake2024Link
The journey of building a comprehensive attribution platform2024Link
Kafka on Kubernetes: Reloaded for fault tolerance2023Link
Sliding window rate limits in distributed systems2023Link
Road localisation in GrabMaps2023Link
Building hyperlocal GrabMaps2023Link
How Grab stores and processes millions of orders daily2022Link
How Kafka Connect helps move data seamlessly at Grab2022Link
Real-time data ingestion in Grab2022Link
How Grab built a scalable, high-performance ad server2022Link
Using real-world patterns to improve driver-rider matching2021Link
Search indexing optimisation at Grab2021Link
How Grab Built its In-house Chat Platform for the Web2020Link

Gusto

BlogYearRead
API Versioning At Gusto2025Link
How Gusto tackles AI Hallucinations in LLM Apps2025Link
Platform Engineering at Gusto2024Link
How Gusto simplifies large monoliths2023Link

Hashnode

BlogYearRead
Hashnode's Feed Architecture2023Link
Hashnode's Overall Architecture2023Link
How Hashnode generates personalized feeds that match users' interests2023Link
Hashnode's Rate Limiting Architecture2023Link
Building an Event-Driven Architecture at Hashnode2022Link
How Hashnode Sends Mass Personalised Emails using AWS Serverless Technologies2022Link
How Hashnode Leverages Serverless for Backing up Posts2022Link
How Hashnode Built Serverless Audio Blogs with AWS2022Link

Hostinger

BlogYearRead
How Hostinger Built one of the most advanced LLM-based chat assistants2024Link
How Hostinger Keeps Your Websites Safe2024Link
How Hostinger Deals With DDoS Attacks2022Link

Hotstar

BlogYearRead
Scaling Infrastructure for Millions at Hotstar2024Link
Hotstar’s tale of 10x scale up2023Link
Capturing A Billion Emo(j)i-ons2020Link

HubSpot

BlogYearRead
How Does HubSpot's Prediction Engine Score Millions of CRM Objects Daily2024Link
How HubSpot Upgraded a Thousand MySQL Clusters at Once2023Link
Saving Millions on the storage costs of application logs at HubSpot2023Link
Building a Fast, Thread-safe Hotspot Tracking Library2022Link
Cross Datacenter MySQL Data Replication2022Link
Supporting Cross-Region Kafka Messaging2022Link
Improving Database Reliability: Preventing Hotspotting with Client-Side Request Deduplication2022Link
Building a Vitess Balancer to Minimize MySQL Downtime2022Link

Instacart

BlogYearRead
Real-time Fraud Detection with Yoda and ClickHouse2024Link
How Instacart Uses ML to Suggest Replacements for Out-of-Stock Products2024Link
Sequence models for Contextual Recommendations at Instacart2024Link
Supercharging Discovery in Search with LLMs2024Link
Optimizing search relevance at Instacart using hybrid retrieval2024Link
Instacart’s Item Availability Architecture: Solving for scale and consistency2023Link
Instacart's one Deep Learning model for multiple surfaces2023Link
Distributed Machine Learning at Instacart2023Link
How Instacart Uses Embeddings to Improve Search Relevance2022Link
The Journey to Real-Time Machine Learning at Instacart2022Link
How Instacart Uses ML-Driven Autocomplete to Help People Fill Their Carts2022Link
How Instacart optimized its Logistics engine using ML2021Link
A simple search query correction heuristic for the resource-constrained2020Link
Predicting the real-time availability of 200 million grocery items2018Link
How Instacart delivers on time2018Link

Instagram

BlogYearRead
Scaling the Instagram Explore recommendations system2023Link
Reducing Instagram’s basic video compute time by 94 percent2022Link
Improving Instagram notification management with machine learning and causal inference2022Link
Building text animations for Instagram Stories2022Link
Pushing the limits of compression in Facebook’s mobile apps2021Link
How Instagram suggests new content2020Link

LinkedIn

BlogYearRead
Scalable federated learning at LinkedIn2025Link
Building a resilient DNS client for web-scale infrastructure2025Link
Journey of next generation control plane for data systems2025Link
Candidate Generation in a Large Scale Graph Recommendation System2024Link
Accelerating LinkedIn’s My Network tab by reducing latency and improving UX2024Link
Tuning Java for high-performance services2024Link
LinkedIn OpenHouse for Big Data Management2023Link
How LinkedIn Adopted A GraphQL Architecture for Product Development2023Link
How LinkedIn Is Using Embeddings to Up Its Match Game for Job Seekers2023Link
Building the Infrastructure for Delivering Labor Market Insights from LinkedIn Data2023Link
Upscaling LinkedIn's Profile Datastore While Reducing Costs2023Link
Unifying Messaging Experiences across LinkedIn2023Link
Applying multitask learning to AI models at LinkedIn2022Link
Building a mutable dataset in data lake2022Link
Completing a member knowledge graph with Graph Neural Networks2021Link
Homepage feed multi-task learning using TensorFlow2021Link
Evolving LinkedIn’s analytics tech stack2021Link
Scaling LinkedIn's Hadoop YARN cluster beyond 10,000 nodes2021Link
HTTP/2 in infrastructure: Ambry network stack refactoring2021Link
Building a heterogeneous social network recommendation system2020Link

Lyft

BlogYearRead
From Big Data to Better Data: Ensuring Data Quality with Verity2023Link
Building Real-time Machine Learning Foundations at Lyft2023Link
The Recommendation System at Lyft2023Link
lyft2vec — Embeddings at Lyft2023Link
Powering Millions of Real-Time Decisions with LyftLearn Serving2023Link
Pricing at Lyft2022Link
ML Model Training Infrastructure built on Kubernetes2021Link
Elasticsearch Optimizations at Lyft2021Link
How Lyft discovered OpenStreetMap is the Freshest Map for Rideshare2021Link
Using Client-Side Map Data to Improve Real-Time Positioning2021Link
How Lyft predicts a rider’s destination for better in-app experience2020Link
A New Real-Time Map-Matching Algorithm at Lyft2020Link

Medium

BlogYearRead
Taming Post Claps - The Two Billion Claps Bug2024Link
How Medium uses ScyllaDB to build a fast and scalable data layer2024Link
Building a ChatGPT Plugin for Medium2023Link
Fixing duplicate stories in Medium’s For You feed2023Link
Kubernetes Infrastructure At Medium2023Link
How Medium counts your followers2020Link
Scaling Email Infrastructure for Medium Digest2020Link
Mapping Medium’s Tags using ML2018Link
Microservice Architecture at Medium2018Link

Netflix

BlogYearRead
Foundation Model for Personalized Recommendation2025Link
How Netflix processes billions of impressions daily2025Link
Netflix’s Distributed Counter Abstraction2024Link
Evolving Netflix’s WebSocket proxy for the future2024Link
Netflix’s Key-Value Data Abstraction Layer2024Link
Netflix’s TimeSeries Data Abstraction Layer2024Link
Recommending for Long-Term Member Satisfaction at Netflix2024Link
Maestro: Data/ML Workflow Orchestrator at Netflix2024Link
Reverse Searching Netflix’s Federated Graph2024Link
Supporting Diverse ML Systems at Netflix2024Link
Rebuilding Netflix Video Processing Pipeline with Microservices2024Link
Building In-Video Search2023Link
Streaming SQL in Data Mesh2023Link
Migrating Netflix to GraphQL Safely2023Link
Scaling Media Machine Learning at Netflix2023Link
Building a Media Understanding Platform for ML Innovations2023Link
Finding Cuts with Smooth Visual Transitions Using Machine Learning2022Link
Machine Learning for Fraud Detection in Streaming Services2022Link
Netflix’s High-Throughput, Low-Latency Priority Queueing System2022Link
Rapid Event Notification System at Netflix2022Link
Building Netflix’s Distributed Tracing Infrastructure2020Link

Notion

BlogYearRead
Building and scaling Notion’s data lake2024Link
How we sped up Notion in the browser with WASM SQLite2024Link
The Great Re-shard: adding Postgres capacity (again) with zero downtime2023Link
Creating the Notion API2022Link
The data model behind Notion's flexibility2021Link
Lessons learned from sharding Postgres at Notion2021Link

PayPal

BlogYearRead
Scaling PayPal’s AI Capabilities with PayPal Cosmos.AI Platform2024Link
Scaling Kafka to Support PayPal’s Data Growth2023Link
JunoDB: PayPal’s Key-Value Store2023Link
Scaling Kubernetes to Over 4k Nodes and 200k Pods2022Link
GraphQL at PayPal: An Adoption Story2021Link
How PayPal Uses Real-time Graph Database and Graph Analysis to Fight Fraud2021Link
Next-Gen Data Movement Platform at PayPal2021Link
Deploying Large-scale Fraud Detection Machine Learning Models at PayPal2021Link

Pinterest

BlogYearRead
How Pinterest improved Search Relevance using LLMs2025Link
How Pinterest built its Text-to-SQL feature2024Link
Change Data Capture at Pinterest2024Link
Real Time Anomaly Detection at Pinterest2023Link
Improving Distributed Caching Performance and Efficiency at Pinterest2022Link
How Pinterest Leverages Realtime User Actions to Boost Homefeed Engagement Volume2022Link
How Pinterest scaled the size of its ad corpus by 60x2021Link
The machine learning behind delivering relevant ads2021Link

Quora

BlogYearRead
Building Embedding Search at Quora2024Link
Migrating a decade of Redshift usages to Trino at Quora2024Link
Trino at Quora Scale: Cost, Speed, and Reliability2023Link
MySQL sharding at Quora2020Link

Razorpay

BlogYearRead
Razorpay’s Authentication Revamp2023Link
The Making of Razorpay Developer-Console2023Link
How Razorpay Reduced Data Platform Cost by $2M2023Link
Reducing Kubernetes cost by $300,000 at Razorpay2023Link
How does Razorpay Capital Detect Duplicate or Fraud Merchants?2023Link
Razorpay's Real-Time Denormalized Data Streaming Platform2023Link
How Razorpay’s Notification Service Handles Increasing Load2022Link
How Trino and Alluxio power analytics at Razorpay2022Link
Handling Burst Traffic During IPL2021Link

Reddit

BlogYearRead
Evolving Reddit's Media Infrastructure2025Link
Scaling our Apache Flink powered real-time ad event validation pipeline2025Link
Scaling Reddit’s ad-serving system2024Link
Product Candidate Generation for Reddit Dynamic Product Ads2024Link
Scaling Ads Pacing: from Singleton to Sharded2024Link
Introducing a Global Retrieval Ranking Model in the Ads Funnel2024Link
Building an Experiment-Based Routing Service2023Link
The Reddit Media Metadata Store2023Link

Salesforce

BlogYearRead
Scaling Real-Time Search to 30 Billion Queries with Sub-Second Latency and 0% Downtime2025Link
Scaling Agentic AI Powering 2 Billion Predictions Monthly2025Link
How Agentforce Data Library Powers RAG with 99.99% Uptime2025Link
Secrets for Managing 100,000 Training and Metadata Requests Per Minute2024Link
Inside the Brain of Agentforce2024Link
How Salesforce Supports Millions of Users Seamlessly for GenAI2024Link
Inside Salesforce’s Scalable Time Series Forecasting AI Platform2024Link
How Salesforce's Data Cloud Handles 250 Trillion Transactions Weekly2024Link

Shopify

BlogYearRead
How Shopify improved consumer search intent with real-time ML2024Link
Horizontally scaling the Rails backend of Shop app with Vitess2024Link
Improving Shopify App’s Performance2024Link
Building a ShopifyQL Code Editor2023Link
Creating a Flexible Order Routing System with Shopify Functions2023Link
Using Server Sent Events to Simplify Real-time Streaming at Scale2022Link
Capturing Every Change From Shopify’s Sharded Monolith2021Link

Slack

BlogYearRead
How Slack Optimizes its E2E Pipeline2025Link
How Slack built enterprise search to be secure and private2025Link
Advancing Our Chef Infrastructure2024Link
How We Re-Architected Slack for Our Largest Customers2024Link
How Slack automatically detects stolen session cookies2024Link
How a request flows, from a Slack user’s perspective2023Link
Slack’s Migration to a Cellular Architecture2023Link
Real-time Messaging at Slack2023Link
How Slack traces the flow of notifications across systems2023Link
Slack's Unified end-to-end machine learning infrastructure to generate recommendations2023Link
How We Design Our APIs at Slack2021Link
How we built an eventually-consistent data model to predict Slack Connect invites2021Link
Migrating Millions of Concurrent Websockets to Envoy2021Link
Scaling Datastores at Slack with Vitess2020Link

Snap

BlogYearRead
Bento - Snap's ML Platform2025Link
Snap's Embedding-based Retrieval for its video recommendation system2023Link
How Snap Speeds Up Feature Engineering for Recommendation Systems2022Link
How Snap leverages synthetic data to boost the development of ML models2022Link
Training Large-Scale Recommendation Models with TPUs2022Link
Machine Learning for Snapchat Ad Ranking2022Link

Spotify

BlogYearRead
How Spotify Generated Millions of Content Annotations2024Link
Spotify's Data Platform2024Link
The What, Why, and How of Mastering App Size2023Link
How Spotify Automated Content Marketing to Acquire Users at Scale2023Link
How We Built Infrastructure to Run User Forecasts at Spotify2022Link

Squarespace

BlogYearRead
Why We Built a Write Back Cache for Our Asset Library2024Link
Developing Fluid Engine2022Link
How we use WebGL at Squarespace2022Link
A Better Way to Upload Images2022Link

Stripe

BlogYearRead
Stripe’s system for tracking and validating money movement2024Link
How Stripe Processed $1 Trillion in Payments with Zero Downtime2023Link
How Stripe built its fraud prevention system2023Link
How Stripe builds interactive docs with Markdoc2022Link
Stripe’s payments APIs: The first 10 years2020Link

Swiggy

BlogYearRead
Swiggy's Text-to-SQL Solution2024Link
Optimising the picking process to enable faster deliveries for Instamart2024Link
Improving search relevance in hyperlocal food delivery using (small) language models2024Link
Predicting Food Delivery Time at Cart2023Link
Contextual Bandits for Ads Recommendations2022Link
Using deep learning to detect dissonance between address text and location2022Link
Designing Resilient Microservices at Swiggy2021Link
Designing the Serviceability Platform at Swiggy for High Scale2021Link
A brief introduction to Engineering challenges at Swiggy2021Link
Re-Architecting Swiggy’s logistics systems2021Link
Using Deep Learning for Ranking in Dish Search2021Link
Learning to Predict Two-Wheeler Travel Distance2021Link
Learning To Rank Restaurants2021Link
Running Geo Queries At Scale2020Link
Building Video Stories and Caching2020Link
Deploying deep learning models at scale at Swiggy2020Link
Decoding Food Intelligence at Swiggy2020Link

Target

BlogYearRead
A Deep Dive into Data Replication Mechanisms2025Link
Predictive Modeling for Availability of Inventory2024Link
Contextual Offer Recommendations Engine at Target2024Link
Bundled Product Recommendations at Target2024Link
Target AutoComplete: Real Time Item Recommendations at Target2023Link
Real-Time Personalization Using Microservices2023Link

Timescale

BlogYearRead
Document Loading, Parsing, and Cleaning in AI Applications2025Link
Building a RAG System With Claude, PostgreSQL & Python on AWS2025Link
Automating Data Enrichment in PostgreSQL With OpenAI2025Link
Semantic Search With Ollama and PostgreSQL2025Link
PostgreSQL Indexes for Columnstore2025Link
Handling Billions of Rows in PostgreSQL2025Link
Enhancing Text-to-SQL With Synthetic Summaries2025Link
Scale PostgreSQL via Partitioning2024Link

Tinder

BlogYearRead
Tinder API Style Guide2024Link
Building Obsidian, Tinder’s Design System2023Link
How Tinder built its API Gateway2022Link
Scaling out Tinder Android Payment Flow using State Machine2020Link

Twitch

BlogYearRead
Ingesting Live Video Streams at Global Scale2022Link
Breaking the Monolith at Twitch2022Link
Using Machine Learning to Review Emotes2022Link
Defense, threat modeling and High Availability at Twitch2021Link

Uber

BlogYearRead
Migrating Uber’s Compute Platform to Kubernetes2025Link
MySQL At Uber2025Link
How Uber Uses Ray® to Optimize the Rides Business2025Link
How Uber Optimizes LLM Training2024Link
Natural Language to SQL Using Gen AI2024Link
Lucene: Uber’s Search Platform2024Link
Uber’s implementation of Live Activity on iOS2024Link
Odin: Uber’s Stateful Platform2024Link
Kafka Tiered Storage at Uber2024Link
Modernizing Logging at Uber with CLP2024Link
How Uber ensures Apache Cassandra®’s tolerance for single-zone failure2024Link
How LedgerStore Supports Trillions of Indexes at Uber2024Link
Balancing HDFS DataNodes in the Uber DataLake2024Link
How Uber Serves Over 40 Million Reads Per Second from Online Storage Using an Integrated Cache2024Link
How Uber Optimized Cassandra Operations At Scale2023Link
How Uber Optimizes the Timing of Push Notifications using ML and Linear Programming2022Link
Deduping and Storing Images at Uber Eats2022Link
Uber’s Next Gen Push Platform on gRPC2022Link
Uber’s Highly Scalable and Distributed Shuffle as a Service2022Link
How Uber Predicts Arrival Times Using Deep Learning2022Link
Real-Time Exactly-Once Ad Event Processing with Apache Flink, Kafka, and Pinot2021Link

Vimeo

BlogYearRead
Unlocking knowledge sharing for videos with RAG2024Link
A deep dive into Vimeo’s storage strategy for videos2023Link

Walmart

BlogYearRead
Walmart’s Cassandra CDC Solution2022Link
Scaling the Walmart Inventory Reservations API for Peak Traffic2022Link
A Markov Chain Formulation for the Grocery Item Picking Process2021Link
How we rebuilt the Walmart Autocomplete Backend2021Link
Building a Notification Framework for Microservice-based Application2021Link

Twitter (X)

BlogYearRead
Twitter's Recommendation Algorithm2023Link
How we scaled Reads On the Twitter Users Database2023Link
Powering real-time data analytics with Druid at Twitter2022Link
How we built Twitter’s highly reliable ads pacing service2021Link
Storing and retrieving millions of ad impressions per second2021Link
Processing billions of events in real time at Twitter2021Link
Logging at Twitter2021Link
Twitter’s ads serving platform2021Link

Yelp

BlogYearRead
Search Query Understanding with LLMs2025Link
Enhancing Neural Network Training at Yelp2025Link
Boosting ML Pipeline Efficiency2024Link
Yelp’s AI pipeline for inappropriate language detection in reviews2024Link
Rebuilding a Cassandra cluster using Yelp’s Data Pipeline2023Link

Zendesk

BlogYearRead
Improving job execution by ditching the job executor2025Link
Provisioning Kafka topics the easy way2024Link
Moving from DynamoDB to tiered storage with MySQL+S32023Link

Zillow

BlogYearRead
Leveraging Knowledge Graphs in Real Estate Search2025Link
The Data Infra Behind Zillow’s 3x Growth in Experiment Volume2023Link
Serving Machine Learning Models Efficiently at Scale at Zillow2022Link
Optimizing Elasticsearch for Low Latency, Real-Time Recommendations2022Link

Zomato

BlogYearRead
Building a cost-effective logging platform for petabyte scale2023Link
How Zomato Handles 100 Million Daily Search Queries2023Link
How Zomato Powers restaurant ads using ML2022Link
How Zomato uses embeddings to identify and cluster unique addresses2022Link
How Zomato predicts your order's Food preparation time2022Link
How Zomato locates its users2021Link
The Deep Tech Behind Estimating Food Preparation Time2020Link

Written by Rahul Aher

I'm Rahul, a Sr. Software Engineer (SDE II) and passionate content creator. I share my expertise in software development to help learners.
