Uber is expanding its infrastructure and artificial intelligence capabilities on Amazon Web Services (AWS) as it looks to improve real-time performance across its global ride-sharing and delivery platforms.
- Scaling cloud and AI capabilities to support millions of daily rides and deliveries
- Scaling real-time infrastructure for ride matching
- AI model training with next-generation cloud chips
- Supporting global demand spikes across the Uber platform
- AWS strengthens role in powering large-scale real-time applications
Scaling cloud and AI capabilities to support millions of daily rides and deliveries
The company is increasing its use of AWS computing infrastructure to support its Trip Serving Zones, the systems responsible for matching riders with drivers and coordinating deliveries in real time. Uber has also begun piloting AWS’s Trainium AI chips to train models that power predictions and recommendations across its apps.
The move is designed to help Uber handle millions of daily ride and delivery requests, improve matching speed, and support more personalized user experiences.
Scaling real-time infrastructure for ride matching
Behind every ride or delivery request on the Uber platform is a system designed to process large volumes of real-time data.
Uber’s Trip Serving Zones infrastructure processes location data and performs millions of predictions in milliseconds to determine factors such as the closest driver, estimated arrival times, and optimal routes.
To support these operations at scale, Uber is expanding its use of AWS compute, storage, and networking services, including the use of the company’s Graviton processors.
By running more workloads on AWS Graviton4, Uber aims to improve performance during periods of high demand, such as rush hour or major events, while reducing latency and optimizing costs.
According to Kamran Zargahi, Vice President of Engineering at Uber, the speed and flexibility of cloud infrastructure are critical to the platform’s real-time operations.
“Uber operates at a scale where milliseconds matter,” said Kamran Zargahi, Vice President of Engineering at Uber. “Moving more Trip Serving workloads to AWS gives us the flexibility to match riders and drivers faster and handle delivery demand spikes without disruption.”
AI model training with next-generation cloud chips
In addition to expanding real-time infrastructure, Uber has begun piloting AI model training on AWS Trainium chips.
These models analyse data generated from billions of rides and deliveries to determine key operational decisions, including which driver or courier to assign to a trip, predicting arrival times, and recommending delivery options.
Uber is experimenting with AWS Trainium3, a chip designed specifically for training large-scale artificial intelligence models.
Using Trainium allows Uber to train AI systems more efficiently while processing the large datasets required to support its global services.
“By starting to pilot some of our AI models on Trainium, we’re building a technology foundation that will make every Uber experience smarter—so we can keep our focus where it belongs: on the people who use Uber every day,” said Kamran Zargahi, Vice President of Engineering at Uber.
Supporting global demand spikes across the Uber platform
The cloud infrastructure expansion also supports Uber’s ability to scale services during sudden increases in demand.
Trip Serving Zones must manage millions of concurrent interactions, including driver availability, rider location data, and delivery logistics.
Running these workloads on AWS enables Uber to scale infrastructure quickly while maintaining reliability, availability, and security, according to the companies.
AWS strengthens role in powering large-scale real-time applications
For AWS, Uber represents one of the most complex real-time application environments running on its cloud infrastructure.
Rich Geraffo, Vice President and Managing Director of North America at Amazon Web Services, said the collaboration highlights the growing role of cloud and AI infrastructure in large-scale digital platforms.
“Uber is one of the most demanding real-time applications in the world, and we’re proud to be an important part of the infrastructure powering their global operations,” said Rich Geraffo, Vice President and Managing Director of North America at AWS. “We’re helping Uber deliver the reliability hundreds of millions of people count on today—and the AI-powered experiences that will define ride-sharing and on-demand delivery tomorrow.”



