Breaking the AI Inference Bottleneck with AWS Trainium and vLLM

Name: Breaking the AI Inference Bottleneck with AWS Trainium and vLLM
Start: 2025-08-14T14:30:00.000-07:00
End: 2025-08-14T18:30:00.000-07:00
Location: AWS Builder Loft

Hosted by Ruchi Bhatia & AWS Builder Loft (Formerly AWS GenAI Loft)

AWS Builder Loft

San Francisco, California

Past Event

Welcome! To join the event, please register below.

You will be asked to verify token ownership with your wallet.

About Event

The surge in reasoning models and agentic AI has exposed a critical infrastructure gap: conventional deployment approaches can't deliver the performance these applications demand without prohibitive costs. Learn how AWS Trainium and vLLM solve this challenge, enabling production-scale AI that's both lightning-fast and cost-effective.

Join us for an engaging evening featuring a hands-on workshop, tech talks, networking opportunities, and delicious food and drinks. Seats are limited. We encourage you to register in advance to secure your spot.

What to expect:

2:30 pm - Welcome and Registration
3:00 pm - AWS Overview Introduction
3:20 pm - Hands-on Workshop featuring AWS Trainium and vLLM
4:00 pm - Tech Talk: Optimizing Performance for Your Applications
4:20 pm - Tech Talk: Disaggregated Prefill on AWS Trainium and vLLM for Enhanced Performance
4:40 pm - Tech Talk: vLLM's Trainium Optimization Roadmap: Performance Enhancements and Features
5:00 pm - Networking and Happy Hour

Speakers:
Jim Burtoft, Senior Solutions Architect, AWS
Pinak Panigrahi, Senior GenAI Architect, Annapurna ML, AWS
Mrinal Shukla, Serving Acceleration Senior Manager, Annapurna ML, AWS

Note

Please bring a physical government-issued photo ID. Digital IDs won't be accepted.
There is no scooter/bike parking available in the building. If you plan to bring one, you will need to find parking options nearby.
All attendees must be 18 or older to attend this event at the space.

Location

AWS Builder Loft

525 Market St, San Francisco, CA 94105, USA

Hosted By

456 Went

AI