Cover Image for Breaking the AI Inference Bottleneck with AWS Trainium and vLLM
Cover Image for Breaking the AI Inference Bottleneck with AWS Trainium and vLLM
456 Went

Breaking the AI Inference Bottleneck with AWS Trainium and vLLM

Hosted by Ruchi Bhatia & AWS Builder Loft (Formerly AWS GenAI Loft)
Registration
Past Event
Welcome! To join the event, please register below.
About Event

The surge in reasoning models and agentic AI has exposed a critical infrastructure gap: conventional deployment approaches can't deliver the performance these applications demand without prohibitive costs. Learn how AWS Trainium and vLLM solve this challenge, enabling production-scale AI that's both lightning-fast and cost-effective.

Join us for an engaging evening featuring a hands-on workshop, tech talks, networking opportunities, and delicious food and drinks. Seats are limited. We encourage you to register in advance to secure your spot.

What to expect:

  • 2:30 pm - Welcome and Registration

  • 3:00 pm - AWS Overview Introduction

  • 3:20 pm - Hands-on Workshop featuring AWS Trainium and vLLM

  • 4:00 pm - Tech Talk: Optimizing Performance for Your Applications

  • 4:20 pm - Tech Talk: Disaggregated Prefill on AWS Trainium and vLLM for Enhanced Performance

  • 4:40 pm - Tech Talk: vLLM's Trainium Optimization Roadmap: Performance Enhancements and Features

  • 5:00 pm - Networking and Happy Hour

Speakers:
Jim Burtoft, Senior Solutions Architect, AWS
Pinak Panigrahi, Senior GenAI Architect, Annapurna ML, AWS
Mrinal Shukla, Serving Acceleration Senior Manager, Annapurna ML, AWS

Note

  • Please bring a physical government-issued photo ID. Digital IDs won't be accepted.

  • There is no scooter/bike parking available in the building. If you plan to bring one, you will need to find parking options nearby.

  • All attendees must be 18 or older to attend this event at the space.

Location
AWS Builder Loft
525 Market St, San Francisco, CA 94105, USA
456 Went