


NYC vLLM Meetup
Join us for the NYC vLLM meetup!
We're excited to invite you to the first NYC vLLM meetup, hosted by Red Hat and IBM on May 7, 2025, at 1 Madison Avenue in New York City. Look forward to hearing from speakers from AMD, IBM, Meta (the PyTorch Team), and Red Hat.
This is your chance to learn from and connect with a growing community of vLLM users, developers, maintainers, and engineers. Together, we'll dive into technical talks, share insights, and discuss how we're optimizing LLM inference for performance and efficiency.
Agenda
5:00pm: Doors Open
5:30pm: Intro to vLLM & Project Update
5:50pm: Intro to torch.compile and How It Works with vLLM
6:20pm: Demo of Production Deployment of vLLM on AMD
6:50pm: Intro to Mamba SSM Architecture
7:10pm: Q&A and Open Discussion
7:30pm: Pizza and Networking 🍕 🤝
Important Information
Registration Deadline: Registration closes 72 hours before the event. We will be unable to admit anyone who is not registered.
Check-In: Please bring a photo ID to verify your registration upon arrival.
No Recordings: This event will not be recorded, so don’t miss out on this live experience!
We look forward to seeing you there!