Cover Image for vLLM & NVIDIA Triton User Meetup
Cover Image for vLLM & NVIDIA Triton User Meetup
Avatar for vLLM Meetups and Events
Join the vLLM community to discuss optimizing LLM inference!
338 Going
Registration
Past Event
Please click on the button below to join the waitlist. You will be notified if additional spots become available.
About Event

We are excited to invite you to the next vLLM meetup hosted by NVIDIA at Gallery 308 of the Fort Mason Center For Arts & Culture.

​What to Expect:

  • ​Inspiring Talks: hear from vLLM and NVIDIA Triton Inference Server experts as they present recent updates and innovative advancements in the field.

  • ​Expert Insights: dive into the latest developments in vLLM. Learn about the latest Triton Inference Server features including vLLM support, Kubernetes multi-node scaling, Python in-process API, OpenAI compatible API, and performance benchmarking.

  • ​Networking Opportunities: connect with fellow AI builders, researchers, and enthusiasts to share ideas and best practices.

Agenda: 

4:00 - 5:30 pm: Doors open and social hour 

5:30 - 6:30 pm: vLLM presentation

6:30 - 7:30 pm: NVIDIA Triton Inference Server presentation

7:30 - 9:00 pm: Networking and light refreshments

​To get there:

Public parking is available at the venue. Parking fee is $3/hr and $15 for the entire day. The parking fee can be paid at the kiosk. Please find your way to Gallery 308 after parking.

​All attendees will be required to show a photo ID at the reception.

Location
Landmark Building A, 2 Marina Blvd, San Francisco, CA 94123
Avatar for vLLM Meetups and Events
Join the vLLM community to discuss optimizing LLM inference!
338 Going