vLLM & NVIDIA Triton User Meetup

Name: vLLM & NVIDIA Triton User Meetup
Start: 2024-09-09T16:00:00.000-07:00
End: 2024-09-09T21:00:00.000-07:00
Location: Landmark Building A, 2 Marina Blvd, San Francisco, CA 94123

vLLM Meetups and Events

Past Event

About Event

We are excited to invite you to the next vLLM meetup hosted by NVIDIA at Gallery 308 of the Fort Mason Center For Arts & Culture.

What to Expect:

Inspiring Talks: hear from vLLM and NVIDIA Triton Inference Server experts as they present recent updates and innovative advancements in the field.
Expert Insights: dive into the latest developments in vLLM and learn about recent Triton Inference Server features, including vLLM support, Kubernetes multi-node scaling, the Python in-process API, the OpenAI-compatible API, and performance benchmarking.
Networking Opportunities: connect with fellow AI builders, researchers, and enthusiasts to share ideas and best practices.

Agenda:

4:00 - 5:30 pm: Doors open and social hour

5:30 - 6:30 pm: vLLM presentation

6:30 - 7:30 pm: NVIDIA Triton Inference Server presentation

7:30 - 9:00 pm: Networking and light refreshments

To get there:

Public parking is available at the venue. Parking costs $3/hr or $15 for the entire day and can be paid at the kiosk. After parking, please make your way to Gallery 308.

All attendees will be required to show a photo ID at the reception.

Presented by

vLLM Meetups and Events

Join the vLLM community to discuss optimizing LLM inference!

338 Went