vLLM: Easy, Fast, and Cheap LLM Serving for Everyone
For developers, builders, AI enthusiasts, and anyone looking to optimize LLM serving or contribute to an open-source project.

📅 When: July 18th, 2024
⏰ Time: 10:30 AM PST
📍 Where: Zoom Webinar

📋 Agenda:
This presentation introduces vLLM, a high-performance open-source LLM inference engine. We'll cover:
Overview of vLLM and its key benefits
Deep dive into features like pipeline parallelism and speculative decoding
Live demonstration of setting up and optimizing LLM inference (see the sketch after this list)
Contribution opportunities and current development priorities
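As a preview of the live demo, here is a minimal offline-inference sketch built on vLLM's documented Python API. The model name, prompts, and sampling values below are illustrative placeholders, not settings from the talk:

```python
# pip install vllm
from vllm import LLM, SamplingParams

# Example prompts; any list of strings works.
prompts = [
    "The future of LLM serving is",
    "vLLM speeds up inference by",
]

# Sampling settings are placeholders; tune them for your workload.
sampling_params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=64)

# Placeholder model; any HuggingFace model supported by vLLM works here.
llm = LLM(model="facebook/opt-125m")

# One batched call generates completions for every prompt.
outputs = llm.generate(prompts, sampling_params)
for output in outputs:
    print(output.prompt, "->", output.outputs[0].text)
```

Agenda items like speculative decoding and pipeline parallelism are enabled through additional engine arguments whose names vary across vLLM versions; the session will demonstrate them live.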

🎤 Speakers:
Woosuk Kwon - Ph.D. student at UC Berkeley, creator of vLLM
Kaichao You - Ph.D. student at Tsinghua University, vLLM contributor

💫 What's the Deal?
We promise 1 hour of cutting-edge AI technology, insightful demonstrations, and the chance to connect with the creators of an amazing open-source project. Learn how to significantly improve your AI infrastructure's performance and cost-efficiency!

🔗 Resources:
โvLLM GitHub: https://github.com/vllm-project/vllm