Cover Image for Awesome AI Dev Tools - May
Cover Image for Awesome AI Dev Tools - May
Avatar for Open Source for AI
Presented by
Open Source for AI
Event series focused on amplifying activities that promote open source for AI tools.
Hosted By
407 Going
Registration
Welcome! To join the event, please register below.
About Event

You probably already know that AI is changing the world. Come learn about some of the awesome AI dev tools being worked on.

Agenda

5:00 - 5:30: Arrival

5:30 - 5:35: Intro

5:40 - 6:10: Laurie Voss, VP of DevRel at LlamaIndex

6:10 - 6:40: Accelerate Data Loading for AI/GenAI, Lucy Ge, Staff Software Engineer and Tarik Bennett, Senior Solution Engineer at Alluxio

6:40 - 7:10: Data Versioning in Generative AI: A Pathway to Cost-effective ML, Dmitry Petrov, CEO/Founder at DVC.ai

7:10 - 7:25: TBA, Salad

7:25 - 7:30: Raffle of $200 worth of prizes sponsored by Alluxio

7:30 - End: Post Talk Networking

About the Talks

Data Versioning in Generative AI: A Pathway to Cost-effective ML:
For 5 years, we've built DVC, understanding the benefits of data versioning. With evolving Generative AI workflows, versioning needs to adapt. This era relies on vast unstructured data—images, videos, audio, MRI scans, and more—scaling into billions of objects. Managing this, along with resource-intensive model scoring, poses unique challenges. In this talk, we'll explore data versioning for Generative AI, focusing on minimizing processing time and API calls like ChatGPT, leading to cost savings. We'll also discuss sharing datasets for collaboration and recent transformations in data versioning, such as annotations and embeddings. These insights offer a deep dive into the changing data management landscape in Generative AI.

About the Speakers

Dmitry Petrov is the CEO and co-founder of Iterative.ai, working on building data-centric MLOps tools. He’s an ex-data scientist at Microsoft with a Ph.D. in Computer Science and an active open-source contributor. He has written and open-sourced the first version of DVC.org (part of Iterative) – a data versioning and machine learning workflow management tool. He also implemented a wavelet-based image hashing algorithm (wHash) in the open-source library ImageHash for Python.

Location
GitHub
88 Colin P Kelly Jr St, San Francisco, CA 94107, USA
Avatar for Open Source for AI
Presented by
Open Source for AI
Event series focused on amplifying activities that promote open source for AI tools.
Hosted By
407 Going