Cover Image for Unstructured Data Meetup SF
Cover Image for Unstructured Data Meetup SF
Avatar for Unstructured Data Meetup
Registration Closed
This event is not currently taking registrations. You may contact the host or subscribe to receive updates.
About Event

This is an in-person event! Registration is required in order to get in. Github will email you a form the day before the event, which you will need to complete for your access pass.

Topic: Connecting your unstructured data with Generative AI

What we’ll do:
Have some food and refreshments. Hear three exciting talks about unstructured data and generative AI.

5:30 - 6:30 - Welcome/Networking/Registration
7:05 - 7:30 - James Luan, VP Engineering, Zilliz
6:35 - 7:00 - Lance Martin, Software/ML, LangChain
7:35 - 8:00 - AJ Steers, Staff Software Engineer, Airbyte
8:00 - 8:10 - Community demos
8:10 - 8:30 - Networking

Who Should attend:
Anyone interested in talking and learning about Unstructured Data and Generative AI Apps.

When:
March 19, 2023
5:30PM

Tech Talk 1: VectorDB Schema Design 101 - Considerations for Building a Scalable and Performant Vector Search
Speaker: James Luan
Abstract: People often say that vector search is easy, but that's not entirely true. Vector search is more than just vector indexing and a Python wrapper. If you want to build a high-performance, scalable, and production-ready vector search service, you need to consider many factors.

Milvus is a high-performance, highly scalable vector database built for the cloud. Like a traditional database, we have designed a schema system with concepts such as collections, fields, shards, partitions, indexes, and various secondary indexes, to improve the performance of vector search.

In this talk, I will share how to correctly design your database schema structure and use the right data model to enable you to use Milvus with efficiency and scalability. We will also introduce complex use cases in RAG applications, such as multi-tenancy, scalar filtering/scalar indexing, and hybrid searches.

Tech Talk 2: LangGraph for Flow Engineering
Speaker: Lance Martin
Abstract: Recent work on code generation and RAG have used a "flow" paradigm rather than a naive "prompt:answer" paradigm. With a flow paradigm, answers can be iteratively constructed by testing and / or self-reflection to improve the solution. We recently launched LangGraph to support flow engineering, giving the user the ability to represent flows as a graph. We've shown that this can be employed for RAG (Self-RAG, Corrective RAG) as well as code-generation (AlphaCodium).

Tech Talk 3: Text as Data, From Anywhere to Anywhere
Speaker: AJ Steers
Abstract: With great data comes great GenAI opportunities.” We live in a world where not only do we have access to amazing and valuable data, but we now have GenAI solutions that can consume, analyze, and act on that data in ways we could never have foreseen just 2 short years ago. In this talk, we’ll talk about the value of having a uniform way to access all your data, data you may have access to today without even knowing it, and how to send it exactly where it is needed - whether that’s to a data warehouse for analytics or to a vector store for the next great AI app.

Where:
This is an in-person event. Registration is required to get into the event. Registration in advance will close 2 days before the event. Co-sponsored by Zilliz maintainers of Milvus and Airbyte.

Also streaming: on twitch.tv at 6:30pm

Location
GitHub
88 Colin P Kelly Jr St, San Francisco, CA 94107, USA
Avatar for Unstructured Data Meetup