Cover Image for Unstructured Data Meetup SF

Presented by

meetups for people working in unstructured data. Speakers will come present about related topics such as vector databases, LLMs, and managing data at scale. The intended audience of this group includes roles like machine learning engineers, data scientists, data engineers, software engineers, and PMs. This meetup is sponsored by Zilliz.

Hosted By

Featured in

Generative AI San Francisco and Bay Area

Unstructured Data Meetup SF

Name: Unstructured Data Meetup SF
Start: 2024-03-19T17:30:00.000-07:00
End: 2024-03-19T20:30:00.000-07:00
Location: GitHub

Unstructured Data Meetup

GitHub

San Francisco, California

Registration Closed

This event is not currently taking registrations. You may contact the host or subscribe to receive updates.

About Event

This is an in-person event! Registration is required in order to get in. Github will email you a form the day before the event, which you will need to complete for your access pass.

Topic: Connecting your unstructured data with Generative AI

What we’ll do:
Have some food and refreshments. Hear three exciting talks about unstructured data and generative AI.

5:30 - 6:30 - Welcome/Networking/Registration
7:05 - 7:30 - James Luan, VP Engineering, Zilliz
6:35 - 7:00 - Lance Martin, Software/ML, LangChain
7:35 - 8:00 - AJ Steers, Staff Software Engineer, Airbyte
8:00 - 8:10 - Community demos
8:10 - 8:30 - Networking

Who Should attend:
Anyone interested in talking and learning about Unstructured Data and Generative AI Apps.

When:
March 19, 2023
5:30PM

Tech Talk 1: VectorDB Schema Design 101 - Considerations for Building a Scalable and Performant Vector Search
Speaker: James Luan
Abstract: People often say that vector search is easy, but that's not entirely true. Vector search is more than just vector indexing and a Python wrapper. If you want to build a high-performance, scalable, and production-ready vector search service, you need to consider many factors.

Milvus is a high-performance, highly scalable vector database built for the cloud. Like a traditional database, we have designed a schema system with concepts such as collections, fields, shards, partitions, indexes, and various secondary indexes, to improve the performance of vector search.

In this talk, I will share how to correctly design your database schema structure and use the right data model to enable you to use Milvus with efficiency and scalability. We will also introduce complex use cases in RAG applications, such as multi-tenancy, scalar filtering/scalar indexing, and hybrid searches.

Tech Talk 2: LangGraph for Flow Engineering
Speaker: Lance Martin
Abstract: Recent work on code generation and RAG have used a "flow" paradigm rather than a naive "prompt:answer" paradigm. With a flow paradigm, answers can be iteratively constructed by testing and / or self-reflection to improve the solution. We recently launched LangGraph to support flow engineering, giving the user the ability to represent flows as a graph. We've shown that this can be employed for RAG (Self-RAG, Corrective RAG) as well as code-generation (AlphaCodium).

Tech Talk 3: Text as Data, From Anywhere to Anywhere
Speaker: AJ Steers
Abstract: With great data comes great GenAI opportunities.” We live in a world where not only do we have access to amazing and valuable data, but we now have GenAI solutions that can consume, analyze, and act on that data in ways we could never have foreseen just 2 short years ago. In this talk, we’ll talk about the value of having a uniform way to access all your data, data you may have access to today without even knowing it, and how to send it exactly where it is needed - whether that’s to a data warehouse for analytics or to a vector store for the next great AI app.

Where:
This is an in-person event. Registration is required to get into the event. Registration in advance will close 2 days before the event. Co-sponsored by Zilliz maintainers of Milvus and Airbyte.

Also streaming: on twitch.tv at 6:30pm

Location

GitHub

88 Colin P Kelly Jr St, San Francisco, CA 94107, USA

Presented by

Unstructured Data Meetup

meetups for people working in unstructured data. Speakers will come present about related topics such as vector databases, LLMs, and managing data at scale. The intended audience of this group includes roles like machine learning engineers, data scientists, data engineers, software engineers, and PMs. This meetup is sponsored by Zilliz.

Hosted By