Hosted by Unstructured Data Meetup

Unstructured Data Meetup Princeton / Startup Grind Princeton

About Event

This is an in-person event! Registration is required to get in.

Topic: Connecting your unstructured data with Generative LLMs

What we’ll do:
Have some food and refreshments. Hear exciting talks about unstructured data, vector databases, and generative AI.

6:30 - 6:45 - Welcome/Networking/Registration
6:45 - 7:00 - Tim Spann, Principal DevRel, Zilliz
7:00 - 8:00 - Naren, Unstract

Intro Talk on RAG 101

Tech talk 2: Unstructured Document Data Extraction at Scale with LLMs: Challenges and Solutions

Unstructured documents present a significant challenge for businesses, particularly those managing them at scale. Traditional Intelligent Document Processing (IDP) systems—let's call them IDP 1.0—rely heavily on machine learning and NLP techniques. These systems require extensive manual annotation, making them time-consuming and less effective as document complexity and variability increase.

The advent of Large Language Models (LLMs) is ushering in a new era: IDP 2.0. However, while LLMs offer significant advancements, they also come with their own set of challenges, particularly around accuracy and cost, which can become prohibitive at scale. In this talk, we will look at how Unstract, an open source IDP 2.0 platform purpose-built for structured document data extraction, solves these challenges. Processing over 5 million pages of unstructured documents per month, Unstract uses various techniques to extract structured data accurately and cost-efficiently, chief among them the use of vector databases.

Naren H - Co-founder/COO, Unstract

Naren H is a co-founder of Unstract, an open source startup building an LLM-powered platform that extracts data from unstructured documents, helping automate critical business processes. Before Unstract, Naren founded Mediavak, a digital marketing agency, and co-founded Social Animal and Tweeple Search, building tools that made social media analytics and content marketing a breeze. He holds a Master’s in Computer Science from the State University of New York at Buffalo. He has a knack for turning data chaos into order; occasionally, he even manages to keep his emails under control.

Speaker LinkedIn Profile: https://www.linkedin.com/in/naren87/

Who should attend:
Anyone interested in talking and learning about Unstructured Data and Generative AI Apps.

When:
October 24, 2024
6:30 PM

Where:
This is an in-person event! Registration is required to get in. Registration will close 2 days before the event. Sponsored by Zilliz, the maintainers of Milvus.

See more information at Startup Grind Princeton.

Supercharging Startups with Unstructured Data, Vector Databases, and AI

Oct 24, 6:30 – 8:00 PM

Princeton

Startup Grind Princeton, 23 Orchard Road, Montgomery, 08558

We host AI experts Narendran Hariparanthaman and Tim Spann as we continue our AI series.

Location
23 Orchard Rd
Skillman, NJ 08558, USA
Front Door