Cover Image for [ONLINE] Open NLP Meetup #15
Cover Image for [ONLINE] Open NLP Meetup #15
Avatar for Haystack
Presented by
Haystack
View and subscribe to events by Haystack. Join us to discuss NLP, open-source tools, generative AI and more.
2 Going

[ONLINE] Open NLP Meetup #15

Zoom
Registration
Welcome! To join the event, please register below.
About Event

🌍 Join us for an exciting hybrid event, live from deepset HQ in Berlin and accessible from anywhere in the world!

Discover the latest open-source tools and techniques to supercharge your AI journey. This meetup features two insightful talks, followed by a live Q&A session where you can connect directly with our expert speakers.

​For the in-person event, check out here.

🎤 Talks

Optimizing Web Data Extraction for NLP and LLMs with Trafilatura by Adrien Barbaresi

As the demand for Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG) continues to grow, the need for better training data becomes increasingly critical. This talk introduces Trafilatura, a powerful open-source Python package and command-line tool that streamlines text discovery and extraction, from web crawling to robust and configurable extraction.
We will discuss how Trafilatura tackles common data quality issues, such as noise or missing metadata, and highlight its key features, including deduplication, content selection and multiple output formats. We will also explore Trafilatura's seamless integration with Haystack and explain how to make the most of existing parameters. Join us to discover how to transform raw HTML into meaningful data to improve model training and fine-tuning.

Deploying an LLM Application with Hayhooks and OpenWeb UI by deepset Team

Deploying LLM applications doesn’t have to be complex. In this talk, we’ll walk through an end-to-end demo of deployment using Hayhooks, an open-source project for deploying Haystack pipelines. We’ll spotlight OpenWeb UI, an intuitive and customizable interface tailored to LLM applications, and discuss its pivotal role in enhancing user experience. Whether you’re a developer, researcher, or AI enthusiast, you’ll learn the practical tools to go from concept to deployment with ease. As a bonus, we’ll cover how concurrent requests and streaming responses can be handled with this approach

Avatar for Haystack
Presented by
Haystack
View and subscribe to events by Haystack. Join us to discuss NLP, open-source tools, generative AI and more.
2 Going