150 Going

Building and Optimizing RAG Pipelines: Data Preprocessing, Embeddings, and Evaluation with ZenML

Hosted by ZenML
Virtual
Past Event
About Event

Join us for a deep dive into Retrieval-Augmented Generation (RAG) pipelines and how ZenML can streamline your RAG workflows. In this hands-on workshop, Alex Strick van Linschoten will guide you through the essential components of building and optimizing RAG pipelines.

We'll cover:

  • The fundamentals of RAG: why it exists and which problems it solves in natural language processing.

  • The process of ingesting and preprocessing data for your RAG pipeline, focusing on best practices and techniques to ensure optimal performance.

  • The critical role of embeddings in a RAG retrieval workflow, including how to generate and store these embeddings in a vector database for efficient retrieval of relevant information.

  • How ZenML simplifies the tracking and management of RAG-associated artifacts, ensuring reproducibility and facilitating collaboration.

  • Strategies for assessing the performance of your RAG pipelines and measuring the impact of any modifications you make, along with insights on how to approach RAG evaluation and interpret the results effectively.

  • The use of rerankers to enhance retrieval in your RAG pipeline, with practical examples and guidance on implementing them to improve the relevance and quality of the retrieved information.

This workshop is designed to cater to both newcomers to RAG and those with some experience who want to leverage ZenML to streamline their workflows. We'll adopt a tool-agnostic approach, using plain Python wherever possible to ensure accessibility and flexibility.
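
To give a flavour of the retrieval and evaluation topics listed above, here is a minimal plain-Python sketch. It is not taken from the workshop materials: the `embed` function is a toy bag-of-words stand-in for a real embedding model, the in-memory `index` list stands in for a vector database, and the documents, queries, and the `hit_rate` helper are all hypothetical names chosen for illustration.

```python
import math
from collections import Counter


def embed(text):
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, index, k=2):
    """Return the k documents whose vectors are most similar to the query."""
    q = embed(query)
    ranked = sorted(index, key=lambda item: cosine(q, item[1]), reverse=True)
    return [doc for doc, _ in ranked[:k]]


def hit_rate(eval_set, index, k=2):
    """Fraction of queries whose expected document appears in the top k."""
    hits = sum(expected in retrieve(q, index, k) for q, expected in eval_set)
    return hits / len(eval_set)


# Hypothetical corpus; the list of (document, vector) pairs plays the
# role of a vector database.
docs = [
    "pipelines are built from steps",
    "artifacts are versioned and tracked",
    "embeddings are stored in a vector database",
]
index = [(d, embed(d)) for d in docs]

# Hypothetical evaluation set: (query, document we expect to retrieve).
eval_set = [
    ("where are embeddings stored", docs[2]),
    ("how are artifacts tracked", docs[1]),
]

print(retrieve("where are embeddings stored", index, k=1))
print(hit_rate(eval_set, index, k=1))
```

A fixed evaluation set like this is one simple way to measure whether a change (a different embedding model, a new chunking strategy, an added reranker) actually improves retrieval, since the same metric can be recomputed before and after the change.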

Please note that this workshop is the first part of a two-part series. The second part, which will be hosted at a later date, will focus on fine-tuning embeddings and language models specifically for RAG pipelines.

Speaker: Alex Strick van Linschoten is an ML Engineer at ZenML. Based in Delft, Alex had a previous career as a historian and linguist before retraining in a technical domain. He's interested in the ways small fine-tuned language models can outperform the proprietary options as well as the uses of LLMs (and generative AI in general) for education.

Don't miss this opportunity to elevate your RAG pipeline skills and discover how ZenML can revolutionize your workflow. Join us for this immersive workshop and unlock the full potential of Retrieval-Augmented Generation!
