How to Build Multimodal Document RAG with Llama 3.2 Vision and ColQwen2

Name: How to Build Multimodal Document RAG with Llama 3.2 Vision and ColQwen2
Start: 2024-10-10T09:00:00.000-07:00
End: 2024-10-10T10:00:00.000-07:00
Location: Online Event

Hosted by Together AI & Zain Hasan

Virtual

Registration Closed

This event is not currently taking registrations. You may contact the host or subscribe to receive updates.

About Event

In this event we'll discuss how you can perform RAG over complex PDF documents that contain images, graphs, tables text charts and more! We'll describe in detail how:

The new image retriever ColPali works
How you can finetune ColPali to improve further for your usecase
How to leverage multi-vector retrieval to retrieve from PDFs
How to use language vision models like the new Llama 3.2 vision series to perform document RAG

Hosted By

587 Went