Cover Image for How to Build Multimodal Document RAG with Llama 3.2 Vision and ColQwen2
Cover Image for How to Build Multimodal Document RAG with Llama 3.2 Vision and ColQwen2
587 Went

How to Build Multimodal Document RAG with Llama 3.2 Vision and ColQwen2

Hosted by Together AI & Zain Hasan
Virtual
Registration Closed
This event is not currently taking registrations. You may contact the host or subscribe to receive updates.
About Event

In this event we'll discuss how you can perform RAG over complex PDF documents that contain images, graphs, tables text charts and more! We'll describe in detail how:

  • The new image retriever ColPali works

  • How you can finetune ColPali to improve further for your usecase

  • How to leverage multi-vector retrieval to retrieve from PDFs

  • How to use language vision models like the new Llama 3.2 vision series to perform document RAG

587 Went