FourthBrain Workshop

Multimodal Models with RAG

A 1-day LIVE workshop for engineers, developers, and data scientists on combining the capabilities of RAG with vision-language models. You'll learn to enrich your applications with multimodal data understanding and retrieval.

  • Date: Coming Soon!
  • Time: 9am-3pm PT 
  • Cost: $500

Generative AI is evolving past strictly textual or strictly visual tasks. Combining multiple types of data can create a much more powerful experience for your users. 

In this workshop you will deepen your understanding of vision-language models and integrate visual and textual information to enhance applications. You'll build an application that combines advanced RAG features with vision-language models to solve complex tasks involving both visual and textual inputs.

As industries adapt to take advantage of AI, businesses around the world are realizing that AI implementation will soon be essential to remain competitive and up-to-date.

Key Outcomes

Understand the architecture, functionality, and applications of multimodal models

Integrate vision-language models with RAG for enhanced multimodal data processing

Use multimodal models to solve complex tasks

Workshop Schedule

This workshop includes two modules. 

  • Introduction to the latest vision-language models, including CLIP, DALL-E, and more
  • Integrating vision-language models with RAG
  • Set up a vision-language model to perform tasks
Download the Syllabus

This workshop is for you if:

You are an engineer, data scientist, or developer who wants to incorporate multimodal capabilities into your projects

You have some experience with LLMs and understand some of the use cases for multimodal applications 

You are fluent in Python, have experience manipulating data and building basic ML models like classifiers, and have deployed an application.

How to Prepare

Create a Google Colab Account

We suggest you work in Google Colab for fine-tuning, so you should have a paid account. 

Familiarize yourself with Hugging Face

This Introduction to Hugging Face Course will teach you about NLP using libraries from the Hugging Face ecosystem. 

Many employers offer reimbursement for programs like ours. Check out our tips for getting reimbursed.

"67% of companies saw revenue increase due to AI adoption"

- McKinsey Tech Trends Outlook 2022

Register Here

Interested in this program for your team? Reach out!

Part of a group? Register as a team here.



About FourthBrain

FourthBrain's mission is to bring more people into the growing fields of Machine Learning and Artificial Intelligence through flexible education programs. We equip leaders with the skills to lead organizations towards AI maturity, and support engineers, developers, and data scientists to make an impact in this field.

About Andrew Ng

Featured Speaker: Dr. Andrew Ng

Dr. Andrew Ng is a globally recognized leader in AI. He is the Founder of DeepLearning.AI, the Founder and CEO of Landing AI, and an Advisor to FourthBrain.


AI-First Mindset For Leaders

This class was designed to give company decision-makers at various levels the AI understanding they need to lead from the front. The curriculum has been crafted to fill knowledge gaps for leaders experiencing any of the following:
  • You're struggling to figure out how AI can impact your organization
  • You want to stay ahead of the curve with AI technology, but aren't sure how to start
  • Your organization is experimenting with AI, but has not seen any real business impact from it
  • Your organization has some valuable models, but they are not fully integrated into daily operations