FourthBrain Workshop
Building with MultiModal Models
A 3-week live workshop for engineers on developing applications with the next generation of large foundation models that integrate text, image, audio, and beyond.
- Date: Coming Soon
- Time: 4-7pm PT on Tuesdays and Thursdays
- Cost: $2,000
Key Outcomes
- Master the fundamentals of multimodal training objectives, such as CLIP's contrastive language-image pre-training
- Apply multimodal foundation models (MMFs) to tasks like image classification, text-based image retrieval, and conversational assistance
- Explore emerging research directions in large multimodal models, including generating multimodal output
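One of the outcomes above, text-based image retrieval, reduces to ranking image embeddings by cosine similarity to a text query embedding. The sketch below uses made-up toy vectors purely for illustration; in the workshop these embeddings would come from a real model such as CLIP.

```python
import numpy as np

# Hypothetical embeddings standing in for real CLIP encoder outputs
text_query = np.array([1.0, 0.0, 0.0, 0.1])
image_embs = np.array([
    [0.0, 1.0, 0.0, 0.0],   # image 0: unrelated to the query
    [0.9, 0.1, 0.0, 0.1],   # image 1: close to the query
    [0.0, 0.0, 1.0, 0.0],   # image 2: unrelated to the query
])

def retrieve_top_k(query, images, k=1):
    """Rank images by cosine similarity to a text query embedding."""
    q = query / np.linalg.norm(query)
    ims = images / np.linalg.norm(images, axis=1, keepdims=True)
    sims = ims @ q                  # cosine similarity per image
    return np.argsort(-sims)[:k]   # indices of the most similar images

print(retrieve_top_k(text_query, image_embs))  # → [1]
```

Because CLIP places text and images in a shared embedding space, the same similarity ranking works whether the query is a caption or another image.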
Workshop Schedule
This workshop includes two live sessions a week on Tuesdays and Thursdays, plus additional practice on your own time.
- Exploring multimodal datasets and training objectives
- Understanding CLIP - contrastive language-image pre-training
- Introduction to Flamingo and alternative architectures
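The contrastive language-image pre-training objective covered in the schedule above can be sketched in a few lines of NumPy: a minimal version of CLIP's symmetric cross-entropy (InfoNCE) loss over an image-text similarity matrix. The encoders are omitted; the random vectors below are placeholders for real image and text embeddings.

```python
import numpy as np

def clip_contrastive_loss(image_emb, text_emb, temperature=0.07):
    """Symmetric contrastive loss in the style of CLIP.

    image_emb, text_emb: (N, D) arrays where row i of each is a matched pair.
    """
    # L2-normalize so dot products are cosine similarities
    image_emb = image_emb / np.linalg.norm(image_emb, axis=1, keepdims=True)
    text_emb = text_emb / np.linalg.norm(text_emb, axis=1, keepdims=True)

    # (N, N) similarity matrix, scaled by temperature
    logits = image_emb @ text_emb.T / temperature

    def cross_entropy(logits):
        # The matched pair sits on the diagonal of the logits matrix
        logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
        log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
        return -np.mean(np.diag(log_probs))

    # Average of image-to-text and text-to-image directions
    return 0.5 * (cross_entropy(logits) + cross_entropy(logits.T))

rng = np.random.default_rng(0)
images = rng.normal(size=(4, 8))
texts = images + 0.01 * rng.normal(size=(4, 8))  # nearly matched pairs
print(clip_contrastive_loss(images, texts))      # small loss for matched pairs
```

Training pushes matched image-text pairs together on the diagonal while pushing mismatched pairs apart, which is what makes zero-shot classification and retrieval possible downstream.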
This workshop is for you if:
- You are an engineer, data scientist, or developer who wants to incorporate multimodal capabilities into your projects
- You have a solid understanding of machine learning and deep learning concepts
- You are fluent in Python and have some experience with NLP and computer vision
How to Prepare
Create a Google Colab Account
We recommend working in Google Colab for fine-tuning, so you should have a paid Colab account for access to GPU runtimes.
Familiarize yourself with Hugging Face
This Introduction to Hugging Face course will teach you about NLP using libraries from the Hugging Face ecosystem.
Many employers offer reimbursement for programs like ours. Check out our tips for getting reimbursed.
Register as a group here.