One AI to Rule Them All: How to Build a Large Multimodal Model (LMM) that Handles Text, Images, and Music - Data Hub Tech Talk
Note: the session will be recorded for registrants who cannot attend at the scheduled time.
Modern transformer architectures are adept at understanding not only text but also images, video, speech, music, and much more. We will discuss how these multimodal models work under the hood, then build and train a simple model that uses multiple input encoders and a shared semantic embedding space. Our model will learn to jointly interpret text, images, and music.
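As a rough illustration of the "multiple input encoders, shared embedding space" idea, here is a minimal sketch. It is not the presenter's model: the feature sizes, the `EMBED_DIM` value, and the helper names are assumptions made for this example, and the encoders are untrained random linear projections. Each modality is mapped into a vector of the same size so that cross-modal similarity can be measured directly.

```python
import math
import random

EMBED_DIM = 8  # assumed size of the shared embedding space


def make_projection(in_dim, out_dim, rng):
    """Random linear map standing in for a trained modality encoder."""
    scale = 1.0 / math.sqrt(in_dim)
    return [[rng.gauss(0.0, scale) for _ in range(in_dim)] for _ in range(out_dim)]


def encode(features, projection):
    """Project modality-specific features into the shared space, then L2-normalize."""
    vec = [sum(w * x for w, x in zip(row, features)) for row in projection]
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


def cosine(a, b):
    """For unit-length vectors, cosine similarity is just the dot product."""
    return sum(x * y for x, y in zip(a, b))


rng = random.Random(0)

# Hypothetical raw feature sizes for each modality.
input_dims = {"text": 16, "image": 32, "music": 24}

# One encoder per modality, all targeting the same EMBED_DIM.
encoders = {name: make_projection(dim, EMBED_DIM, rng) for name, dim in input_dims.items()}

# Dummy feature vectors standing in for real tokenized text, pixels, or audio.
inputs = {name: [rng.random() for _ in range(dim)] for name, dim in input_dims.items()}
embeddings = {name: encode(inputs[name], encoders[name]) for name in inputs}

for a, b in [("text", "image"), ("text", "music"), ("image", "music")]:
    print(f"{a} vs {b}: {cosine(embeddings[a], embeddings[b]):+.3f}")
```

Because the projections here are random, the printed similarities carry no meaning. In a trained model, the encoders would be optimized (for example with a contrastive objective, which is an assumption here rather than something stated in the abstract) so that matching text, image, and music inputs land close together in the shared space.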
- Date: Friday, December 6, 2024
- Time: 12:30pm - 1:30pm
- Location: Data Hub presentation / collaboration space
- Campus: University of Idaho - Moscow campus
- Presenter: Dr. Lucas Sheneman, Director - Research Computing and Data Services
- Categories: Data Hub Library Workshop
Registration has closed.