Multimodal AI
In this episode Lan and Ryan show Martin a multimodal AI application that solves a real business problem. “Multimodal” means that the application processes video, audio, and text to create output. Lan and Ryan demo the finished application and also dig into the code to show how to call Google’s Vertex AI.
Chapters:
0:00 Intro
1:01 The business problem
2:04 Demo
2:45 Code walkthrough
5:15 Takeaways
6:07 Wrap-up
Resources:
Source code repo→ https://goo.gle/4c7PQWb
Watch more Serverless Expeditions → https://goo.gle/ServerlessExpeditions
Subscribe to Google Cloud Tech → https://goo.gle/GoogleCloudTech
#serverless #ai #vertexai #cloudfunctions
Speakers:
Lan Tran, Customer Solution Engineer
Ryan Sibbaluca, Customer Solution Engineer
Martin Omander, Developer Advocate
Products Mentioned: Vertex AI, Cloud Functions
Google Cloud Tech
Helping you build what's next with secure infrastructure, developer tools, APIs, data analytics and machine learning....