What the AI quack!

Google launched its Gemini AI model earlier today to compete with OpenAI’s popular GPT-4. While Google has a number of videos demonstrating Gemini’s capabilities, the one below stood out to me.

This multimodal AI model can reason across images, audio, video, code, and, of course, text. Start drawing, and Gemini will understand that you’re drawing a duck; set some cups down on a table with a paper ball, and Gemini reasons that you want to play a game. If Gemini can understand my poor attempts at doodling, then I’ll be the one shouting, “What the quack!”