API_KEY

FILE_NAME

This is a beginner-friendly course where you’ll learn Gemini fundamentals, use Gemini effectively, and build your first Gemini applications.

jupyter.tar.gz

python run

jupyter

streamlit

python run-copy

game_demo

doNothing

doNothing-streamlit

doNothing-copy

This course unlocks the power of Google Gemini, Google’s best generative AI model yet. It helps you dive deep into this powerful language model’s capabilities, exploring its text-to-text, image-to-text, text-to-code, and speech-to-text capabilities. 

The course starts with an introduction to language models and how unimodal and multimodal models work. It covers how Gemini can be set up via the API and how Gemini chat works, presenting some important prompting techniques. Next, you’ll learn how different Gemini capabilities can be leveraged in a fun and interactive real-world pictionary application. Finally, you’ll explore the tools provided by Google’s Vertex AI studio for utilizing Gemini and other machine learning models and enhance the Pictionary application using speech-to-text features.

This course is perfect for developers, data scientists, and anyone eager to explore Google Gemini’s transformative potential.

Getting Started with Google Gemini

Learn about the Files API and how it can be used to send audio and videos in prompts.

Audio/Video-to-Text Generation with Gemini

Getting Started

Capabilities of Gemini

Gemini on Vertex AI

Assess Your Knowledge

Conclusion

Audio/Video-to-Text Generation with Gemini

The Files API