Understanding Hand-Drawn Images

Learn how our pictionary bot understands hand-drawn images and evaluates them using the image-to-text models in Gemini, and how images can be sent as prompts to Google Gemini.

Behind the scenes

Multimodal models such as Gemini can work with images as input. This enables them to analyze an image and generate a textual description of its content. Here’s a brief overview of how most image captioning models work:

  • The image is first processed into a format that is easily digestible for the model.

  • A CNN (Convolutional Neural Network) encoder, which specializes in processing images and video, analyzes the image and extracts features like edges, objects, and their spatial relationships. Acting like a data summarizer, it transforms the raw visuals into a compact representation that captures the essence of the image’s content.

  • The decoder, which works like a translator, receives the encoded image representation from the CNN and turns that condensed information back into language. It generates the caption one word at a time; at each step, it considers the words it has generated so far, the encoded image representation, and its internal knowledge of language.

  • The decoder predicts the next word in the caption based on this accumulated information. The process continues until a stopping sequence is generated or a maximum length is reached (a simplified code sketch of this loop follows the list).
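To make the encoder-decoder loop above concrete, here is a minimal, simplified sketch in Python. The specific choices here, a ResNet-18 encoder, a GRU decoder, greedy word selection, and the vocabulary and token IDs, are illustrative assumptions for teaching purposes only; they are not the architecture Gemini actually uses.

```python
# A minimal sketch of the CNN encoder + decoder captioning loop described above.
# ResNet-18, the GRU decoder, and greedy decoding are illustrative assumptions.
import torch
import torch.nn as nn
from torchvision import models


class CaptionModel(nn.Module):
    def __init__(self, vocab_size, embed_dim=256, hidden_dim=512):
        super().__init__()
        # CNN encoder: compresses the image into a feature vector.
        cnn = models.resnet18(weights=None)
        self.encoder = nn.Sequential(*list(cnn.children())[:-1])  # drop the classifier head
        self.project = nn.Linear(512, hidden_dim)
        # Decoder: generates the caption one token at a time.
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.gru = nn.GRU(embed_dim, hidden_dim, batch_first=True)
        self.out = nn.Linear(hidden_dim, vocab_size)

    def generate(self, image, start_id, end_id, max_len=20):
        # Encode the image once into a compact representation.
        feats = self.encoder(image).flatten(1)          # (1, 512)
        hidden = self.project(feats).unsqueeze(0)       # initial decoder state
        token = torch.tensor([[start_id]])
        caption = []
        for _ in range(max_len):
            emb = self.embed(token)                      # embed the previous word
            output, hidden = self.gru(emb, hidden)       # combine it with the image state
            token = self.out(output).argmax(-1)          # greedily pick the next word
            if token.item() == end_id:                   # stop when the end token appears
                break
            caption.append(token.item())
        return caption
```

The loop mirrors the steps listed above: the image is encoded once, and the decoder repeatedly predicts the next word from the previous words and the encoded image until a stop token or the maximum length is reached.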

Prompts with images

Earlier, in the “Multimodal Prompting with Google Gemini” lesson, we used the generate_content() method to send an image as a prompt. Here’s a quick refresher on how to include an image in a prompt; the playground below is set up to let you upload an image and try it yourself.
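For reference, a minimal sketch of such a call with the google-generativeai Python SDK might look like the following. The API key placeholder, the model name (gemini-1.5-flash), the image file name, and the prompt text are all assumptions you would replace with your own values:

```python
import google.generativeai as genai
from PIL import Image

# Configure the client; "GEMINI_API_KEY" is a placeholder for your own key.
genai.configure(api_key="GEMINI_API_KEY")

# The model name and image file are illustrative choices.
model = genai.GenerativeModel("gemini-1.5-flash")
drawing = Image.open("sketch.png")

# generate_content() accepts a list that mixes text and images in one prompt.
response = model.generate_content(
    ["What object does this hand-drawn sketch depict? Answer in one word.", drawing]
)
print(response.text)
```

Passing the text instruction and the image together in a single list is what lets the model ground its answer in the drawing rather than in the text alone.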
