/google-gemini-cookbook

Examples and guides for using the Gemini API.

Primary LanguageJupyter NotebookApache License 2.0Apache-2.0

Welcome to the Gemini API Cookbook

This is a collection of guides and examples for the Gemini API, including quickstart tutorials for writing prompts and using different features of the API, and examples of things you can build.

Get started with the Gemini API

The Gemini API gives you access to Gemini models created by Google DeepMind. Gemini models are built from the ground up to be multimodal, so you can reason seamlessly across text, images, code, and audio. You can use these to develop a range of applications.

Start developing

  1. Go to Google AI Studio.
  2. Login with your Google account.
  3. Create an API key.
  4. Use a quickstart for Python, or call the REST API using curl.

Table of contents

Learn about the capabilities of the Gemini API by checking out these quickstart tutorials.

  • Authentication: Start here to learn how you can set up your API key so you can get access to the Gemini API.
  • Counting Tokens Tokens are the basic inputs to the Gemini models. Through this notebook, you will gain a better understanding of tokens through an interactive experience.
  • Files: Use the Gemini API to upload files (text, code, images, audio, video) and write prompts using them.
  • Audio: Learn how to use the Gemini API with audio files.
  • JSON mode: Discover how to use JSON mode.
  • Function Calling: The Gemini API works great with code. Use this quickstart to learn how to write prompts to understand and call functions. Then check out the function calling config tutorial to learn more.
  • System Instructions: Give models additional context on how to respond by setting system instructions.
  • Embeddings: Create high quality and task-specific embeddings.
  • Tuning: Learn how to improve model performance on a specific task through tuning.

You can find lots more in the quickstarts folder, and check out the examples folder for fun examples. We're also maintaining an Awesome Gemini list of all the cool projects the community is building using Gemini.

Official SDKs

The Gemini API is a REST API. You can call the API using a command line tool like curl (and you can find REST examples here) , or by using one of our official SDKs:

Get help

Ask a question on the Google AI Developer Forum.

Contributing

Contributions are welcome. See contributing to learn more.

Thank you for developing with the Gemini API! We’re excited to see what you create.