Google: Gemini Pro Vision 1.0

google/gemini-pro-vision

About Google: Gemini Pro Vision 1.0

Google's flagship multimodal model, supporting image and video in text or chat prompts for a text or code response.

See the benchmarks and prompting guidelines from Deepmind.

Usage of Gemini is subject to Google's Gemini Terms of Use.

#multimodal

Specifications

Context Length

16,384

Tokenizer

Gemini

Pricing

Prompt

0.500

Completion

1.500

Image

0.0025

Request

0

Last updated: 4/11/2025