Leveraging the Gemini Pro Vision model for image understanding, multimodal prompts and accessibility

  1. Which of the following Gemini models supports image prompting?

  2. What does the error “[400 Bad Request] Image input modality is not enabled for models/gemini-pro” suggest is going wrong when calling the Gemini API?

  3. The Gemini model can solve basic geometric or logic based problems based on images: