MiniGPT-v2 Demo

0.1 1.5

For Abilities Involving Visual Grounding:

  1. Grounding: CLICK Send to generate a grounded image description.
  2. Refer: Input a referring object and CLICK Send.
  3. Detection: Write a caption or phrase, and CLICK Send.
  4. Identify: Draw the bounding box on the uploaded image window and CLICK Send to generate the bounding box. (CLICK "clear" button before re-drawing next time).
  5. VQA: Input a visual question and CLICK Send.
  6. No Tag: Input whatever you want and CLICK Send without any tagging

You can also simply chat in free form!

Task Shortcuts

Hint: Upload your image and chat

Examples
Examples