Visual Referring Expression Demo

Upload an image and input description text, the system will return the thinking process and region annotation

Examples
Input Image Description Text