OpenGVLab/InternVid
Viewer
•
Updated
•
21.3M
•
244
•
70
A unified multimodal understanding and generation model.
Next-generation reasoning model that runs locally in-browser
In-browser unified multimodal understanding and generation.
Ask questions about images or generate images from text
Generate detailed image edits and inpainting using prompts