[ICLR'24] Guiding Instruction-based Image Editing via Multimodal Large Language Models

This repo contains the LLaVA-7B checkpoint and the pre-trained MGIE checkpoint (trained on IPr2Pr + MagicBrush) for MGIE.

Please follow the official repo and its ipynb notebook to use them.
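As a starting point, the checkpoints can be fetched from the Hub before running the notebook. The following is a minimal sketch, not the authors' procedure: it assumes the `huggingface_hub` package is installed, takes the repo id `tsujuifu/ml-mgie` from this model's Hub page, and uses a hypothetical target folder `./_ckpt`. The folder layout the notebook expects may differ, so check the official repo before relying on it.

```python
from pathlib import Path

# Repo id as shown on this model's Hub page.
REPO_ID = "tsujuifu/ml-mgie"


def fetch_checkpoints(local_dir: str = "./_ckpt") -> Path:
    """Download every file in the model repo and return the local folder.

    `local_dir` is a hypothetical default chosen for illustration; adjust it
    to whatever path the official notebook expects.
    """
    # Imported lazily so this module loads even without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    path = snapshot_download(repo_id=REPO_ID, local_dir=local_dir)
    return Path(path)


if __name__ == "__main__":
    # Requires network access; the download includes large checkpoint files.
    print(fetch_checkpoints())
```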

@inproceedings{fu2024mgie,
  author = {Tsu-Jui Fu and Wenze Hu and Xianzhi Du and William Yang Wang and Yinfei Yang and Zhe Gan},
  title = {{Guiding Instruction-based Image Editing via Multimodal Large Language Models}}, 
  booktitle = {International Conference on Learning Representations (ICLR)}, 
  year = {2024} 
}