Looking forward to the GGUF version of this model

#1
by jian2023 - opened

Maybe it could then be used in Ollama.

I'm currently using the v2 GGUF; this model is a significant improvement over it, so a GGUF of this version would be greatly appreciated.
The strong OCR abilities of this model would be very helpful for making documents accessible.
This model is excellent and all your kind efforts and contributions to the community are greatly appreciated.

OpenBMB org

The GGUF version will be released soon 😃

OpenBMB org

Thank you all for the valuable feedback! We really appreciate it. We are working on GGUF and Ollama support for MiniCPM-Llama3-V 2.5, which will be available soon. We hope the model can help the community and people in need.

OpenBMB org
edited May 23

MiniCPM-Llama3-V 2.5 can now run with llama.cpp! See our fork of llama.cpp for more details.

And here is our model in GGUF format:
https://huggingface.co./openbmb/MiniCPM-Llama3-V-2_5-gguf
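
For reference, a minimal sketch of trying these two files from Python with llama-cpp-python's LLaVA-style chat handler. This path is an assumption, not the officially supported route (that is the fork above): the handler choice, file names, and prompt below are illustrative, and MiniCPM-Llama3-V 2.5 may need the fork's projector support to produce correct results.

```python
# Sketch: load the quantized LLM plus the vision projector with llama-cpp-python.
# Assumes a llama-cpp-python build whose LLaVA-style handler accepts these files;
# the officially supported route is the OpenBMB llama.cpp fork.
import base64

from llama_cpp import Llama
from llama_cpp.llama_chat_format import Llava15ChatHandler

def image_to_data_uri(path: str) -> str:
    """Encode a local image as a data URI, the format the chat handler expects."""
    with open(path, "rb") as f:
        return "data:image/png;base64," + base64.b64encode(f.read()).decode()

chat_handler = Llava15ChatHandler(clip_model_path="mmproj-model-f16.gguf")
llm = Llama(
    model_path="ggml-model-Q4_K_M.gguf",
    chat_handler=chat_handler,
    n_ctx=2048,  # leave room for the image embedding tokens
)

response = llm.create_chat_completion(
    messages=[{
        "role": "user",
        "content": [
            {"type": "image_url", "image_url": {"url": image_to_data_uri("page.png")}},
            {"type": "text", "text": "Transcribe all text in this image."},
        ],
    }]
)
print(response["choices"][0]["message"]["content"])
```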
@ibrahimkettaneh

Thanks @Cuiunbo! What are the VRAM requirements for each of `ggml-model-Q4_K_M.gguf` and `mmproj-model-f16.gguf`?

OpenBMB org

For memory consumption, mmproj-model-f16.gguf takes a bit over 1 GB and ggml-model-Q4_K_M.gguf about 5 GB.
But we optimized the pipeline to run the ViT first and then free it, so overall consumption peaks at only about 5 GB.
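
To make that concrete, here is an illustrative sketch (all function names hypothetical, not the actual implementation) of the staged execution: because the ~1 GB vision encoder is freed before the ~5 GB language model is loaded, peak residency is roughly the maximum of the two stages rather than their sum.

```python
# Illustrative only: hypothetical loaders showing why peak memory stays ~5 GB.
def staged_inference(image, prompt, load_vit, load_llm):
    vit = load_vit()            # stage 1: ~1 GB vision encoder resident
    image_embeds = vit(image)   # encode the image once, keep only the embeddings
    del vit                     # free the encoder before the LLM is loaded
    llm = load_llm()            # stage 2: ~5 GB LLM resident
    # Peak = max(stage 1, stage 2) ~ 5 GB, not 1 GB + 5 GB ~ 6 GB.
    return llm(prompt, image_embeds)
```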
@xzintsux

OpenBMB org

We now support Ollama! Please visit our GitHub for more usage info: https://github.com/OpenBMB/MiniCPM-V
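
As a quick illustration, a running Ollama server can be queried over its local REST API. The model tag below is a placeholder, not an official name; use whatever tag you pull or create following the GitHub instructions.

```python
# Sketch: send an image question to a local Ollama server via /api/chat.
# The model tag is an assumption; substitute the tag from the MiniCPM-V setup.
import base64
import json
import urllib.request

with open("page.png", "rb") as f:
    image_b64 = base64.b64encode(f.read()).decode()

payload = {
    "model": "minicpm-llama3-v2.5",  # placeholder tag
    "messages": [{
        "role": "user",
        "content": "Transcribe all text in this image.",
        "images": [image_b64],  # Ollama accepts base64-encoded images here
    }],
    "stream": False,
}
req = urllib.request.Request(
    "http://localhost:11434/api/chat",
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    print(json.loads(resp.read())["message"]["content"])
```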
