Please create a quantized version, preferably using bitsandbytes!
Really like the model, but I'd like to use it with bitsandbytes...
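For context, this is roughly what I have in mind — an untested sketch that loads the checkpoint with on-the-fly 4-bit bitsandbytes quantization through transformers (assuming it loads through the Qwen2-VL classes, as the base model does; not an official recipe):

```python
# Untested sketch: load allenai/olmOCR-7B-0225-preview with on-the-fly 4-bit
# bitsandbytes quantization via transformers.
import torch
from transformers import AutoProcessor, BitsAndBytesConfig, Qwen2VLForConditionalGeneration

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                      # 4-bit NF4 weights
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,  # compute in bf16
)

model = Qwen2VLForConditionalGeneration.from_pretrained(
    "allenai/olmOCR-7B-0225-preview",
    quantization_config=bnb_config,
    device_map="auto",
)
processor = AutoProcessor.from_pretrained("allenai/olmOCR-7B-0225-preview")
```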
Hey @ctranslate2-4you, check this out: https://huggingface.co./allenai/olmOCR-7B-0225-preview-GGUF
How do we run inference on an image + prompt pair using the GGUF?
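The rough shape I'd expect is something like the sketch below, but this is purely a guess on my side: it assumes the GGUF repo also ships a vision projector (mmproj) file and uses llama.cpp's multimodal CLI; the binary name, flags, and file names may differ by build.

```python
# Untested sketch: drive llama.cpp's multimodal CLI from Python for one image + prompt.
# Assumptions: the binary name (newer builds ship llama-mtmd-cli, older ones a
# model-specific CLI such as llama-qwen2vl-cli), the GGUF/mmproj file names, and
# whether the linked repo actually includes an mmproj file.
import subprocess

result = subprocess.run(
    [
        "llama-mtmd-cli",
        "-m", "olmOCR-7B-0225-preview-Q4_K_M.gguf",        # placeholder file name
        "--mmproj", "mmproj-olmOCR-7B-0225-preview.gguf",  # placeholder file name
        "--image", "page_1.png",
        "-p", "Return the plain text of this page.",
    ],
    capture_output=True,
    text=True,
)
print(result.stdout)
```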
Nice, but I'd prefer to use BNB for now. Do you plan on making one? Otherwise, I'd have to pull in a bunch of llama.cpp dependencies just for this.
No, not at the moment. If you do make one, could you push it to HF?
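If you go the transformers + bitsandbytes route, pushing it would look roughly like the sketch below (the repo id is a placeholder; serializing 4-bit weights needs reasonably recent transformers and bitsandbytes):

```python
# Placeholder sketch of quantize-and-push; the target repo id below is made up.
from transformers import AutoProcessor, BitsAndBytesConfig, Qwen2VLForConditionalGeneration

model = Qwen2VLForConditionalGeneration.from_pretrained(
    "allenai/olmOCR-7B-0225-preview",
    quantization_config=BitsAndBytesConfig(load_in_4bit=True),
    device_map="auto",
)
processor = AutoProcessor.from_pretrained("allenai/olmOCR-7B-0225-preview")

# Upload the quantized weights and the processor config to the Hub.
model.push_to_hub("your-username/olmOCR-7B-0225-preview-bnb-4bit")
processor.push_to_hub("your-username/olmOCR-7B-0225-preview-bnb-4bit")
```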