Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model Paper • 2408.11039 • Published about 1 month ago • 54
Llama-3.1 Quantization Collection Neural Magic quantized Llama-3.1 models • 21 items • Updated 8 days ago • 32
Article ColPali: Efficient Document Retrieval with Vision Language Models 👀 By manu • Jul 5 • 85
Article A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes Aug 17, 2022 • 56
Article From cloud to developers: Hugging Face and Microsoft Deepen Collaboration May 21 • 8
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma • 16 items • Updated Jul 31 • 133
SigLIP Collection Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 • 8 items • Updated Jul 31 • 32
Multilingual Transformer Decoders Collection That definitely have experienced Vietnamese • 10 items • Updated Apr 29 • 1