Centurio: On Drivers of Multilingual Ability of Large Vision-Language Model Paper • 2501.05122 • Published 4 days ago • 15
Centurio Collection Artifacts of the paper "Centurio: On Drivers of Multilingual Ability of Large Vision-Language Model" • 5 items • Updated 3 days ago • 4
PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides Paper • 2501.03936 • Published 5 days ago • 17
view article Article **Fine-tune SmolLM's on custom synthetic data** By prithivMLmods • 7 days ago • 15
Facilitating large language model Russian adaptation with Learned Embedding Propagation Paper • 2412.21140 • Published 13 days ago • 14
Sentence Encoders Collection Collection of models and dataset for sentence encoder task • 4 items • Updated Nov 25, 2024 • 7
rusBeIR-datasets Collection Collection of datasets used in rusBeIR • 37 items • Updated 16 days ago • 4
Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling Paper • 2412.05271 • Published Dec 6, 2024 • 124
PaliGemma 2 Release Collection Vision-Language Models available in multiple 3B, 10B and 28B variants. • 23 items • Updated about 1 month ago • 126
Unsloth 4-bit Dynamic Quants Collection Unsloths Dynamic 4bit Quants selectively skips quantizing certain parameters; greatly improving accuracy while only using <10% more VRAM than BnB 4bit • 8 items • Updated about 3 hours ago • 22
Speculative Decoding Draft Models Collection Collection of OpenVINO optimized efficient draft models for speculative decoding • 2 items • Updated 20 days ago • 7
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. • 7 items • Updated 6 days ago • 33
OpenScholar_V1 Collection The set of models, index, data associated with the paper "OpenScholar: Synthesizing Scientific Literature with Retrieval-Augmented LMs". • 8 items • Updated Nov 22, 2024 • 31