view article Article MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era By MiniMax-AI β’ 19 days ago β’ 40
view article Article Timm β€οΈ Transformers: Use any timm model with transformers 18 days ago β’ 37
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper β’ 2412.13663 β’ Published Dec 18, 2024 β’ 125
PaliGemma 2: A Family of Versatile VLMs for Transfer Paper β’ 2412.03555 β’ Published Dec 4, 2024 β’ 125
Common Models Collection The first generation of models pretrained on Common Corpus. β’ 5 items β’ Updated Dec 5, 2024 β’ 28
MARS: Unleashing the Power of Variance Reduction for Training Large Models Paper β’ 2411.10438 β’ Published Nov 15, 2024 β’ 13
Cautious Optimizers: Improving Training with One Line of Code Paper β’ 2411.16085 β’ Published Nov 25, 2024 β’ 15
view article Article π€ Serve any model with Inference Endpoints + Custom Handlers By alvarobartt β’ Nov 22, 2024 β’ 3
RDNet Collection DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs [ECCV 2024] β’ 9 items β’ Updated Oct 16, 2024 β’ 3
timm tiny test models Collection A collection of very small (~300-500k parameter) models at 160x160 resolution, for testing purposes. Trained on ImageNet-1k. β’ 13 items β’ Updated Oct 2, 2024 β’ 5
view article Article LAVE: Zero-shot VQA Evaluation on Docmatix with LLMs - Do We Still Need Fine-Tuning? Jul 25, 2024 β’ 18
π MINT-1T Collection Data for "MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens" β’ 13 items β’ Updated Jul 24, 2024 β’ 58
Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs Paper β’ 2406.16860 β’ Published Jun 24, 2024 β’ 60
An Image is Worth 32 Tokens for Reconstruction and Generation Paper β’ 2406.07550 β’ Published Jun 11, 2024 β’ 57
MobileCLIP Models + DataCompDR Data Collection MobileCLIP: Mobile-friendly image-text models with SOTA zero-shot capabilities. DataCompDR: Improved datasets for training image-text SOTA models. β’ 22 items β’ Updated Oct 4, 2024 β’ 26