Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published 6 days ago • 103
The Case for Co-Designing Model Architectures with Hardware Paper • 2401.14489 • Published Jan 25 • 3
FashionComposer: Compositional Fashion Image Generation Paper • 2412.14168 • Published 6 days ago • 16
LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment Paper • 2412.04814 • Published 18 days ago • 45
LiFT-HRA Collection LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment • 2 items • Updated 8 days ago • 1
LiFT-Critic Collection LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment • 5 items • Updated 2 days ago • 3
Falcon3 Collection Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. • 40 items • Updated 5 days ago • 70
My Dataset Spaces Collection Dataset generation and transformation • 8 items • Updated 12 days ago • 1
STIV: Scalable Text and Image Conditioned Video Generation Paper • 2412.07730 • Published 14 days ago • 69
LAION-SG: An Enhanced Large-Scale Dataset for Training Complex Image-Text Models with Structural Annotations Paper • 2412.08580 • Published 13 days ago • 44
SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints Paper • 2412.07760 • Published 14 days ago • 49
EXAONE-3.5 Collection EXAONE 3.5 language model series including instruction-tuned models of 2.4B, 7.8B, and 32B. • 10 items • Updated 15 days ago • 79
Positions Datasets Collection These are chess datasets where each row is a chess position • 3 items • Updated Nov 2 • 6
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated 27 days ago • 442
Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 • 40 items • Updated 27 days ago • 257
Releasing the largest multilingual open pretraining dataset Article • By Pclanglais • Nov 13 • 98