view article Article Fine-tune ModernBERT for RAG with Synthetic Data By sdiazlor and 2 others • Jan 20 • 36
view article Article Fine-tune a SmolLM on domain-specific synthetic data from a LLM By davidberenstein1957 • Jan 3 • 35
Running 1.79k 1.79k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
Running 531 531 Open Source Ai Year In Review 2024 😻 What happened in open-source AI this year, and what’s next?
view article Article Introducing the Synthetic Data Generator - Build Datasets with Natural Language Dec 16, 2024 • 109
Running on CPU Upgrade 12.6k 12.6k Open LLM Leaderboard 🏆 Track, rank and evaluate open LLMs and chatbots
Running on CPU Upgrade 37 37 FineWeb-c - Annotation 🌐 Launch Argilla for data labeling and annotation
data-is-better-together/open-image-preferences-v1-binarized Viewer • Updated Dec 9, 2024 • 7.46k • 401 • 44