Article Releasing Outlines-core 0.1.0: structured generation in Rust and Python 22 days ago • 39
Article How to build a custom text classifier without days of human labeling By sdiazlor • 26 days ago • 54
Article How to optimize your data labelling project with custom interfaces By burtenshaw • 28 days ago • 18
DSBench: How Far Are Data Science Agents to Becoming Data Science Experts? Paper • 2409.07703 • Published Sep 12 • 66
Article Fine-tuning a token classification model for legal data using Argilla and AutoTrain By bikashpatra • Sep 7 • 14
Let Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Models Paper • 2408.02442 • Published Aug 5 • 21
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment Paper • 2408.06266 • Published Aug 12 • 9
Article 🔥 Argilla 2.0: the data-centric tool for AI makers 🤗 By dvilasuero • Jul 30 • 37
Llama 3.1 GPTQ, AWQ, and BNB Quants Collection Optimised Quants for high-throughput deployments! Compatible with Transformers, TGI & VLLM 🤗 • 9 items • Updated Sep 26 • 54
NuminaMath Collection Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize • 6 items • Updated Jul 21 • 62
Article 🧑⚖️ "Replacing Judges with Juries" using distilabel By alvarobartt • May 3 • 17
NER in Spanish Collection Fine-tuned models to perform NER in Spanish using the framework SpanMarker and different encoders and datasets • 3 items • Updated Sep 2 • 4