Argilla 2.4: Easily Build Fine-Tuning and Evaluation datasets on the Hub — No Code Required Nov 4, 2024 • 41
view article Article Crowd-sourced Open Preference Dataset for Text-to-Image Generation By RapidataAI • 5 days ago • 17
Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models Paper • 2412.09645 • Published Dec 10, 2024 • 35
RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within Generation Paper • 2412.11919 • Published 27 days ago • 33
Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation Paper • 2412.03304 • Published Dec 4, 2024 • 17
view article Article 🐺🐦⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs By wolfram • Dec 4, 2024 • 76
view article Article Use Models from the Hugging Face Hub in LM Studio By yagilb • Nov 28, 2024 • 130
view article Article To what extent are we responsible for our content and how to create safer Spaces? By davidberenstein1957 • Aug 30, 2024 • 3
view article Article Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK By davidberenstein1957 • Nov 21, 2024 • 35
view article Article How to optimize your data labelling project with custom interfaces By burtenshaw • Oct 16, 2024 • 18
view article Article How to build a custom text classifier without days of human labeling By sdiazlor • Oct 17, 2024 • 55
view article Article Model2Vec: Distill a Small Fast Model from any Sentence Transformer By Pringled • Oct 14, 2024 • 61
Direct Preference Optimization: Your Language Model is Secretly a Reward Model Paper • 2305.18290 • Published May 29, 2023 • 50
view article Article Selective fine-tuning of Language Models with Spectrum By anakin87 • Sep 3, 2024 • 30
Minitron Collection A family of compressed models obtained via pruning and knowledge distillation • 12 items • Updated 1 day ago • 60
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Paper • 2404.14219 • Published Apr 22, 2024 • 254
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models Paper • 2408.08872 • Published Aug 16, 2024 • 98
view article Article Welcome FalconMamba: The first strong attention-free 7B model Aug 12, 2024 • 108