
ben burtenshaw

burtenshaw

AI & ML interests

None yet

Recent Activity

reacted to albertvillanova's post with 🔥 5 days ago

Discover all the improvements in the new version of Lighteval: https://huggingface.co/docs/lighteval/

reacted to m-ric's post with 🤗 5 days ago

Since I published it on GitHub a few days ago, Hugging Face's new agentic library 𝘀𝗺𝗼𝗹𝗮𝗴𝗲𝗻𝘁𝘀 has gathered nearly 4k stars 🤯 ➡️ But we are just getting started on agents: so we are hiring an ML Engineer to join me and double down on this effort! The plan is to build GUI agents: agents that can act on your computer with mouse & keyboard, like Claude Computer Use. We will make it work better, and fully open. ✨ Sounds like something you'd like to do? Apply here 👉 https://apply.workable.com/huggingface/j/AF1D4E3FEB/

reacted to m-ric's post with 🚀 5 days ago


Articles

Argilla 2.4: Easily Build Fine-Tuning and Evaluation datasets on the Hub — No Code Required

Nov 4, 2024 • 41

How to build a custom text classifier without days of human labeling

Oct 17, 2024 • 55

How to optimize your data labelling project with custom interfaces

Oct 16, 2024 • 18

⚗️ 🔥 Building High-Quality Datasets with distilabel and Prometheus 2

Jun 3, 2024 • 26

⚗️ 🧑🏼‍🌾 Let's grow some Domain Specific Datasets together

Apr 29, 2024 • 29

Organizations

burtenshaw's activity

upvoted an article 5 days ago

Crowd-sourced Open Preference Dataset for Text-to-Image Generation

Article • 5 days ago • 17

upvoted 2 papers 26 days ago

Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models

Paper • 2412.09645 • Published Dec 10, 2024 • 35

RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within Generation

Paper • 2412.11919 • Published 27 days ago • 33

upvoted a paper about 1 month ago

Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation

Paper • 2412.03304 • Published Dec 4, 2024 • 17

upvoted 3 articles about 1 month ago

🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs

Article • Dec 4, 2024 • 76

Use Models from the Hugging Face Hub in LM Studio

Article • Nov 28, 2024 • 130

To what extent are we responsible for our content and how to create safer Spaces?

Article • Aug 30, 2024 • 3

upvoted an article about 2 months ago

Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK

Article • Nov 21, 2024 • 35

upvoted 4 articles 3 months ago

How to optimize your data labelling project with custom interfaces

Article • Oct 16, 2024 • 18

How to build a custom text classifier without days of human labeling

Article • Oct 17, 2024 • 55

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

Article • Oct 14, 2024 • 61

Recoloring photos with diffusers

Article • Oct 9, 2024 • 28

upvoted 2 papers 4 months ago

Making Text Embedders Few-Shot Learners

Paper • 2409.15700 • Published Sep 24, 2024 • 30

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Paper • 2305.18290 • Published May 29, 2023 • 50

upvoted an article 4 months ago

Selective fine-tuning of Language Models with Spectrum

Article • Sep 3, 2024 • 30

upvoted a paper 4 months ago

The Future of Open Human Feedback

Paper • 2408.16961 • Published Aug 15, 2024 • 21

upvoted a collection 5 months ago

Minitron

Collection

A family of compressed models obtained via pruning and knowledge distillation • 12 items • Updated 1 day ago • 60

upvoted 2 papers 5 months ago

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22, 2024 • 254

xGen-MM (BLIP-3): A Family of Open Large Multimodal Models

Paper • 2408.08872 • Published Aug 16, 2024 • 98

upvoted an article 5 months ago

Welcome FalconMamba: The first strong attention-free 7B model

Article • Aug 12, 2024 • 108

Articles

Introducing the Synthetic Data Generator - Build Datasets with Natural Language

Open Preference Dataset for Text-to-Image Generation by the 🤗 Community

Let’s make a generation of amazing image generation models

Zero to Hero with the TRL learning link bomb 💣

Low Code Large Language Model Alignment
