Agustín Piqueres Lajarín

plaguss

Recent Activity

updated a dataset 4 days ago

plaguss/SYNTHETIC-1-SFT-Data-Code_decont

published a dataset 4 days ago

plaguss/SYNTHETIC-1-SFT-Data-Code_decont

updated a dataset 4 days ago

open-r1/SYNTHETIC-1-SFT-Data-Code_decontaminated

plaguss's activity

upvoted an article 18 days ago

Article

Open R1: Update #2

and 6 others •

18 days ago

• 191

upvoted a paper 22 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 24 days ago • 195

upvoted an article 25 days ago

Article

FuseO1-Preview: System-II Reasoning Fusion of LLMs

and 4 others •

Jan 20

• 17

upvoted an article 26 days ago

Article

Open-R1: Update #1

and 7 others •

27 days ago

• 289

upvoted an article about 1 month ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 782

upvoted a paper about 1 month ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 92

upvoted an article about 2 months ago

Article

Python Is All You Need? Introducing Dria-Agent-α

and 1 other •

Jan 10

• 24

upvoted a collection about 2 months ago

Scaling Test-Time Compute with Open Models

Collection

Models and datasets used in our blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute • 10 items • Updated Jan 6 • 23

upvoted an article about 2 months ago

Article

Process Reinforcement through Implicit Rewards

and 1 other •

Jan 3

• 24

upvoted 3 papers 3 months ago

upvoted a collection 3 months ago

SmolVLM

Collection

State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct • 5 items • Updated 8 days ago • 34

upvoted 2 articles 3 months ago

Article

Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK

and 1 other •

Nov 21, 2024

• 35

Article

Halo: Open Source Health Tracking with Wearables

Nov 19, 2024

• 107

upvoted a paper 4 months ago

Aligning Large Language Models via Self-Steering Optimization

Paper • 2410.17131 • Published Oct 22, 2024 • 23

upvoted 3 articles 4 months ago

Article

Releasing the largest multilingual open pretraining dataset

and 2 others •

Nov 13, 2024

• 99

Article

Releasing Outlines-core 0.1.0: structured generation in Rust and Python

Oct 22, 2024

• 44

Article

How to build a custom text classifier without days of human labeling

and 4 others •

Oct 17, 2024

• 55

upvoted an article 5 months ago

Article

How to optimize your data labelling project with custom interfaces

and 9 others •

Oct 16, 2024

• 18