Aurelien Lucchi
alucchi
0 followers · 1 following
AI & ML interests
None yet
Recent Activity
New activity 11 days ago in open-r1/README: [Experiment] Applying GRPO to DeepSeek-R1-Distill-Qwen-1.5B with LIMO
Updated a dataset 18 days ago: alucchi/Llama-3.2-1B-Instruct-best_of_n-prm-DPOpairs
Published a dataset 18 days ago: alucchi/Llama-3.2-1B-Instruct-best_of_n-prm-DPOpairs
Organizations
None yet
alucchi's activity
New activity in open-r1/README, 11 days ago
[Experiment] Applying GRPO to DeepSeek-R1-Distill-Qwen-1.5B with LIMO • 16
#15 opened 11 days ago by lewtun
Updated a dataset 18 days ago
alucchi/Llama-3.2-1B-Instruct-best_of_n-prm-DPOpairs
Updated 18 days ago • 114 • 89
Published a dataset 18 days ago
alucchi/Llama-3.2-1B-Instruct-best_of_n-prm-DPOpairs
Updated 18 days ago • 114 • 89
Updated a dataset 18 days ago
alucchi/Llama-3.2-1B-Instruct-best_of_n-prm-completions
Updated 18 days ago • 10 • 101
Published a dataset 18 days ago
alucchi/Llama-3.2-1B-Instruct-best_of_n-prm-completions
Updated 18 days ago • 10 • 101