Aurelien Lucchi's picture

1

Aurelien Lucchi

alucchi

·

AI & ML interests

None yet

Recent Activity

new activity 11 days ago

open-r1/README:[Experiment] Applying GRPO to DeepSeek-R1-Distill-Qwen-1.5B with LIMO

updated a dataset 18 days ago

alucchi/Llama-3.2-1B-Instruct-best_of_n-prm-DPOpairs

published a dataset 18 days ago

alucchi/Llama-3.2-1B-Instruct-best_of_n-prm-DPOpairs

View all activity

Organizations

None yet

alucchi's activity

New activity in open-r1/README 11 days ago

[Experiment] Applying GRPO to DeepSeek-R1-Distill-Qwen-1.5B with LIMO

#15 opened 11 days ago by

updated a dataset 18 days ago

alucchi/Llama-3.2-1B-Instruct-best_of_n-prm-DPOpairs

Viewer • Updated 18 days ago • 114 • 89

published a dataset 18 days ago

alucchi/Llama-3.2-1B-Instruct-best_of_n-prm-DPOpairs

Viewer • Updated 18 days ago • 114 • 89

updated a dataset 18 days ago

alucchi/Llama-3.2-1B-Instruct-best_of_n-prm-completions

Viewer • Updated 18 days ago • 10 • 101

published a dataset 18 days ago

alucchi/Llama-3.2-1B-Instruct-best_of_n-prm-completions

Viewer • Updated 18 days ago • 10 • 101