Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
2140.8
TFLOPS
1206
60
53
Quentin Gallouédec
qgallouedec
Follow
MajaQuark's profile picture
tacigomess's profile picture
fdb75017's profile picture
139 followers
·
75 following
QGallouedec
qgallouedec
qgallouedec
qgallouedec.bsky.social
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 2 hours ago
The Llama 3 Herd of Models
updated
a dataset
3 days ago
trl-lib/documentation-images
updated
a dataset
4 days ago
qgallouedec/trl-metrics
View all activity
Organizations
Articles
4
Article
288
Open-R1: Update #1
Article
188
Visualize and understand GPU memory in PyTorch
View all Articles
Papers
4
arxiv:
2402.09844
arxiv:
2402.03046
arxiv:
2208.14928
arxiv:
2106.13687
spaces
1
Running
9
Train Memory
📈
Generate memory usage forecast for model training
models
715
Sort: Recently updated
qgallouedec/Qwen2.5-0.5B-GRPO-main
Text Generation
•
Updated
9 days ago
•
6
qgallouedec/gemma-2-2B-it-thinking-function_calling
Updated
10 days ago
qgallouedec/Qwen2.5-0.5B-GRPO-2873
Updated
11 days ago
qgallouedec/Qwen2.5-0.5B-GRPO-2776-next
Updated
16 days ago
qgallouedec/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
Updated
20 days ago
•
14
qgallouedec/Qwen2.5-32B-Open-R1-GRPO
Updated
22 days ago
•
1
qgallouedec/Qwen2.5-14B-Open-R1-GRPO
Updated
22 days ago
qgallouedec/Qwen2.5-7B-Open-R1-GRPO
Updated
22 days ago
qgallouedec/Qwen2-0.5B-GRPO
Updated
Jan 19
qgallouedec/tiny-Qwen2ForSequenceClassification-2.5
Text Classification
•
Updated
Jan 14
•
12
Expand 715 models
datasets
67
Sort: Recently updated
qgallouedec/trl-metrics
Viewer
•
Updated
4 days ago
•
86.1k
•
3.81k
•
1
qgallouedec/prm800k
Viewer
•
Updated
Dec 17, 2024
•
41.2k
•
151
•
3
qgallouedec/ultrafeedback-prompt
Viewer
•
Updated
Sep 9, 2024
•
60.9k
•
72
qgallouedec/ultrafeedback-gpt-3.5-turbo-helpfulness
Viewer
•
Updated
Sep 9, 2024
•
16.6k
•
105
qgallouedec/lm-human-preferences-descriptiveness
Viewer
•
Updated
Sep 9, 2024
•
6.26k
•
59
qgallouedec/lm-human-preferences-sentiment
Viewer
•
Updated
Sep 9, 2024
•
6.26k
•
76
qgallouedec/tldr-preference
Viewer
•
Updated
Sep 9, 2024
•
179k
•
77
qgallouedec/tldr
Viewer
•
Updated
Sep 9, 2024
•
130k
•
74
qgallouedec/hh-rlhf-helpful-base
Viewer
•
Updated
Sep 5, 2024
•
46.2k
•
61
qgallouedec/hh-rlhf-helpful-base-trl-style
Viewer
•
Updated
Sep 5, 2024
•
46.2k
•
84
Expand 67 datasets