Nguyễn Minh Phúc
DatPySci
·
AI & ML interests
Reinforcement learning, NLP
Recent Activity
updated
a dataset
3 days ago
DatPySci/Llama-3.1-8B-rm-anthropic-hh
published
a dataset
3 days ago
DatPySci/Llama-3.1-8B-rm-anthropic-hh
updated
a dataset
3 days ago
DatPySci/Llama-3.1-8B-rm-tldr-pref
Organizations
Collections
1
models
85
![](https://cdn-avatars.huggingface.co/v1/production/uploads/637262ce2f98dcc049b41031/ZC6RgI8qwp7U0lxr7pPlT.jpeg)
DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.01_steps_72000__tldr
Updated
![](https://cdn-avatars.huggingface.co/v1/production/uploads/637262ce2f98dcc049b41031/ZC6RgI8qwp7U0lxr7pPlT.jpeg)
DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.1_steps_32400__tldr
Updated
![](https://cdn-avatars.huggingface.co/v1/production/uploads/637262ce2f98dcc049b41031/ZC6RgI8qwp7U0lxr7pPlT.jpeg)
DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.05_steps_32400__tldr
Updated
![](https://cdn-avatars.huggingface.co/v1/production/uploads/637262ce2f98dcc049b41031/ZC6RgI8qwp7U0lxr7pPlT.jpeg)
DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.01_steps_32400__tldr
Updated
![](https://cdn-avatars.huggingface.co/v1/production/uploads/637262ce2f98dcc049b41031/ZC6RgI8qwp7U0lxr7pPlT.jpeg)
DatPySci/llama3-1b_reward_tldr
Text Classification
•
Updated
•
110
![](https://cdn-avatars.huggingface.co/v1/production/uploads/637262ce2f98dcc049b41031/ZC6RgI8qwp7U0lxr7pPlT.jpeg)
DatPySci/EleutherAI_pythia-2.8b-deduped__dpo_pythia-2.8b_beta-0.01__tldr
Updated
![](https://cdn-avatars.huggingface.co/v1/production/uploads/637262ce2f98dcc049b41031/ZC6RgI8qwp7U0lxr7pPlT.jpeg)
DatPySci/EleutherAI_pythia-410m-deduped__ipo_ipo_pythia-1b_beta-0.03__tldr
Updated
![](https://cdn-avatars.huggingface.co/v1/production/uploads/637262ce2f98dcc049b41031/ZC6RgI8qwp7U0lxr7pPlT.jpeg)
DatPySci/EleutherAI_pythia-410m-deduped__length_IS_ipo_pythia-1b_beta-0.03__tldr
Updated
![](https://cdn-avatars.huggingface.co/v1/production/uploads/637262ce2f98dcc049b41031/ZC6RgI8qwp7U0lxr7pPlT.jpeg)
DatPySci/EleutherAI_pythia-410m-deduped__ipo_ipo_pythia-1b_beta-0.02__tldr
Updated
![](https://cdn-avatars.huggingface.co/v1/production/uploads/637262ce2f98dcc049b41031/ZC6RgI8qwp7U0lxr7pPlT.jpeg)
DatPySci/EleutherAI_pythia-410m-deduped__length_IS_ipo_pythia-1b_beta-0.02__tldr
Updated
datasets
57
DatPySci/Llama-3.1-8B-rm-anthropic-hh
Viewer
•
Updated
•
140k
•
16
DatPySci/Llama-3.1-8B-rm-tldr-pref
Viewer
•
Updated
•
177k
•
30
DatPySci/tldr_pythia-6.9b_pref
Viewer
•
Updated
•
94.9k
•
68
DatPySci/tldr_synthetic_llama3_3b_32
Viewer
•
Updated
•
5.47k
•
64
DatPySci/llama3_3b_sft_tldr_synthetic
Viewer
•
Updated
•
5.47k
•
117
DatPySci/weak_gpt2_large_dpo_hh
Viewer
•
Updated
•
8k
•
46
DatPySci/weak_gpt2_medium_dpo_hh
Viewer
•
Updated
•
8k
•
57
DatPySci/weak_gpt2_dpo_hh
Viewer
•
Updated
•
8k
•
59
DatPySci/Llama-3.2-3B_refine_gpt2-large_tldr
Viewer
•
Updated
•
8k
•
82
DatPySci/Llama-3.2-3B_refine_gpt2-medium_tldr
Viewer
•
Updated
•
8k
•
80