Nguyễn Minh Phúc's picture

4

Nguyễn Minh Phúc

DatPySci

·

AI & ML interests

Reinforcement learning, NLP

Recent Activity

updated a dataset 3 days ago

DatPySci/Llama-3.1-8B-rm-anthropic-hh

published a dataset 3 days ago

DatPySci/Llama-3.1-8B-rm-anthropic-hh

updated a dataset 3 days ago

DatPySci/Llama-3.1-8B-rm-tldr-pref

View all activity

Organizations

Collections 1

models 85

DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.01_steps_72000__tldr

Updated Nov 18, 2024

DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.1_steps_32400__tldr

Updated Nov 18, 2024

DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.05_steps_32400__tldr

Updated Nov 18, 2024

DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.01_steps_32400__tldr

Updated Nov 18, 2024

DatPySci/llama3-1b_reward_tldr

Text Classification • Updated Nov 11, 2024 • 110

DatPySci/EleutherAI_pythia-2.8b-deduped__dpo_pythia-2.8b_beta-0.01__tldr

Updated Sep 30, 2024

DatPySci/EleutherAI_pythia-410m-deduped__ipo_ipo_pythia-1b_beta-0.03__tldr

Updated Sep 28, 2024

DatPySci/EleutherAI_pythia-410m-deduped__length_IS_ipo_pythia-1b_beta-0.03__tldr

Updated Sep 28, 2024

DatPySci/EleutherAI_pythia-410m-deduped__ipo_ipo_pythia-1b_beta-0.02__tldr

Updated Sep 28, 2024

DatPySci/EleutherAI_pythia-410m-deduped__length_IS_ipo_pythia-1b_beta-0.02__tldr

Updated Sep 28, 2024

datasets 57

DatPySci/Llama-3.1-8B-rm-anthropic-hh

Viewer • Updated 3 days ago • 140k • 16

DatPySci/Llama-3.1-8B-rm-tldr-pref

Viewer • Updated 3 days ago • 177k • 30

DatPySci/tldr_pythia-6.9b_pref

Viewer • Updated 7 days ago • 94.9k • 68

DatPySci/tldr_synthetic_llama3_3b_32

Viewer • Updated 21 days ago • 5.47k • 64

DatPySci/llama3_3b_sft_tldr_synthetic

Viewer • Updated 25 days ago • 5.47k • 117

DatPySci/weak_gpt2_large_dpo_hh

Viewer • Updated Jan 9 • 8k • 46

DatPySci/weak_gpt2_medium_dpo_hh

Viewer • Updated Jan 9 • 8k • 57

DatPySci/weak_gpt2_dpo_hh

Viewer • Updated Jan 9 • 8k • 59

DatPySci/Llama-3.2-3B_refine_gpt2-large_tldr

Viewer • Updated Jan 8 • 8k • 82

DatPySci/Llama-3.2-3B_refine_gpt2-medium_tldr

Viewer • Updated Jan 8 • 8k • 80