kaeru39 PRO

ryota39

AI & ML interests

LLM × RL

Recent Activity

updated a dataset 3 days ago

preference-team/dataset-for-annotation-v2-annotated

updated a dataset 3 days ago

preference-team/progress

liked a model 3 days ago

HuggingFaceTB/SmolLM2-1.7B

Organizations

spaces 2

Sleeping

Sake Sonar

🍶

Ask questions about sake brewing

Sleeping

ICLR2025 Sonar-v1

🪄

Search for ICLR2025 papers using keywords

datasets 28

ryota39/wild_chat_ja

Viewer • Updated Jan 23 • 3.49k • 58

ryota39/aya-evol-instruct

Viewer • Updated Jan 6 • 29.2k • 74

ryota39/JCommonsenseMorality

Viewer • Updated Nov 29, 2024 • 9.98k • 125

ryota39/hh-rlhf

Viewer • Updated Nov 26, 2024 • 169k • 94

ryota39/preference-en-ja-100k

Viewer • Updated Nov 19, 2024 • 101k • 76 • 1

ryota39/preference_test

Viewer • Updated Nov 16, 2024 • 29.6k • 78

ryota39/preference_test_annotated

Viewer • Updated Nov 10, 2024 • 5 • 70

ryota39/open_preference_v0.4

Viewer • Updated Aug 26, 2024 • 202k • 94 • 1

ryota39/webgpt_comparisons-ja

Viewer • Updated Aug 16, 2024 • 17.4k • 90 • 1

ryota39/synthetic-instruct-gptj-pairwise-ja

Viewer • Updated Aug 16, 2024 • 33.1k • 76 • 1

Collections 7

ryota39/Phi-3-mini-4k-instruct-dpo

ryota39/truthy-dpo-ja

ryota39/hh-rlhf-12k-ja_orpo

ryota39/dpo-ja-45k

ryota39/Tora-7B-v0.1

ryota39/Tora-7B-v0.2

ryota39/Tora-12B

models 17

ryota39/gemma-2-2b-jpn-it-q8

ryota39/Tora-12B

ryota39/Tora-7B-v0.1

ryota39/mluke-large-lite-reward

ryota39/retriva-bert-preference-classifier

ryota39/Tora-7B-v0.2

ryota39/llm-jp-1b-sft-100k-LoRA-dpo-12k

ryota39/Phi-3-mini-4k-instruct-dpo

ryota39/llm-jp-1b-sft-15k

ryota39/llm-jp-1b-sft-100k-LoRA
