メタデータラボ様からの計算資源のご提供により構築したモデルおよびデータセットhttps://prtimes.jp/main/html/rd/p/000000008.000056944.html
kaeru39 PRO
ryota39
AI & ML interests
LLM × RL
Recent Activity
updated
a dataset
3 days ago
preference-team/dataset-for-annotation-v2-annotated
updated
a dataset
3 days ago
preference-team/progress
liked
a model
3 days ago
HuggingFaceTB/SmolLM2-1.7B
Organizations
Collections
7
spaces
2
models
17

ryota39/gemma-2-2b-jpn-it-q8
Updated
•
25

ryota39/Tora-12B
Text Generation
•
Updated
•
13
•
1

ryota39/Tora-7B-v0.1
Text Generation
•
Updated
•
35
•
2

ryota39/mluke-large-lite-reward
Text Classification
•
Updated
•
90

ryota39/retriva-bert-preference-classifier
Text Classification
•
Updated
•
140

ryota39/Tora-7B-v0.2
Text Generation
•
Updated
•
12
•
1

ryota39/llm-jp-1b-sft-100k-LoRA-dpo-12k
Text Generation
•
Updated
•
95

ryota39/Phi-3-mini-4k-instruct-dpo
Text Generation
•
Updated
•
179
•
3

ryota39/llm-jp-1b-sft-15k
Text Generation
•
Updated
•
111

ryota39/llm-jp-1b-sft-100k-LoRA
Text Generation
•
Updated
•
21
datasets
28
ryota39/wild_chat_ja
Viewer
•
Updated
•
3.49k
•
58
ryota39/aya-evol-instruct
Viewer
•
Updated
•
29.2k
•
74
ryota39/JCommonsenseMorality
Viewer
•
Updated
•
9.98k
•
125
ryota39/hh-rlhf
Viewer
•
Updated
•
169k
•
94
ryota39/preference-en-ja-100k
Viewer
•
Updated
•
101k
•
76
•
1
ryota39/preference_test
Viewer
•
Updated
•
29.6k
•
78
ryota39/preference_test_annotated
Viewer
•
Updated
•
5
•
70
ryota39/open_preference_v0.4
Viewer
•
Updated
•
202k
•
94
•
1
ryota39/webgpt_comparisons-ja
Viewer
•
Updated
•
17.4k
•
90
•
1
ryota39/synthetic-instruct-gptj-pairwise-ja
Viewer
•
Updated
•
33.1k
•
76
•
1