jaigouk
's Collections
datasets
updated
argilla/distilabel-intel-orca-dpo-pairs
Viewer
•
Updated
•
12.9k
•
495
•
170
Viewer
•
Updated
•
66.4k
•
164
•
204
argilla/ultrafeedback-binarized-preferences-cleaned
Viewer
•
Updated
•
60.9k
•
4.86k
•
130
Viewer
•
Updated
•
15.3k
•
67
•
18
theblackcat102/evol-codealpaca-v1
Viewer
•
Updated
•
111k
•
814
•
156
Viewer
•
Updated
•
395k
•
6.26k
•
350
glaiveai/glaive-code-assistant-v2
Viewer
•
Updated
•
215k
•
64
•
44
Viewer
•
Updated
•
12.9k
•
1.24k
•
293
Viewer
•
Updated
•
183k
•
484
•
285
garage-bAInd/Open-Platypus
Viewer
•
Updated
•
24.9k
•
3.34k
•
376
LLM360/CrystalCoderDatasets
Updated
•
1.92k
•
20
protectai/deberta-v3-base-prompt-injection
Text Classification
•
Updated
•
19.2k
•
73
nampdn-ai/tiny-orca-textbooks
Viewer
•
Updated
•
147k
•
51
•
38
code-search-net/code_search_net
Updated
•
3.67k
•
279
WhiteRabbitNeo/WRN-Chapter-1
Viewer
•
Updated
•
7.75k
•
62
•
47
WhiteRabbitNeo/WRN-Chapter-2
Viewer
•
Updated
•
11.1k
•
42
•
19
llm-blender/PairRM
Text Generation
•
Updated
•
6.75k
•
196
Viewer
•
Updated
•
31.1M
•
13.5k
•
574
Viewer
•
Updated
•
3.54k
•
128
•
55
NousResearch/json-mode-eval
Viewer
•
Updated
•
100
•
650
•
33
Viewer
•
Updated
•
2.75M
•
8.21k
•
340
Viewer
•
Updated
•
518k
•
34
•
1
laurentiubp/openhermes-scored
Viewer
•
Updated
•
185k
•
36
•
1
Towards Best Practices for Open Datasets for LLM Training
Paper
•
2501.08365
•
Published
•
47