IvanHU commited on
Commit
b721e32
·
verified ·
1 Parent(s): b9cb38c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +35 -1
README.md CHANGED
@@ -1,4 +1,38 @@
1
  ---
2
  license: mit
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
3
  ---
4
- Coming Soon...
 
1
  ---
2
  license: mit
3
+ datasets:
4
+ - HuggingFaceFW/fineweb-edu
5
+ - bigcode/the-stack-v2
6
+ - mlfoundations/dclm-baseline-1.0
7
+ - math-ai/AutoMathText
8
+ - gair-prox/open-web-math-pro
9
+ - RUC-AIBOX/long_form_thought_data_5k
10
+ - internlm/Lean-Workbook
11
+ - internlm/Lean-Github
12
+ - deepseek-ai/DeepSeek-Prover-V1
13
+ - ScalableMath/Lean-STaR-base
14
+ - ScalableMath/Lean-STaR-plus
15
+ - ScalableMath/Lean-CoT-base
16
+ - ScalableMath/Lean-CoT-plus
17
+ - opencsg/chinese-fineweb-edu
18
+ - liwu/MNBVC
19
+ - vikp/textbook_quality_programming
20
+ - HuggingFaceTB/smollm-corpus
21
+ - OpenCoder-LLM/opc-annealing-corpus
22
+ - OpenCoder-LLM/opc-sft-stage1
23
+ - OpenCoder-LLM/opc-sft-stage2
24
+ - XinyaoHu/AMPS_mathematica
25
+ - deepmind/math_dataset
26
+ - mrfakename/basic-math-10m
27
+ - microsoft/orca-math-word-problems-200k
28
+ - AI-MO/NuminaMath-CoT
29
+ - HuggingFaceTB/cosmopedia
30
+ - MU-NLPC/Calc-ape210k
31
+ - manu/project_gutenberg
32
+ - storytracer/LoC-PD-Books
33
+ - allenai/dolma
34
+ language:
35
+ - en
36
+ - zh
37
  ---
38
+ Coming Soon...