Commit a15c275 by gss1147 · verified · 1 parent: 7504163

Create README.md

Files changed (1)
  1. README.md +35 -0
README.md ADDED
@@ -0,0 +1,35 @@
+ ---
+ datasets:
+ - bigcode/the-stack-v2
+ - yulan-team/YuLan-Mini-Datasets
+ - HuggingFaceFW/fineweb-edu
+ - bigcode/the-stack-v2
+ - mlfoundations/dclm-baseline-1.0
+ - math-ai/AutoMathText
+ - gair-prox/open-web-math-pro
+ - RUC-AIBOX/long_form_thought_data_5k
+ - internlm/Lean-Workbook
+ - internlm/Lean-Github
+ - deepseek-ai/DeepSeek-Prover-V1
+ - ScalableMath/Lean-STaR-base
+ - ScalableMath/Lean-STaR-plus
+ - ScalableMath/Lean-CoT-base
+ - ScalableMath/Lean-CoT-plus
+ - opencsg/chinese-fineweb-edu
+ - liwu/MNBVC
+ - vikp/textbook_quality_programming
+ - HuggingFaceTB/smollm-corpus
+ - OpenCoder-LLM/opc-annealing-corpus
+ - OpenCoder-LLM/opc-sft-stage1
+ - OpenCoder-LLM/opc-sft-stage2
+ - XinyaoHu/AMPS_mathematica
+ - deepmind/math_dataset
+ - mrfakename/basic-math-10m
+ - microsoft/orca-math-word-problems-200k
+ - AI-MO/NuminaMath-CoT
+ - HuggingFaceTB/cosmopedia
+ - MU-NLPC/Calc-ape210k
+ - manu/project_gutenberg
+ - storytracer/LoC-PD-Books
+ - allenai/dolma
+ ---