Feature Extraction
Transformers
Safetensors
English
bamboo
custom_code
yixinsong commited on
Commit
f7a2a30
1 Parent(s): d3735d4

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -44,7 +44,7 @@ The following table shows the hyper-paramters we used in our training process.
44
  | Batch Size | 4M |
45
  | Weight Decay | 0.1 |
46
 
47
- **Second phase**: We further adjusted the training corpus ratio, incorporating more domain-specific datasets(Math, Coding), and continued training for 50B tokens.
48
 
49
  | Hyper-parameters | |
50
  | --------------------- | ----------- |
 
44
  | Batch Size | 4M |
45
  | Weight Decay | 0.1 |
46
 
47
+ **Second phase**: We further adjusted the training corpus ratio, incorporating more domain-specific datasets (e.g., Math, Coding), and continued training for 50B tokens.
48
 
49
  | Hyper-parameters | |
50
  | --------------------- | ----------- |