riotu-lab commited on
Commit
8a9d3f9
1 Parent(s): a802db3

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -22,7 +22,7 @@ This model represents a significant stride in LLM research, specifically address
22
  - **Context Window Size**: 1024 tokens
23
 
24
  ## Training
25
- - **Dataset**: C4, Twitter, Wiki
26
  - **Data Size**: 23 GB
27
  - **Tokenizer**: Aranizer 64K
28
  - **Tokens**: Over 3.3 billion
 
22
  - **Context Window Size**: 1024 tokens
23
 
24
  ## Training
25
+ - **Dataset**: Scraped texts contains scientific articles, and general texts
26
  - **Data Size**: 23 GB
27
  - **Tokenizer**: Aranizer 64K
28
  - **Tokens**: Over 3.3 billion