riotu-lab commited on
Commit
45d6d3c
1 Parent(s): f224ad6

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -22,7 +22,7 @@ This model represents a significant stride in LLM research, specifically address
22
  - **Context Window Size**: 768 tokens
23
 
24
  ## Training
25
- - **Dataset**: Abu Elkhiar Corpus
26
  - **Data Size**: 15.5 GB
27
  - **Words**: 237.8 million
28
  - **Tokenizer**: Aranizer 64K
 
22
  - **Context Window Size**: 768 tokens
23
 
24
  ## Training
25
+ - **Dataset**: Scraped Arabic newspaper articles
26
  - **Data Size**: 15.5 GB
27
  - **Words**: 237.8 million
28
  - **Tokenizer**: Aranizer 64K