riotu-lab committed on
Commit a3f8bd9
1 Parent(s): 2d3b862

Update README.md

Files changed (1)
  1. README.md +20 -13
README.md CHANGED
@@ -13,21 +13,28 @@ tags:
  ArabianGPT is a GPT-2 based model, custom-trained for the Arabic language, as part of the ArabianLLM initiatives at Prince Sultan University's Robotics and Internet of Things Lab.

  ## Key Features
- - **Architecture**: GPT-2
- - **Model Size**: 134 million parameters
- - **Layers**: 12
- - **Model Attention Layers (MAL)**: 12
- - **Context Window Size**: 768 tokens

  ## Training
- - **Dataset**: Abu Elkhiar Corpus
- - **Data Size**: 15.5 GB
- - **Words**: 237.8 million
- - **Tokens**: Over 1.75 billion
- - **Hardware**: NVIDIA A100
- - **Training Scale**: 7.5 million examples
- - **Training Duration**: 3 days
- - **Performance**: Final loss of 3.97

  ## Role in ArabianLLM Initiatives
  ArabianGPT 0.1B is crucial for advancing Arabic language processing, addressing challenges unique to Arabic morphology and dialects.
 
  ArabianGPT is a GPT-2 based model, custom-trained for the Arabic language, as part of the ArabianLLM initiatives at Prince Sultan University's Robotics and Internet of Things Lab.

  ## Key Features
+
+ | Feature | Description |
+ |---------------------------|----------------------------|
+ | **Architecture** | GPT-2 |
+ | **Model Size** | 134 million parameters |
+ | **Layers** | 12 |
+ | **Model Attention Layers**| 12 (MAL) |
+ | **Context Window Size** | 768 tokens |

  ## Training
+
+ | Aspect | Details |
+ |---------------------|-------------------------------|
+ | **Dataset** | Abu Elkhiar Corpus |
+ | **Data Size** | 15.5 GB |
+ | **Words** | 237.8 million |
+ | **Tokens** | Over 1.75 billion |
+ | **Hardware** | NVIDIA A100 |
+ | **Training Scale** | 7.5 million examples |
+ | **Training Duration**| 3 days |
+ | **Performance** | Final loss of 3.97 |
+

  ## Role in ArabianLLM Initiatives
  ArabianGPT 0.1B is crucial for advancing Arabic language processing, addressing challenges unique to Arabic morphology and dialects.