riotu-lab
/

ArabianGPT-01B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

riotu-lab commited on Dec 19, 2023

Commit

ba4a718

•

1 Parent(s): a3f8bd9

Update README.md

Files changed (1) hide show

README.md +19 -22

README.md CHANGED Viewed

@@ -10,37 +10,34 @@ tags:
 # ArabianGPT Model Overview
 ## Introduction
-ArabianGPT is a GPT-2 based model, custom-trained for the Arabic language, as part of the ArabianLLM initiatives at Prince Sultan University's Robotics and Internet of Things Lab.
 ## Key Features
-| Feature                   | Description                |
-|---------------------------|----------------------------|
-| **Architecture**          | GPT-2                      |
-| **Model Size**            | 134 million parameters     |
-| **Layers**                | 12                         |
-| **Model Attention Layers**| 12 (MAL)                   |
-| **Context Window Size**   | 768 tokens                 |
 ## Training
-| Aspect              | Details                       |
-|---------------------|-------------------------------|
-| **Dataset**         | Abu Elkhiar Corpus            |
-| **Data Size**       | 15.5 GB                       |
-| **Words**           | 237.8 million                 |
-| **Tokens**          | Over 1.75 billion             |
-| **Hardware**        | NDIVIA A100                   |
-| **Training Scale**  | 7.5 million examples          |
-| **Training Duration**| 3 days                       |
-| **Performance**     | Final loss of 3.97            |
 ## Role in ArabianLLM Initiatives
-ArabianGPT 0.1B is crucial for advancing Arabic language processing, addressing challenges unique to Arabic morphology and dialects.
 ## Usage
-Suitable for Arabic text generation tasks. Example usage with Transformers SummarizationPipeline:
 ```python
 from transformers import pipeline

 # ArabianGPT Model Overview
 ## Introduction
+ArabianGPT-0.1B, developed under the ArabianLLM initiatives, is a specialized GPT-2 model optimized for Arabic language modeling.
+It's a product of the collaborative efforts at Prince Sultan University's Robotics and Internet of Things Lab, focusing on enhancing natural language modeling and generation in Arabic.
+This model represents a significant stride in LLM research, specifically addressing the linguistic complexities and nuances of the Arabic language.
 ## Key Features
+- **Architecture**: GPT-2
+- **Model Size**: 134 million parameters
+- **Layers**: 12
+- **Model Attention Layers (MAL)**: 12
+- **Context Window Size**: 768 tokens
 ## Training
+- **Dataset**: Abu Elkhiar Corpus
+- **Data Size**: 15.5 GB
+- **Words**: 237.8 million
+- **Tokenizer**: Aranizer 64K
+- **Tokens**: Over 1.75 billion
+- **Hardware**: 2 NDIVIA A100 GPUs
+- **Training Scale**: 7.5 million examples
+- **Training Duration**: 3 days
+- **Performance**: Final loss of 3.97
 ## Role in ArabianLLM Initiatives
+ArabianGPT-0.1B (Base Model) is crucial for advancing Arabic language processing, addressing challenges unique to Arabic morphology and dialects.
 ## Usage
+Suitable for Arabic text generation tasks. Example usage with Transformers Pipeline:
 ```python
 from transformers import pipeline