akhooli commited on
Commit
4673e52
1 Parent(s): e617f12

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +43 -0
README.md ADDED
@@ -0,0 +1,43 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language: "ar"
3
+ tags:
4
+ - text-generation
5
+ datasets:
6
+ - APCD
7
+ ---
8
+
9
+ # GPT2-Arabic-Poetry-2023
10
+
11
+ ## Model description
12
+
13
+ Fine-tuned model of Arabic poetry dataset based on aragpt2-medium.
14
+
15
+ ## Intended uses & limitations
16
+
17
+ #### How to use
18
+
19
+ An example is provided in this [colab notebook](todo).
20
+
21
+ #### Limitations and bias
22
+
23
+ Both the GPT2-small-arabic (trained on Arabic Wikipedia) and this model have several limitations in terms of coverage and training performance.
24
+ Use them as demonstrations or proof of concepts but not as production code.
25
+
26
+ ## Training data
27
+
28
+ This pretrained model used the [dataset](todo) from several eras with a total of around 1.4m lines.
29
+ The dataset was trained (fine-tuned) based on the [aragpt2-medium](https://huggingface.co/aubmindlab/aragpt2-medium) transformer model.
30
+
31
+ ## Training procedure
32
+
33
+ Training was done using [Simple Transformers](https://github.com/ThilinaRajapakse/simpletransformers) library on Colab, using free GPU.
34
+
35
+ ## Eval results
36
+ Final perplexity reached was 49.56, train loss: 3.336
37
+
38
+ ### BibTeX entry and citation info
39
+
40
+ ```bibtex
41
+ @inproceedings{Abed Khooli,
42
+ year={2023}
43
+ }