Ahmed Abdelali committed
Commit · ef485b3
1 Parent(s): 6c37233
pushd updates for README and config
- README.md +4 -2
- config.json +1 -0
README.md
CHANGED
@@ -3,6 +3,8 @@ language: ar
 tags:
 - pytorch
 - tf
+- QARiB
+- qarib
 datasets:
 - arabic_billion_words
 - open_subtitles
@@ -29,11 +31,11 @@ QARiB: Is the Arabic name for "Boat".
 - Number of Layers: 12
 
 ## Training QARiB
-
+See details in [Training QARiB](https://github.com/qcri/QARIB/Training_QARiB.md)
 
 ## Using QARiB
 
-You can use the raw model for either masked language modeling or next sentence prediction, but it's mostly intended to be fine-tuned on a downstream task. See the model hub to look for fine-tuned versions on a task that interests you. For more details, see [Using QARiB](
+You can use the raw model for either masked language modeling or next sentence prediction, but it's mostly intended to be fine-tuned on a downstream task. See the model hub to look for fine-tuned versions on a task that interests you. For more details, see [Using QARiB](https://github.com/qcri/QARIB/Using_QARiB.md)
 
 ### How to use
 You can use this model directly with a pipeline for masked language modeling:
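Note on the first README.md hunk: the two added tags sit in the model card's YAML front matter. A minimal sketch of reading them back with only the Python standard library follows. It assumes the README opens with a `---` delimited block. The helper `front_matter_list` and the abbreviated `README` sample (closed right after `datasets`) are hypothetical and exist only for illustration. A real YAML parser would be the safer choice for anything beyond flat lists.

```python
def front_matter_list(readme_text, key):
    """Return the items of a flat top-level list in a '---' front matter block."""
    lines = readme_text.splitlines()
    if not lines or lines[0].strip() != "---":
        return []  # no front matter at all
    items = []
    in_key = False
    for line in lines[1:]:
        if line.strip() == "---":  # closing delimiter ends the block
            break
        if in_key and line.startswith("- "):
            items.append(line[2:].strip())
        elif line.rstrip().endswith(":") and not line.startswith(" "):
            # A bare "name:" line starts a new list; track only the one we want.
            in_key = line.rstrip()[:-1] == key
        else:
            in_key = False
    return items


# Abbreviated sample built from the lines visible in this diff (assumption:
# the real front matter may contain further keys before the closing '---').
README = """---
language: ar
tags:
- pytorch
- tf
- QARiB
- qarib
datasets:
- arabic_billion_words
- open_subtitles
---
# QARiB
"""

tags = front_matter_list(README, "tags")
datasets = front_matter_list(README, "datasets")
print(tags)
print(datasets)
```

Both spellings, `QARiB` and `qarib`, come back as separate entries, which is consistent with the commit adding each casing as its own tag.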
config.json
CHANGED
@@ -5,6 +5,7 @@
 "attention_probs_dropout_prob": 0.1,
 "directionality": "bidi",
 "hidden_act": "gelu",
+"model_type": "bert",
 "hidden_dropout_prob": 0.1,
 "hidden_size": 768,
 "initializer_range": 0.02,
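Note on the config.json hunk: the added `model_type` key is the kind of field a generic loader can use to decide which architecture a config describes, rather than guessing from the repository name. The sketch below illustrates that lookup with the standard library only. `CONFIG_FRAGMENT` contains just the keys visible in this hunk, and `KNOWN_ARCHITECTURES` and `resolve_architecture` are hypothetical names, not any library's actual mapping or API.

```python
import json

# Only the keys visible in this hunk; the real config.json has more.
CONFIG_FRAGMENT = """{
  "attention_probs_dropout_prob": 0.1,
  "directionality": "bidi",
  "hidden_act": "gelu",
  "model_type": "bert",
  "hidden_dropout_prob": 0.1,
  "hidden_size": 768,
  "initializer_range": 0.02
}"""

# Hypothetical registry for illustration; not any library's real mapping.
KNOWN_ARCHITECTURES = {"bert": "BERT-style encoder"}


def resolve_architecture(config_text):
    """Pick an architecture label from the config's model_type key."""
    config = json.loads(config_text)
    model_type = config.get("model_type")
    if model_type is None:
        # This is the situation the config was in before this commit.
        raise KeyError("config.json has no 'model_type' key")
    if model_type not in KNOWN_ARCHITECTURES:
        raise ValueError(f"unknown model_type: {model_type!r}")
    return KNOWN_ARCHITECTURES[model_type]


architecture = resolve_architecture(CONFIG_FRAGMENT)
print(architecture)
```

Without the key, this sketch raises a `KeyError` instead of guessing, which shows why declaring `"model_type": "bert"` explicitly is useful.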