julien-c HF staff commited on
Commit
182fbe1
1 Parent(s): b204cc7

Migrate model card from transformers-repo

Browse files

Read announcement at https://discuss.huggingface.co/t/announcement-all-model-cards-will-be-migrated-to-hf-co-model-repos/2755
Original file history: https://github.com/huggingface/transformers/commits/master/model_cards/lanwuwei/GigaBERT-v3-Arabic-and-English/README.md

Files changed (1) hide show
  1. README.md +12 -3
README.md CHANGED
@@ -1,3 +1,13 @@
 
 
 
 
 
 
 
 
 
 
1
  ## GigaBERT-v3
2
  GigaBERT-v3 is a customized bilingual BERT for English and Arabic. It was pre-trained in a large-scale corpus (Gigaword+Oscar+Wikipedia) with ~10B tokens, showing state-of-the-art zero-shot transfer performance from English to Arabic on information extraction (IE) tasks. More details can be found in the following paper:
3
 
@@ -8,11 +18,10 @@ GigaBERT-v3 is a customized bilingual BERT for English and Arabic. It was pre-tr
8
  year = {2020}
9
  }
10
 
11
- ## Download
12
  ```
13
  from transformers import *
14
  tokenizer = BertTokenizer.from_pretrained("lanwuwei/GigaBERT-v3-Arabic-and-English", do_lower_case=True)
15
  model = BertForTokenClassification.from_pretrained("lanwuwei/GigaBERT-v3-Arabic-and-English")
16
  ```
17
- Here is downloadable link [GigaBERT-v3](https://drive.google.com/drive/u/1/folders/1zgUXz8FQPHmWVNR7tHyPq1E6SmrMuPv6).
18
-
1
+ ---
2
+ language:
3
+ - en
4
+ - ar
5
+ datasets:
6
+ - gigaword
7
+ - oscar
8
+ - wikipedia
9
+ ---
10
+
11
  ## GigaBERT-v3
12
  GigaBERT-v3 is a customized bilingual BERT for English and Arabic. It was pre-trained in a large-scale corpus (Gigaword+Oscar+Wikipedia) with ~10B tokens, showing state-of-the-art zero-shot transfer performance from English to Arabic on information extraction (IE) tasks. More details can be found in the following paper:
13
 
18
  year = {2020}
19
  }
20
 
21
+ ## Usage
22
  ```
23
  from transformers import *
24
  tokenizer = BertTokenizer.from_pretrained("lanwuwei/GigaBERT-v3-Arabic-and-English", do_lower_case=True)
25
  model = BertForTokenClassification.from_pretrained("lanwuwei/GigaBERT-v3-Arabic-and-English")
26
  ```
27
+ More code examples can be found [here](https://github.com/lanwuwei/GigaBERT).