Back to all models

Unable to determine this model’s pipeline type. Check the docs .

Monthly model downloads

lanwuwei/GigaBERT-v4-Arabic-and-English lanwuwei/GigaBERT-v4-Arabic-and-English
last 30 days



Contributed by

lanwuwei Wuwei Lan
3 models

How to use this model directly from the 🤗/transformers library:

Copy to clipboard
from transformers import AutoTokenizer, AutoModel tokenizer = AutoTokenizer.from_pretrained("lanwuwei/GigaBERT-v4-Arabic-and-English") model = AutoModel.from_pretrained("lanwuwei/GigaBERT-v4-Arabic-and-English")
Uploaded in S3


GigaBERT-v4 is a continued pre-training of GigaBERT-v3 on code-switched data, showing improved zero-shot transfer performance from English to Arabic on information extraction (IE) tasks. More details can be found in the following paper:

  author     = {Lan, Wuwei and Chen, Yang and Xu, Wei and Ritter, Alan},
    title      = {GigaBERT: Zero-shot Transfer Learning from English to Arabic},
    booktitle  = {Proceedings of The 2020 Conference on Empirical Methods on Natural Language Processing (EMNLP)},
    year       = {2020}


from transformers import *
tokenizer = BertTokenizer.from_pretrained("lanwuwei/GigaBERT-v4-Arabic-and-English", do_lower_case=True)
model = BertForTokenClassification.from_pretrained("lanwuwei/GigaBERT-v4-Arabic-and-English")

Here is downloadable link GigaBERT-v4.