microsoft/deberta-v3-large

DeBERTa committed on Oct 20, 2021

Commit c29fd30 · 1 Parent(s): 5dc4f57

Update README.md

Files changed (1)

README.md +35 -0

@@ -29,6 +29,41 @@ We present the dev results on SQuAD 1.1/2.0 and MNLI tasks.
 | **DeBERTa-v3-large**  | -/-   | 91.5/89.0 | **91.9**   |
 | DeBERTa-v2-xxlarge|96.1/91.4	|**92.2/89.7**	|  91.7  |
+#### Fine-tuning with HF transformers
+```bash
+#!/bin/bash
+cd transformers/examples/pytorch/text-classification/
+pip install datasets
+export TASK_NAME=mnli
+output_dir="ds_results"
+num_gpus=8
+batch_size=8
+python -m torch.distributed.launch --nproc_per_node=${num_gpus} \
+  run_glue.py \
+  --model_name_or_path microsoft/deberta-v3-large \
+  --task_name $TASK_NAME \
+  --do_train \
+  --do_eval \
+  --evaluation_strategy steps \
+  --max_seq_length 256 \
+  --warmup_steps 1000 \
+  --per_device_train_batch_size ${batch_size} \
+  --learning_rate 6e-6 \
+  --num_train_epochs 2 \
+  --output_dir $output_dir \
+  --overwrite_output_dir \
+  --logging_steps 1000 \
+  --logging_dir $output_dir
+```
 ### Citation
 If you find DeBERTa useful for your work, please cite the following paper: