Update model description
README.md
CHANGED
@@ -6,6 +6,11 @@ This model contains just the `IPUConfig` files for running the DeBERTa-base mode
 
 **This model contains no model weights, only an IPUConfig.**
 
+## Model description
+
+DeBERTa ([Decoding-enhanced BERT with Disentangled Attention](https://arxiv.org/abs/2006.03654)) improves the BERT and RoBERTa models using a disentangled attention mechanism and an enhanced mask decoder, which replaces the output softmax layer to predict the masked tokens during pretraining.
+Through these two techniques, it significantly improves the efficiency of model pre-training and the performance of downstream tasks.
+
 ## Usage
 
 ```
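The disentangled attention described in the added paragraph splits each attention score into content-to-content, content-to-position, and position-to-content terms, computed from separate content and relative-position vectors. A minimal NumPy sketch of that scoring follows; the dimensions, random toy weights, distance clipping, and scaling are illustrative assumptions, not DeBERTa's actual implementation:

```python
import numpy as np

# Toy sketch of DeBERTa-style disentangled attention (illustrative only).
rng = np.random.default_rng(0)
seq_len, d = 4, 8
H = rng.standard_normal((seq_len, d))        # content vectors, one per token
P = rng.standard_normal((2 * seq_len, d))    # relative-position vectors

Wq_c = rng.standard_normal((d, d))           # content query projection
Wk_c = rng.standard_normal((d, d))           # content key projection
Wq_r = rng.standard_normal((d, d))           # position query projection
Wk_r = rng.standard_normal((d, d))           # position key projection

Qc, Kc = H @ Wq_c, H @ Wk_c
Qr, Kr = P @ Wq_r, P @ Wk_r

def rel(i, j):
    """Clipped relative distance, shifted to index into P."""
    return int(np.clip(i - j + seq_len, 0, 2 * seq_len - 1))

# A[i, j] = content-to-content + content-to-position + position-to-content
A = np.zeros((seq_len, seq_len))
for i in range(seq_len):
    for j in range(seq_len):
        c2c = Qc[i] @ Kc[j]                  # content attends to content
        c2p = Qc[i] @ Kr[rel(i, j)]          # content attends to position
        p2c = Kc[j] @ Qr[rel(j, i)]          # position attends to content
        A[i, j] = (c2c + c2p + p2c) / np.sqrt(3 * d)

# Softmax over keys turns scores into attention weights.
weights = np.exp(A - A.max(axis=1, keepdims=True))
weights /= weights.sum(axis=1, keepdims=True)
```

Each row of `weights` is a probability distribution over the tokens a given position attends to; the three score terms are what distinguish this from standard single-matrix self-attention.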