pascalrai committed
Commit 2a37fef
Parent: 024106f

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -29,7 +29,7 @@ The parameter size for the model is 101M.
 The model is trained using BART noising techniques like sentence permutation, token deletion, and random token masking.
 <br>The noisy data is fed into the encoder of the transformer and the denoising task/ objective is fulfilled by the decoder of the transformer model.
 
-Normal cross-entropy loss is used for both the pre-training and fine-tuning of the model.
+Cross-entropy loss is used for both the pre-training and fine-tuning of the model.
 
 The Loss for pre-training is as follows:
 
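For context, the noising operations named in the README (sentence permutation, token deletion, random token masking) and the cross-entropy loss mentioned in the changed line can be sketched as below. This is a minimal illustrative sketch, not the model's actual preprocessing or training code: the function names, the `<mask>` token string, and the corruption probabilities are assumptions.

```python
import math
import random

MASK = "<mask>"  # placeholder mask token; the real tokenizer's mask symbol may differ

def permute_sentences(sentences, rng):
    # Sentence permutation: shuffle the order of sentences in a document.
    out = list(sentences)
    rng.shuffle(out)
    return out

def delete_tokens(tokens, p, rng):
    # Token deletion: drop each token independently with probability p.
    return [t for t in tokens if rng.random() >= p]

def mask_tokens(tokens, p, rng):
    # Random token masking: replace each token with MASK with probability p.
    return [MASK if rng.random() < p else t for t in tokens]

def cross_entropy(probs, target_index):
    # Cross-entropy for a single position: negative log-probability
    # the model assigns to the correct (original, un-noised) token.
    return -math.log(probs[target_index])

rng = random.Random(0)
tokens = "the noisy data is fed into the encoder".split()
noisy = mask_tokens(delete_tokens(tokens, 0.1, rng), 0.15, rng)
print(noisy)
# The decoder is then trained to reconstruct `tokens` from `noisy`,
# with the cross-entropy above summed over output positions.
```

The corrupted sequence goes to the encoder; the decoder's reconstruction of the clean sequence is scored with the per-position cross-entropy, which is the loss used for both pre-training and fine-tuning per the README.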