minhtriphan
/

LongFinBERT-base

Inference Endpoints

Model card Files Files and versions Community

minhtriphan commited on Aug 30, 2023

Commit

49e32df

•

1 Parent(s): 0ec865c

Update README.md

Files changed (1) hide show

README.md +6 -4

README.md CHANGED Viewed

@@ -19,12 +19,14 @@ We compare the time and space efficiency of this model and some competitors. For
 The experiments are implemented with an NVIDIA A100-SXM4-40GB. Batch size of 1. The figures show the time and memory needed to run one batch. In the training mode, forward pass and backpropagation is included. In the inferring model, only forward pass is included.
 ## Training mode
-![image/png](https://cdn-uploads.huggingface.co/production/uploads/61d2d2993c2083e1c08af221/clg3lSItrQuXL5YYh7dmm.png)
-![image/png](https://cdn-uploads.huggingface.co/production/uploads/61d2d2993c2083e1c08af221/zCwoR6oimLFEO0llErb0g.png)
 # Inferring mode
-![image/png](https://cdn-uploads.huggingface.co/production/uploads/61d2d2993c2083e1c08af221/GKkLON8R1bqa7XRvOoFOp.png)
-![image/png](https://cdn-uploads.huggingface.co/production/uploads/61d2d2993c2083e1c08af221/bmEHrGIaAGGwe75Msx3PL.png)
 # Introduction
 This is the implementation of the BERT model using the LongNet structure (paper: https://arxiv.org/pdf/2307.02486.pdf).

 The experiments are implemented with an NVIDIA A100-SXM4-40GB. Batch size of 1. The figures show the time and memory needed to run one batch. In the training mode, forward pass and backpropagation is included. In the inferring model, only forward pass is included.
 ## Training mode
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/61d2d2993c2083e1c08af221/kbwNUDuHfsJy6FtfoekXi.png)
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/61d2d2993c2083e1c08af221/f-d3hhFAljYMrKkPfn2MJ.png)
 # Inferring mode
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/61d2d2993c2083e1c08af221/9-PCSONEVOTzZgPuaPSzo.png)
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/61d2d2993c2083e1c08af221/q4zLyOQkZ4phmMKPiddSa.png)
 # Introduction
 This is the implementation of the BERT model using the LongNet structure (paper: https://arxiv.org/pdf/2307.02486.pdf).