IDEA-CCNL
/

Erlangshen-Longformer-110M

Inference Endpoints

Model card Files Files and versions Community

suolyer commited on Apr 14, 2022

Commit

a925679

•

1 Parent(s): 689f704

Update README.md

Files changed (1) hide show

README.md +40 -40

README.md CHANGED Viewed

@@ -1,41 +1,41 @@
----
-language:
-  - zh
-license: apache-2.0
-widget:
-- text: "生活的真谛是[MASK]。"
----
-# longformer model (Chinese)，one model of [Fengshenbang-LM](https://github.com/IDEA-CCNL/Fengshenbang-LM).
-We modify the original position code of longformer to rotational position coding，and on the basis of [chinese_roformer_L-12_H-768_A-12.zip](https://github.com/ZhuiyiTechnology/roformer), use 180G of data to continue training
-## Usage
-There is no structure of Longformer-base in [Transformers](https://github.com/huggingface/transformers), you can run follow code to get structure of longformer from [Fengshenbang-LM](https://github.com/IDEA-CCNL/Fengshenbang-LM)
- ```shell
- git clone https://github.com/IDEA-CCNL/Fengshenbang-LM.git
- ```
-### Load Model
-```python
-from fengshen import LongformerModel
-from fengshen import LongformerConfig
-from transformers import BertTokenizer
-tokenizer = BertTokenizer.from_pretrained("IDEA-CCNL/longformer_base")
-config = LongformerConfig.from_pretrained("IDEA-CCNL/longformer_base")
-model = LongformerModel.from_pretrained("IDEA-CCNL/longformer_base")
-```
-## Citation
-If you find the resource is useful, please cite the following website in your paper.
-```
-@misc{Fengshenbang-LM,
-  title={Fengshenbang-LM},
-  author={IDEA-CCNL},
-  year={2021},
-  howpublished={\url{https://github.com/IDEA-CCNL/Fengshenbang-LM}},
-}
 ```

+---
+language:
+  - zh
+license: apache-2.0
+widget:
+- text: "生活的真谛是[MASK]。"
+---
+# longformer model (Chinese)，one model of [Fengshenbang-LM](https://github.com/IDEA-CCNL/Fengshenbang-LM).
+We modify the original position code of longformer to rotational position coding，and on the basis of [chinese_roformer_L-12_H-768_A-12.zip](https://github.com/ZhuiyiTechnology/roformer), use 180G of data to continue training
+## Usage
+There is no structure of Longformer-base in [Transformers](https://github.com/huggingface/transformers), you can run follow code to get structure of longformer from [Fengshenbang-LM](https://github.com/IDEA-CCNL/Fengshenbang-LM)
+ ```shell
+ git clone https://github.com/IDEA-CCNL/Fengshenbang-LM.git
+ ```
+### Load Model
+```python
+from fengshen import LongformerModel
+from fengshen import LongformerConfig
+from transformers import BertTokenizer
+tokenizer = BertTokenizer.from_pretrained("IDEA-CCNL/Erlangshen-Longformer-110M")
+config = LongformerConfig.from_pretrained("IDEA-CCNL/Erlangshen-Longformer-110M")
+model = LongformerModel.from_pretrained("IDEA-CCNL/Erlangshen-Longformer-110M")
+```
+## Citation
+If you find the resource is useful, please cite the following website in your paper.
+```
+@misc{Fengshenbang-LM,
+  title={Fengshenbang-LM},
+  author={IDEA-CCNL},
+  year={2021},
+  howpublished={\url{https://github.com/IDEA-CCNL/Fengshenbang-LM}},
+}
 ```