EmmaStrong
/

RA-IT-NER-zh-7B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

EmmaStrong commited on Jun 26

Commit

76129db

•

1 Parent(s): 94209ab

Upload README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -8,13 +8,13 @@ language:
 # RA-IT-NER-zh-7B
-**Description**: The RA-IT-NER-zh-7B model is trained from Qwen1.5-7B using the proposed Retrieval Augmented Instruction Tuning (RA-IT) approach. The training data is our constructed [Sky-NER ](https://huggingface.co/datasets/EmmaStrong/Sky-NER), an instruction tuning dataset for Chinese OpenNER. We follow the recipe of [UniversalNER](https://arxiv.org/abs/2308.03279) and use the large-scale [SkyPile Corpus](https://huggingface.co/datasets/Skywork/SkyPile-150B) to construct this dataset. The data was collected by prompting gpt-3.5-turbo-0125 to label entities from passages and provide entity tags. The data collection prompt is as follows:
 <div style="background-color: #f6f8fa; padding: 20px; border-radius: 10px; border: 1px solid #e1e4e8; box-shadow: 0 2px 5px rgba(0,0,0,0.1);">
 <strong>Instruction:</strong><br/>
 给定一段文本，你的任务是抽取所有实体并识别它们的实体类别。输出应为以下JSON格式：[{"实体1": "实体1的类别"}, ...]。</div>
-Check our [paper](todo) for more information. Check our [github repo](https://github.com/Emma1066/Retrieval-Augmented-IT-OpenNER) about how to use the model.
 ## Inference
 The template for inference instances is as follows:

 # RA-IT-NER-zh-7B
+**Description**: The RA-IT-NER-zh-7B model is trained from Qwen1.5-7B using the proposed Retrieval Augmented Instruction Tuning (RA-IT) approach. This model can be used for Chinese Open NER with and without RAG. The training data is our constructed [Sky-NER ](https://huggingface.co/datasets/EmmaStrong/Sky-NER), an instruction tuning dataset for Chinese OpenNER. We follow the recipe of [UniversalNER](https://arxiv.org/abs/2308.03279) and use the large-scale [SkyPile Corpus](https://huggingface.co/datasets/Skywork/SkyPile-150B) to construct this dataset. The data was collected by prompting gpt-3.5-turbo-0125 to label entities from passages and provide entity tags. The data collection prompt is as follows:
 <div style="background-color: #f6f8fa; padding: 20px; border-radius: 10px; border: 1px solid #e1e4e8; box-shadow: 0 2px 5px rgba(0,0,0,0.1);">
 <strong>Instruction:</strong><br/>
 给定一段文本，你的任务是抽取所有实体并识别它们的实体类别。输出应为以下JSON格式：[{"实体1": "实体1的类别"}, ...]。</div>
+Check our [paper](https://arxiv.org/abs/2406.17305) for more information. Check our [github repo](https://github.com/Emma1066/Retrieval-Augmented-IT-OpenNER) about how to use the model.
 ## Inference
 The template for inference instances is as follows: