File size: 1,936 Bytes
2bf99a4
849ad66
 
 
2bf99a4
849ad66
 
 
 
 
089b8c2
849ad66
089b8c2
ef35b60
849ad66
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
---

license: cc-by-nc-4.0
language:
- en
---


---

# RA-IT-NER-8B

**Description**: The RA-IT-NER-8B model is trained from LLama3-8B using the proposed Retrieval Augmented Instruction Tuning (RA-IT) approach. This model can be used for English Open NER with and without RAG. The training data is the [Pile-NER-type](https://huggingface.co/datasets/Universal-NER/Pile-NER-type) presented by [UniversalNER](https://arxiv.org/abs/2308.03279). 

Check our [paper](https://arxiv.org/abs/2406.17305) for more information. Check our [github repo](https://github.com/Emma1066/Retrieval-Augmented-IT-OpenNER) about how to use the model.

## Inference
The template for inference instances is as follows:
<div style="background-color: #f6f8fa; padding: 20px; border-radius: 10px; border: 1px solid #e1e4e8; box-shadow: 0 2px 5px rgba(0,0,0,0.1);">
<strong>Prompting template:</strong><br/>
USER: Here are some examples of named entity recognition: <span style="color: #d73a49;">{Fill the NER examples here}</span><br/>
ASSISTANT: I’ve read these examples.<br/>
USER: Text: <span style="color: #d73a49;">{Fill the input text here}</span><br/>
ASSISTANT: I’ve read this text.<br/>
USER: What describes <span style="color: #d73a49;">{Fill the entity type here}</span> in the text?<br/>
ASSISTANT: <span style="color: #0366d6;">(model's predictions in JSON format)</span><br/>
</div>

Note: 
* The model can conduct inference **with and without** NER examples. If you want to conduct inference without examples, just start from the third line in the above template by directly inputting "Text: {input text}" in the "USER" role.
* Inferences are based on one entity type at a time. For multiple entity types, create separate instances for each type.

## License

This model is released under the [CC BY-NC 4.0](https://creativecommons.org/licenses/by-nc/4.0/) license. It is primarily used for research purposes.