thu-coai
/

ShieldLM-7B-internlm2

Feature Extraction

Model card Files Files and versions Community

nonstopfor commited on Feb 26

Commit

109523f

•

1 Parent(s): f300599

Create README.md

Files changed (1) hide show

README.md +16 -0

README.md ADDED Viewed

	@@ -0,0 +1,16 @@

+---
+license: mit
+language:
+- en
+- zh
+---
+## Introduction
+The ShieldLM model ([paper link](xxx)) initialized from [internlm2-chat-7b](https://huggingface.co/internlm/internlm2-chat-7b). ShieldLM is a bilingual (Chinese and English) safety detector that mainly aims to help to detect safety issues in LLMs' generations. It aligns with general human safety standards, supports fine-grained customizable detection rules, and provides explanations for its decisions.
+Refer to our [github repository](https://github.com/thu-coai/ShieldLM) for more detailed information.
+## Usage
+Please refer to our [github repository](https://github.com/thu-coai/ShieldLM) for the detailed usage instructions.
+## Performance
+ShieldLM demonstrates impressive detection performance across 4 ID and OOD test sets, compared to strong baselines such as GPT-4, Llama Guard and Perspective API.
+Refer to [our paper](xxx) for more detailed evaluation results.