Foreshhh commited on
Commit
5abb0c1
1 Parent(s): b91f0a1

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +46 -0
README.md CHANGED
@@ -1,3 +1,49 @@
1
  ---
2
  license: apache-2.0
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: apache-2.0
3
+ datasets:
4
+ - lmsys/toxic-chat
5
+ - PKU-Alignment/BeaverTails
6
+ - lmsys/lmsys-chat-1m
7
+ language:
8
+ - en
9
+ metrics:
10
+ - f1
11
+ - accuracy
12
+ tags:
13
+ - ai-safety
14
+ - safetyguard
15
+ - safety
16
+ - benchmark
17
+ - mistral
18
+ - salad-bench
19
+ - evluation
20
  ---
21
+ # MD-Judge for Salad-Bench
22
+
23
+
24
+ ## Model Details
25
+
26
+ MD-Judge is a LLM-based safetyguard, fine-tund on top of [Mistral-7B](https://huggingface.co/mistralai/Mistral-7B-v0.1). MD-Judge serves as a classifier to evaluate the safety of QA pairs.
27
+
28
+ MD-Judge was born to study the safety of different LLMs serving as an general evaluation tool, which is proposed under the [SALAD-Bench paper]()
29
+
30
+ - **Developed by:** The SALAD-Bench Team
31
+ - **Model type:** An auto-regressive language model based on the transformer architecture.
32
+
33
+ ## Model Sources
34
+
35
+ - **Repository:** [SALAD-Bench Github](https://github.com/OpenSafetyLab/SALAD-BENCH)
36
+ - **Dataset:** Coming soon
37
+ - **Paper:** Coming soon
38
+
39
+ ## Uses
40
+ Please refer to our [Github](https://github.com/OpenSafetyLab/SALAD-BENCH) for more using examples
41
+
42
+ ```python
43
+
44
+ ```
45
+
46
+ ## Citation
47
+
48
+ **BibTeX:**
49
+