leonardlin committed
Commit dd15fee, 1 Parent(s): e7bf625
Update README.md
README.md CHANGED
@@ -9,7 +9,7 @@ base_model: Qwen/Qwen2-7B-Instruct
 ---
 This is a simple [abliterated](https://mlabonne.github.io/blog/posts/2024-06-04_Uncensor_any_LLM_with_abliteration.html) ([refusal-orthoganalized](https://www.alignmentforum.org/posts/jGuXSZgv6qfdhMCuJ/refusal-in-llms-is-mediated-by-a-single-direction)) version of the Qwen2-7B-Instruct model.
 
-See a full writeup here: https://huggingface.co/blog/leonardlin/chinese-censorship-analysis
+See a full writeup here: https://huggingface.co/blog/leonardlin/chinese-llm-censorship-analysis
 
 As Qwen2 is not yet supported by [TransformerLens](https://github.com/TransformerLensOrg/TransformerLens), I started with [Sumandora's refusal code](https://github.com/Sumandora/remove-refusals-with-transformers) as a base.
 