augmxnt
/

Qwen2-7B-Instruct-deccp

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

leonardlin commited on Jun 9, 2024

Commit

71e7dad

·

verified ·

1 Parent(s): 065998a

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -9,6 +9,8 @@ base_model: Qwen/Qwen2-7B-Instruct
 ---
 This is a simple [abliterated](https://mlabonne.github.io/blog/posts/2024-06-04_Uncensor_any_LLM_with_abliteration.html) ([refusal-orthoganalized](https://www.alignmentforum.org/posts/jGuXSZgv6qfdhMCuJ/refusal-in-llms-is-mediated-by-a-single-direction)) version of the Qwen2-7B-Instruct model.
 As Qwen2 is not yet supported by [TransformerLens](https://github.com/TransformerLensOrg/TransformerLens), so I used [Sumandora's refusal code](https://github.com/Sumandora/remove-refusals-with-transformers) as a base.
 All code related to this project is here: https://github.com/AUGMXNT/deccp

 ---
 This is a simple [abliterated](https://mlabonne.github.io/blog/posts/2024-06-04_Uncensor_any_LLM_with_abliteration.html) ([refusal-orthoganalized](https://www.alignmentforum.org/posts/jGuXSZgv6qfdhMCuJ/refusal-in-llms-is-mediated-by-a-single-direction)) version of the Qwen2-7B-Instruct model.
+See a full writeup here: https://huggingface.co/blog/leonardlin/chinese-censorship-analysis
 As Qwen2 is not yet supported by [TransformerLens](https://github.com/TransformerLensOrg/TransformerLens), so I used [Sumandora's refusal code](https://github.com/Sumandora/remove-refusals-with-transformers) as a base.
 All code related to this project is here: https://github.com/AUGMXNT/deccp