---
base_model:
- meta-llama/Meta-Llama-3.1-8B-Instruct
- grimjim/Llama-3-Instruct-abliteration-LoRA-8B
library_name: transformers
tags:
- mergekit
- merge
license: llama3.1
pipeline_tag: text-generation
---
![](https://lh7-rt.googleusercontent.com/docsz/AD_4nXeiuCm7c8lEwEJuRey9kiVZsRn2W-b4pWlu3-X534V3YmVuVc2ZL-NXg2RkzSOOS2JXGHutDuyyNAUtdJI65jGTo8jT9Y99tMi4H4MqL44Uc5QKG77B0d6-JfIkZHFaUA71-RtjyYZWVIhqsNZcx8-OMaA?key=xt3VSDoCbmTY7o-cwwOFwQ)
# QuantFactory/Llama-3.1-8B-Instruct-abliterated_via_adapter-GGUF
This is a quantized version of [grimjim/Llama-3.1-8B-Instruct-abliterated_via_adapter](https://huggingface.co./grimjim/Llama-3.1-8B-Instruct-abliterated_via_adapter), created using llama.cpp.
# Original Model Card
# Llama-3.1-8B-Instruct-abliterated_via_adapter
This model is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
A LoRA was applied to "abliterate" refusals in [meta-llama/Meta-Llama-3.1-8B-Instruct](https://huggingface.co./meta-llama/Meta-Llama-3.1-8B-Instruct). The result appears to work despite the LoRA having been derived from Llama 3 instead of Llama 3.1, which implies that there is significant feature commonality between the 3 and 3.1 models.
The LoRA was extracted from [failspy/Meta-Llama-3-8B-Instruct-abliterated-v3](https://huggingface.co./failspy/Meta-Llama-3-8B-Instruct-abliterated-v3) using [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co./meta-llama/Meta-Llama-3-8B-Instruct) as a base.
Built with Llama.
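As a rough illustration of what applying a LoRA to a weight matrix means, the sketch below adds a low-rank update `B @ A`, scaled by `alpha / rank`, to a toy weight. The function names, the `alpha` and `rank` values, and the numbers are all hypothetical; this is the standard LoRA update rule in miniature, not the code path that mergekit or any other library actually runs.

```python
def matmul(a, b):
    """Multiply two matrices given as nested lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def apply_lora(weight, lora_a, lora_b, alpha, rank):
    """Return weight + (alpha / rank) * (lora_b @ lora_a)."""
    scale = alpha / rank
    delta = matmul(lora_b, lora_a)  # low-rank update, same shape as weight
    return [
        [w + scale * d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(weight, delta)
    ]


# Toy 2x2 weight with a rank-1 adapter (all values are made up).
weight = [[1.0, 0.0], [0.0, 1.0]]
lora_b = [[1.0], [2.0]]   # shape (2, 1)
lora_a = [[0.5, -0.5]]    # shape (1, 2)

merged = apply_lora(weight, lora_a, lora_b, alpha=1.0, rank=1)
print(merged)
```

Because the adapter only stores a difference relative to its original base, adding it to a different but closely related base (here, Llama 3.1 instead of Llama 3) is mechanically possible whenever the tensor shapes match; whether the result behaves well is an empirical question.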
## Merge Details
### Merge Method
This model was merged using the [task arithmetic](https://arxiv.org/abs/2212.04089) merge method, with [meta-llama/Meta-Llama-3.1-8B-Instruct](https://huggingface.co./meta-llama/Meta-Llama-3.1-8B-Instruct) + [grimjim/Llama-3-Instruct-abliteration-LoRA-8B](https://huggingface.co./grimjim/Llama-3-Instruct-abliteration-LoRA-8B) as the base.
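For readers unfamiliar with task arithmetic, here is a minimal sketch of the idea from the linked paper: each fine-tuned model contributes a "task vector" (its weights minus the base weights), and the weighted task vectors are added back onto the base. The `task_arithmetic` function, the tensor name, and the values are hypothetical and use plain Python lists; mergekit's real implementation operates on full model tensors and may differ in detail.

```python
def task_arithmetic(base, models, weights):
    """Return base + sum_i weights[i] * (models[i] - base), per tensor."""
    merged = {}
    for name, base_tensor in base.items():
        out = list(base_tensor)
        for model, weight in zip(models, weights):
            for i, (m, b) in enumerate(zip(model[name], base_tensor)):
                out[i] += weight * (m - b)  # add the weighted task vector
        merged[name] = out
    return merged


# Toy "models" with a single three-element tensor (values are made up).
base = {"layer.weight": [1.0, 2.0, 3.0]}
finetuned = {"layer.weight": [1.5, 2.0, 2.0]}

full = task_arithmetic(base, [finetuned], [1.0])  # weight 1.0 reproduces finetuned
half = task_arithmetic(base, [finetuned], [0.5])  # weight 0.5 lands halfway
same = task_arithmetic(base, [base], [1.0])       # zero task vector leaves base unchanged
print(full, half, same)
```

The last call is relevant here: when the only source model is identical to the base, its task vector is zero and the output is simply the base, which in this card is the Llama 3.1 Instruct model with the LoRA already applied.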
### Configuration
The following YAML configuration was used to produce this model:
```yaml
base_model: meta-llama/Meta-Llama-3.1-8B-Instruct+grimjim/Llama-3-Instruct-abliteration-LoRA-8B
dtype: bfloat16
merge_method: task_arithmetic
parameters:
  normalize: false
slices:
- sources:
  - layer_range: [0, 32]
    model: meta-llama/Meta-Llama-3.1-8B-Instruct+grimjim/Llama-3-Instruct-abliteration-LoRA-8B
    parameters:
      weight: 1.0
```
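The `model` and `base_model` values in this configuration join two repository names with a `+`, which reads as "this base model with this LoRA applied". The helper below is a hypothetical illustration of how such a reference splits into its two parts; it is not mergekit's own parser, and `split_model_ref` is an invented name.

```python
def split_model_ref(ref):
    """Split 'base+adapter' into (base, adapter); adapter is None if absent."""
    base, sep, adapter = ref.partition("+")
    return base, (adapter if sep else None)


ref = (
    "meta-llama/Meta-Llama-3.1-8B-Instruct"
    "+grimjim/Llama-3-Instruct-abliteration-LoRA-8B"
)
base_repo, adapter_repo = split_model_ref(ref)
print(base_repo)     # the full model the adapter is applied to
print(adapter_repo)  # the LoRA adapter repository
```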