Added banner.
README.md
CHANGED
@@ -10,6 +10,8 @@ pipeline_tag: text-generation
 library_name: peft
 ---
 # GPT4chan 8B QLoRA
+
+
 This model is [meta-llama/Llama-3.1-8B](https://huggingface.co/meta-llama/Llama-3.1-8B) QLoRA fine-tuned on [v2ray/4chan](https://huggingface.co/datasets/v2ray/4chan) using [QLoRA](https://github.com/LagPixelLOL/qlora).
 
 Trained using 8x H100 with global batch size 64, using 2e-4 learning rate, for 4000 steps, which is approximately 5 epochs.
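
The training figures quoted in the README text imply a rough total for how much data the run consumed. A minimal sketch of that arithmetic, assuming each step processes exactly one global batch of 64 samples; the per-epoch count is only inferred from "approximately 5 epochs" and is not stated in the source:

```python
# Figures quoted in the README text.
global_batch_size = 64
steps = 4000
approx_epochs = 5  # "approximately 5 epochs" per the README

# Assumption: every optimizer step consumes exactly one global batch.
samples_seen = global_batch_size * steps

# Inferred estimate, not a figure from the source: samples per epoch.
approx_samples_per_epoch = samples_seen / approx_epochs

print(f"samples seen during training: {samples_seen}")
print(f"approximate samples per epoch: {approx_samples_per_epoch:.0f}")
```

Because the epoch count is itself approximate, the per-epoch value should be read as an order-of-magnitude estimate rather than the dataset's actual size.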