munish0838
commited on
Commit
•
2c52a4f
1
Parent(s):
6e527d7
Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,41 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: llama3
|
3 |
+
pipeline_tag: text-generation
|
4 |
+
base_model: OwenArli/ArliAI-Llama-3-8B-Dolfin-v0.3
|
5 |
+
tags:
|
6 |
+
- llama
|
7 |
+
- conversational
|
8 |
+
---
|
9 |
+
|
10 |
+
# QuantFactory/ArliAI-Llama-3-8B-Dolfin-v0.3-GGUF
|
11 |
+
This is quantized version of [OwenArli/ArliAI-Llama-3-8B-Dolfin-v0.3](https://huggingface.co/OwenArli/ArliAI-Llama-3-8B-Dolfin-v0.3) created using llama.cpp
|
12 |
+
|
13 |
+
# Model Description
|
14 |
+
Based on Meta-Llama-3-8b-Instruct, and is governed by Meta Llama 3 License agreement:
|
15 |
+
https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct
|
16 |
+
|
17 |
+
|
18 |
+
This is a fine tune using an improved Dolphin and WizardLM dataset intended to make the model follow instructions better and refuse less. OpenLLM benchmark is running...
|
19 |
+
|
20 |
+
|
21 |
+
OpenLLM Benchmark:
|
22 |
+
|
23 |
+
|
24 |
+
|
25 |
+
Training:
|
26 |
+
- 2048 sequence length since the dataset has an average length of under 1000 tokens, while the base model is 8192 sequence length. From testing it still performs the same 8192 context just fine.
|
27 |
+
- Training duration is around 1 days on 2xRTX 3090, using 4-bit loading and Qlora 64-rank 128-alpha resulting in ~2% trainable weights.
|
28 |
+
|
29 |
+
|
30 |
+
Instruct format:
|
31 |
+
```
|
32 |
+
<|begin_of_text|><|start_header_id|>system<|end_header_id|>
|
33 |
+
|
34 |
+
{{ system_prompt }}<|eot_id|><|start_header_id|>user<|end_header_id|>
|
35 |
+
|
36 |
+
{{ user_message_1 }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
|
37 |
+
|
38 |
+
{{ model_answer_1 }}<|eot_id|><|start_header_id|>user<|end_header_id|>
|
39 |
+
|
40 |
+
{{ user_message_2 }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
|
41 |
+
```
|