munish0838 commited on
Commit
d6031e6
1 Parent(s): 279fb31

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +41 -0
README.md ADDED
@@ -0,0 +1,41 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: llama3
3
+ base_model: OwenArli/ArliAI-Llama-3-8B-Cumulus-v0.3
4
+ pipeline_tag: text-generation
5
+ tags:
6
+ - llama
7
+ - conversational
8
+ ---
9
+
10
+ # QuantFactory/ArliAI-Llama-3-8B-Cumulus-v0.3-GGUF
11
+ This is quantized evrsion of [OwenArli/ArliAI-Llama-3-8B-Cumulus-v0.3](https://huggingface.co/OwenArli/ArliAI-Llama-3-8B-Cumulus-v0.3) created using llama.cpp
12
+
13
+
14
+ # Model Description
15
+ Based on Meta-Llama-3-8b-Instruct, and is governed by Meta Llama 3 License agreement:
16
+ https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct
17
+
18
+
19
+ In terms of reasoning and intelligence, this model is probably a bit worse than the OG model because of the decensoring. However, this model is better at long back and forth chats and will refuse less.
20
+
21
+
22
+ OpenLLM Benchmark:
23
+
24
+
25
+ Training:
26
+ - Full 8192 sequence length.
27
+ - Training duration is around 2 days on an RTX 4090, using 4-bit loading and Qlora 64-rank 64-alpha resulting in ~2% trainable weights.
28
+
29
+
30
+ Instruct format:
31
+ ```
32
+ <|begin_of_text|><|start_header_id|>system<|end_header_id|>
33
+
34
+ {{ system_prompt }}<|eot_id|><|start_header_id|>user<|end_header_id|>
35
+
36
+ {{ user_message_1 }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
37
+
38
+ {{ model_answer_1 }}<|eot_id|><|start_header_id|>user<|end_header_id|>
39
+
40
+ {{ user_message_2 }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
41
+ ```