gbueno86 committed on
Commit
fb61af4
1 Parent(s): 1bdc5ab

Upload README.md with huggingface_hub

  1. README.md +45 -3
README.md CHANGED
@@ -1,3 +1,45 @@
- ---
- license: llama3.1
- ---
+ ---
+ base_model: []
+ library_name: transformers
+ tags:
+ - mergekit
+ - merge
+
+ ---
+ # Brinebreath-Llama-70B
+
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
+
+ ## Merge Details
+ ### Merge Method
+
+ This model was merged using the SLERP merge method.
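
For readers unfamiliar with SLERP (spherical linear interpolation), the following is a minimal pure-Python sketch of the idea on flat vectors. It is not mergekit's implementation: the function name `slerp`, the `eps` threshold, and the fallback to plain linear interpolation for near-parallel or zero vectors are assumptions made for illustration.

```python
import math


def slerp(t, v0, v1, eps=1e-8):
    """Spherically interpolate between two equal-length vectors.

    t = 0 returns v0 and t = 1 returns v1 (illustrative convention).
    """
    norm0 = math.sqrt(sum(x * x for x in v0))
    norm1 = math.sqrt(sum(x * x for x in v1))

    def lerp():
        # Plain linear interpolation, used when the angle is undefined or tiny.
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]

    if norm0 < eps or norm1 < eps:
        return lerp()

    # Cosine of the angle between the two directions, clamped for acos.
    cos_theta = sum(a * b for a, b in zip(v0, v1)) / (norm0 * norm1)
    cos_theta = max(-1.0, min(1.0, cos_theta))
    theta = math.acos(cos_theta)
    sin_theta = math.sin(theta)

    if abs(sin_theta) < eps:
        return lerp()

    # Weights follow the arc between the two vectors rather than the chord.
    w0 = math.sin((1 - t) * theta) / sin_theta
    w1 = math.sin(t * theta) / sin_theta
    return [w0 * a + w1 * b for a, b in zip(v0, v1)]
```

In a weight merge the same formula would be applied per tensor, with each tensor flattened to a vector; unlike a straight average, the interpolated point between two unit vectors stays on the unit sphere.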
17
+
18
+ ### Models Merged
19
+
20
+ The following models were included in the merge:
21
+ * /Volumes/external/VAGOsolutions_Llama-3-SauerkrautLM-70b-Instruct
22
+ * /Volumes/external2/models/Drake-Llama-3.1-70B
23
+
24
+ ### Configuration
25
+
26
+ The following YAML configuration was used to produce this model:
27
+
+ ```yaml
+ base_model: /Volumes/external/VAGOsolutions_Llama-3-SauerkrautLM-70b-Instruct
+ dtype: bfloat16
+ merge_method: slerp
+ parameters:
+   t:
+   - filter: self_attn
+     value: [0.0, 0.5, 0.3, 0.7, 1.0]
+   - filter: mlp
+     value: [1.0, 0.5, 0.7, 0.3, 0.0]
+   - value: 0.5
+ slices:
+ - sources:
+   - layer_range: [0, 80]
+     model: /Volumes/external/VAGOsolutions_Llama-3-SauerkrautLM-70b-Instruct
+   - layer_range: [0, 80]
+     model: /Volumes/external2/models/Drake-Llama-3.1-70B
+ ```
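
The `t` entries in the configuration above give a different interpolation factor per tensor type and per layer. The sketch below shows one plausible reading of those lists, assuming the anchor values are spaced evenly over the 80 layers and linearly interpolated, and that the first matching filter wins. The helper names `gradient_value` and `t_for_tensor` are hypothetical, the tensor names are illustrative, and mergekit's actual evaluation may differ in detail.

```python
def gradient_value(anchors, layer_idx, num_layers):
    """Linearly interpolate evenly spaced anchor values at a layer's relative depth."""
    if num_layers < 2:
        return float(anchors[0])
    depth = layer_idx / (num_layers - 1)   # 0.0 at the first layer, 1.0 at the last
    scaled = depth * (len(anchors) - 1)    # position along the anchor list
    lo = int(scaled)
    hi = min(lo + 1, len(anchors) - 1)
    frac = scaled - lo
    return (1 - frac) * anchors[lo] + frac * anchors[hi]


def t_for_tensor(tensor_name, layer_idx, num_layers=80):
    """Return the interpolation factor t for one tensor (assumed first-match semantics)."""
    # Rules copied from the `parameters.t` list in the configuration above.
    rules = [
        ("self_attn", [0.0, 0.5, 0.3, 0.7, 1.0]),
        ("mlp", [1.0, 0.5, 0.7, 0.3, 0.0]),
        (None, 0.5),  # default for every other tensor
    ]
    for name_filter, value in rules:
        if name_filter is None or name_filter in tensor_name:
            if isinstance(value, list):
                return gradient_value(value, layer_idx, num_layers)
            return float(value)
    raise ValueError("no rule matched")


if __name__ == "__main__":
    for idx in (0, 20, 40, 60, 79):
        attn = t_for_tensor(f"model.layers.{idx}.self_attn.q_proj.weight", idx)
        mlp = t_for_tensor(f"model.layers.{idx}.mlp.up_proj.weight", idx)
        print(idx, round(attn, 3), round(mlp, 3))
```

Under this reading, the attention and MLP schedules mirror each other: where the attention weights lean toward one model, the MLP weights of the same layer lean toward the other, and all remaining tensors use a fixed `t` of 0.5.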