raincandy-u
commited on
Commit
•
cd7dbcb
1
Parent(s):
c5fdf15
Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,60 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
language:
|
3 |
+
- en
|
4 |
+
pipeline_tag: text-generation
|
5 |
+
tags:
|
6 |
+
- facebook
|
7 |
+
- meta
|
8 |
+
- pytorch
|
9 |
+
- llama
|
10 |
+
- llama-3
|
11 |
+
- moe
|
12 |
+
- code
|
13 |
+
license: other
|
14 |
+
license_name: llama3
|
15 |
+
license_link: LICENSE
|
16 |
+
---
|
17 |
+
# Aplite-Instruct-4x8B-Llama-3
|
18 |
+
|
19 |
+
Aplite-Instruct-4x8B-Llama-3 is a experimental MoE (Mixture of Experts) model based on the Llama-3 architecture using Mergekit.
|
20 |
+
|
21 |
+
## Disclaimer
|
22 |
+
|
23 |
+
This model is a research experiment and may generate incorrect or harmful content. The model's outputs should not be taken as factual or representative of the views of the model's creator or any other individual.
|
24 |
+
|
25 |
+
The model's creator is not responsible for any harm or damage caused by the model's outputs.
|
26 |
+
|
27 |
+
## Merge Details
|
28 |
+
|
29 |
+
```
|
30 |
+
base_model: Meta-Llama-3-8B-Instruct
|
31 |
+
experts:
|
32 |
+
- source_model: Meta-Llama-3-8B-Instruct
|
33 |
+
positive_prompts:
|
34 |
+
- "explain"
|
35 |
+
- "chat"
|
36 |
+
- "assistant"
|
37 |
+
- source_model: Llama3-8B-OpenHermes-DPO
|
38 |
+
positive_prompts:
|
39 |
+
- "python"
|
40 |
+
- "math"
|
41 |
+
- "solve"
|
42 |
+
- "code"
|
43 |
+
- source_model: Llama-3-SLERP-8B
|
44 |
+
positive_prompts:
|
45 |
+
- "chat"
|
46 |
+
- "assistant"
|
47 |
+
- "AI"
|
48 |
+
- source_model: hf-llama3-8b-orpo-v0.0
|
49 |
+
positive_prompts:
|
50 |
+
- "think"
|
51 |
+
- "chat"
|
52 |
+
- "code"
|
53 |
+
- "roleplay"
|
54 |
+
gate_mode: hidden
|
55 |
+
dtype: float16
|
56 |
+
```
|
57 |
+
|
58 |
+
## Join out Discord
|
59 |
+
|
60 |
+
If you'd like to discuss potential collaborations or applications, feel free to reach out to me on Discord: [insert Discord link here]
|