Create README.md
README.md
ADDED
@@ -0,0 +1,73 @@
---
license: cc-by-nc-4.0
tags:
- not-for-all-audiences
- nsfw
---

## Lumimaid 0.1

<center><div style="width: 100%;">
<img src="https://cdn-uploads.huggingface.co/production/uploads/630dfb008df86f1e5becadc3/d3QMaxy3peFTpSlWdWF-k.png" style="display: block; margin: auto;">
</div></center>

**NOTICE: GGUF quants seem to be broken at the moment; we will reupload once they are fixed.**

This model uses the Llama3 **prompting format**.

This is Llama3 trained on our RP datasets. We tried to strike a balance between ERP and RP: not too horny, but just enough.

We also added some non-RP datasets to make the model less dumb overall. The mix should be roughly a 40%/60% ratio of non-RP to RP+ERP data.

This model includes the new Luminae dataset from Ikari.

If you try this model, please give us some feedback, either on the Community tab on Hugging Face or on our [Discord Server](https://discord.gg/MtCVRWTZXY).

## Credits:
- Undi
- IkariDev

## Description

This repo contains GGUF files of Lumimaid-8B-v0.1.

Switch: [8B](https://huggingface.co/NeverSleep/Llama-3-Lumimaid-8B-v0.1-GGUF) - [70B](https://huggingface.co/NeverSleep/Llama-3-Lumimaid-70B-v0.1-GGUF) - [70B-alt](https://huggingface.co/NeverSleep/Llama-3-Lumimaid-70B-alt-v0.1-GGUF)

## Training data used:
- [Aesir datasets](https://huggingface.co/MinervaAI)
- [NoRobots](https://huggingface.co/datasets/Doctor-Shotgun/no-robots-sharegpt)
- [limarp](https://huggingface.co/datasets/lemonilia/LimaRP) - 8k ctx
- [toxic-dpo-v0.1-sharegpt](https://huggingface.co/datasets/Undi95/toxic-dpo-v0.1-sharegpt)
- [ToxicQAFinal](https://huggingface.co/datasets/NobodyExistsOnTheInternet/ToxicQAFinal)
- Luminae-i1 (70B/70B-alt; i2 did not exist yet when the 70B started training) | Luminae-i2 (8B; this one gave better results on the 8B) - Ikari's dataset
- [Squish42/bluemoon-fandom-1-1-rp-cleaned](https://huggingface.co/datasets/Squish42/bluemoon-fandom-1-1-rp-cleaned) - 50% (randomly)
- [NobodyExistsOnTheInternet/PIPPAsharegptv2test](https://huggingface.co/datasets/NobodyExistsOnTheInternet/PIPPAsharegptv2test) - 5% (randomly)
- [cgato/SlimOrcaDedupCleaned](https://huggingface.co/datasets/cgato/SlimOrcaDedupCleaned) - 5% (randomly)
- Airoboros (reduced)
- [Capybara](https://huggingface.co/datasets/Undi95/Capybara-ShareGPT/) (reduced)

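
The README does not say how the random subsets (50%, 5%) were drawn. As a rough illustration only, and assuming the percentages mean a uniform random fraction of each dataset's rows taken without replacement, the idea could be sketched like this. The `subsample` helper and its `seed` parameter are hypothetical and are not part of the actual training pipeline.

```python
import random


def subsample(rows, fraction, seed=0):
    """Return a random subset holding roughly `fraction` of `rows`.

    Hypothetical helper: assumes uniform sampling without replacement,
    which the README does not confirm.
    """
    rng = random.Random(seed)          # fixed seed so the subset is reproducible
    k = round(len(rows) * fraction)    # number of rows to keep
    return rng.sample(rows, k)


# Toy stand-in for a dataset; real rows would be conversation records.
rows = list(range(100))
half = subsample(rows, 0.5)            # e.g. the "50% (randomly)" entry
small = subsample(rows, 0.05)          # e.g. the "5% (randomly)" entries
```

A fixed seed keeps the selection repeatable between runs, which helps when comparing finetunes trained on the same mix.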
## Models used (only for 8B)

- Initial LumiMaid 8B Finetune
- Undi95/Llama-3-Unholy-8B-e4
- Undi95/Llama-3-LewdPlay-8B

## Prompt template: Llama3

```
<|begin_of_text|><|start_header_id|>system<|end_header_id|>

{system_prompt}<|eot_id|><|start_header_id|>user<|end_header_id|>

{input}<|eot_id|><|start_header_id|>assistant<|end_header_id|>

{output}<|eot_id|>
```

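
To make the template concrete, here is a minimal sketch that fills it in for a single system and user turn and leaves the assistant header open so the model can generate `{output}`. The function name `build_llama3_prompt` is an assumption for illustration, not something shipped with this repo. Some inference backends may add `<|begin_of_text|>` themselves, in which case it should be dropped from the string.

```python
def build_llama3_prompt(system_prompt: str, user_input: str) -> str:
    """Format one system + user turn in the Llama3 template shown above.

    Hypothetical helper: the prompt ends right after the assistant header,
    so the model's reply takes the place of {output}.
    """
    return (
        "<|begin_of_text|><|start_header_id|>system<|end_header_id|>\n\n"
        f"{system_prompt}<|eot_id|><|start_header_id|>user<|end_header_id|>\n\n"
        f"{user_input}<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\n"
    )


prompt = build_llama3_prompt("You are a helpful roleplay partner.", "Hello!")
```

Generation would normally be stopped at the next `<|eot_id|>` token, which closes the assistant turn.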
## Others

Undi: If you want to support us, you can do so [here](https://ko-fi.com/undiai).

IkariDev: Visit my [retro/neocities style website](https://ikaridevgit.github.io/) please kek