art-from-the-machine committed on
Commit
10f2d6b
1 Parent(s): 73fc63d

Update README.md

  1. README.md +121 -4
README.md CHANGED
@@ -7,17 +7,134 @@ tags:
7
  - transformers
8
  - unsloth
9
  - llama
10
- - trl
11
- - sft
12
  base_model: unsloth/llama-3-8b-Instruct-bnb-4bit
13
  ---
14
 
15
- # Uploaded model
 
16
 
17
  - **Developed by:** art-from-the-machine
18
  - **License:** apache-2.0
19
- - **Finetuned from model :** unsloth/llama-3-8b-Instruct-bnb-4bit
20
 
 
21
  This Llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Hugging Face's TRL library.
22
 
23
  [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
 
7
  - transformers
8
  - unsloth
9
  - llama
10
+ - gguf
 
11
  base_model: unsloth/llama-3-8b-Instruct-bnb-4bit
12
  ---
13
 
14
+ # Skyrim Fine-Tune for Meta Llama 3 8B Instruct
15
+ [<img src="https://staticdelivery.nexusmods.com/mods/1704/images/headers/98631_1692410778.jpg" width="1000"/>](https://art-from-the-machine.github.io/Mantella/)
16
 
17
+ GGUF models: [https://huggingface.co/art-from-the-machine/Mantella-Skyrim-Llama-3-8B-GGUF](https://huggingface.co/art-from-the-machine/Mantella-Skyrim-Llama-3-8B-GGUF)
18
+
19
+ ## Introduction
20
+ This is a fine-tune of Meta's Llama 3 8B Instruct model, tailored for use with the [Mantella Skyrim mod](https://art-from-the-machine.github.io/Mantella/).
21
+
22
+ ## Dataset
23
+ This fine-tune was trained on an Alpaca-style dataset of 8,800+ player <-> NPC interactions. Details on the dataset can be found in the [Mantella LLM Fine-Tuning](https://github.com/art-from-the-machine/Mantella-LLM-Fine-Tuning) repo.
24
+
25
+ Example:
26
+
27
+ ```json
28
+ {
29
+ "instruction": "Generate dialogue in the style of Skyrim.",
30
+ "input": "Where can I find fire salts?",
31
+ "output": "A flame atronach's body might provide fire salt. They're dangerous creatures that can be summoned by wizards. Of course, it would be much easier to check with an alchemist. They occasionally have them for sale."
32
+ }
33
+ ```
34
+
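As a rough illustration of how a record like the one above could be turned into a single training string, here is a minimal Python sketch. It assumes the widely used Stanford Alpaca prompt layout; the README does not state which template this fine-tune actually used, so `ALPACA_TEMPLATE` and `format_record` are hypothetical names for illustration only.

```python
import json

# The widely used Stanford Alpaca prompt layout. Whether this fine-tune used
# exactly this template is an assumption, not something the README states.
ALPACA_TEMPLATE = (
    "Below is an instruction that describes a task, paired with an input "
    "that provides further context. Write a response that appropriately "
    "completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Input:\n{input}\n\n"
    "### Response:\n{output}"
)


def format_record(record: dict) -> str:
    """Render one Alpaca-style record (instruction/input/output) as text."""
    return ALPACA_TEMPLATE.format(
        instruction=record["instruction"],
        input=record["input"],
        output=record["output"],
    )


# The example record from the Dataset section, parsed from JSON.
record = json.loads("""
{
  "instruction": "Generate dialogue in the style of Skyrim.",
  "input": "Where can I find fire salts?",
  "output": "A flame atronach's body might provide fire salt. They're dangerous creatures that can be summoned by wizards. Of course, it would be much easier to check with an alchemist. They occasionally have them for sale."
}
""")

print(format_record(record))
```

The three keys map directly onto the three sections of the prompt, which is what makes the Alpaca layout convenient for single-turn player and NPC exchanges like this one.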
35
+ ## Specifications
36
  - **Developed by:** art-from-the-machine
37
  - **License:** apache-2.0
38
+ - **Fine-tuned from model:** unsloth/llama-3-8b-Instruct-bnb-4bit
39
+ - **Context length:** 8,192
40
+
41
+ ## Example Output
42
+ The fine-tuned [Mantella-Skyrim-Llama-3-8B-Q4_K_M.gguf](https://huggingface.co/art-from-the-machine/Mantella-Skyrim-Llama-3-8B-GGUF/blob/main/Mantella-Skyrim-Llama-3-8B-Q4_K_M.gguf) model was tested in Mantella v0.11.2's debug mode, running locally in koboldcpp, with Hulda as the test character. The same list of questions was put to both the fine-tuned model and an equivalent [Q4_K_M Llama 3 Instruct](https://huggingface.co/bartowski/Meta-Llama-3-8B-Instruct-GGUF/blob/main/Meta-Llama-3-8B-Instruct-Q4_K_M.gguf) base model for comparison. For both models, Mantella's `max_response_sentences` setting was set to 4 and `temperature` to 0.4.
43
+ <details>
44
+ <summary><b>Mantella-Skyrim-Llama-3-8B-Q4_K_M.gguf</b></summary>
45
+
46
+ ```json
47
+ [
48
+ {
49
+ "role": "assistant",
50
+ "content": "Ah, welcome to the Bannered Mare! Take a seat by the fire and I'll have one of the girls bring you a drink. What can I get for you?"
51
+ },
52
+ {
53
+ "role": "user",
54
+ "content": "What's life like in Whiterun?"
55
+ },
56
+ {
57
+ "role": "assistant",
58
+ "content": "It's a good life. The people are friendly, the food is good, and the Jarl is fair. Of course, there are some who grumble about the Empire, but I think they're just jealous of the Jarl's fine wine cellar. What about you? What brings you to Whiterun?"
59
+ },
60
+ {
61
+ "role": "user",
62
+ "content": "What do you think about the civil war in Skyrim?"
63
+ },
64
+ {
65
+ "role": "assistant",
66
+ "content": "I think it's a shame. I've lived in Whiterun all my life, and I've seen the Empire bring peace and prosperity to our land. The Stormcloaks just want to tear everything down and start over. I don't think that's the way to build a better future. But what do I know? I'm just a simple innkeeper."
67
+ },
68
+ {
69
+ "role": "user",
70
+ "content": "I'm looking for work. Got any leads?"
71
+ },
72
+ {
73
+ "role": "assistant",
74
+ "content": "Well, the Companions are always looking for new recruits. They're a bit rough, but they're good people. And of course, there's the Jarl's court. If you're looking for work, you might want to try talking to his steward. He's always looking for someone to do odd jobs for him."
75
+ },
76
+ {
77
+ "role": "user",
78
+ "content": "I'm looking for an Elder Scroll. Do you know anything about them?"
79
+ },
80
+ {
81
+ "role": "assistant",
82
+ "content": "Elder Scrolls? Ha! You must be joking. Those are ancient relics, hidden away in the ruins of old. You won't find one in Whiterun, that's for sure. If you're looking for something like that, you should try talking to the Greybeards. They're the ones who live in the mountains and meditate all day."
83
+ }
84
+ ]
85
+ ```
86
+ </details>
87
+ <br>
88
+
89
+ <details>
90
+ <summary><b>Meta-Llama-3-8B-Instruct-Q4_K_M.gguf</b></summary>
91
+
92
+ ```json
93
+ [
94
+ {
95
+ "role": "assistant",
96
+ "content": "Ah, welcome to The Bannered Mare! Have a seat by the fire, and I'll have one of the barmaids bring you a pint of our finest ale. What brings you to Whiterun today?"
97
+ },
98
+ {
99
+ "role": "user",
100
+ "content": "What's life like in Whiterun?"
101
+ },
102
+ {
103
+ "role": "assistant",
104
+ "content": "Life in Whiterun is good, thank you for asking. The Jarl's been doing a fine job of keeping the city prosperous. Of course, there's always something going on, whether it's the Companions looking for new members or the Thieves Guild trying to... well, steal things. But overall, it's a good place to live."
105
+ },
106
+ {
107
+ "role": "user",
108
+ "content": "What do you think about the civil war in Skyrim?"
109
+ },
110
+ {
111
+ "role": "assistant",
112
+ "content": "The civil war? Well, I think it's a shame, to be honest. I've got friends on both sides of the conflict, and it's hard to see them fighting each other. But I suppose it's not my place to say who's right or wrong. I just hope it ends soon and we can all get back to living our lives in peace."
113
+ },
114
+ {
115
+ "role": "user",
116
+ "content": "I'm looking for work. Got any leads?"
117
+ },
118
+ {
119
+ "role": "assistant",
120
+ "content": "Work, you say? Well, the city's always looking for people to help with the harvest, and the Jarl's guards could use some extra hands. If you're looking for something more... adventurous, you might want to talk to the Companions. They're always looking for new members to join their ranks. Just be warned, though - it's not for the faint of heart."
121
+ },
122
+ {
123
+ "role": "user",
124
+ "content": "I'm looking for an Elder Scroll. Do you know anything about them?"
125
+ },
126
+ {
127
+ "role": "assistant",
128
+ "content": "Elder Scrolls? Ah, yes... I've heard rumors about those ancient artifacts. Some say they hold the secrets of the world, while others claim they're cursed. I don't know much about them, but I've heard that the Greybeards in the mountains might know more. They're a reclusive bunch, but if you're willing to brave the wilderness, you might be able to learn something from them."
129
+ }
130
+ ]
131
+ ```
132
+ </details>
133
+
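Both transcripts above share the same shape: a JSON list of messages with `role` and `content` keys. As a minimal sketch of how such a transcript could be read for side-by-side comparison, the hypothetical helper `pair_turns` below groups each user question with the reply that follows it. The sample data is an abbreviated stand-in with truncated replies, not the full transcript.

```python
import json


def pair_turns(messages: list[dict]) -> list[tuple[str, str]]:
    """Return (user question, assistant reply) pairs from a chat transcript.

    An assistant message with no preceding user message (such as the opening
    greeting in the transcripts) is skipped.
    """
    pairs = []
    pending_question = None
    for message in messages:
        if message["role"] == "user":
            pending_question = message["content"]
        elif message["role"] == "assistant" and pending_question is not None:
            pairs.append((pending_question, message["content"]))
            pending_question = None
    return pairs


# Abbreviated stand-in for one transcript; the replies are truncated here.
transcript = json.loads("""
[
  {"role": "assistant", "content": "Ah, welcome to the Bannered Mare!"},
  {"role": "user", "content": "What's life like in Whiterun?"},
  {"role": "assistant", "content": "It's a good life."},
  {"role": "user", "content": "I'm looking for work. Got any leads?"},
  {"role": "assistant", "content": "Well, the Companions are always looking for new recruits."}
]
""")

for question, reply in pair_turns(transcript):
    print(f"Q: {question}\nA: {reply}\n")
```

Running the same helper over both models' transcripts gives two lists in the same question order, so the replies can be compared one question at a time.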
134
+ ## How to use with Mantella
135
+ See [here](https://art-from-the-machine.github.io/Mantella/pages/installation.html#local-models) for full instructions on how to run Mantella with a local LLM.
136
 
137
+ ## Notes
138
  This Llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Hugging Face's TRL library.
139
 
140
  [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)