---
base_model: nRuaif/Kimiko-Mistral-7B
inference: false
license: apache-2.0
model-index:
- name: Kimiko-Mistral-7B
  results: []
model_creator: nRuaif
model_name: Kimiko Mistral 7B
model_type: mistral
prompt_template: 'You are a helpful AI assistant.


  USER: {prompt}

  ASSISTANT:

  '
quantized_by: TheBloke
tags:
- generated_from_trainer
---

<!-- header start -->
<!-- 200823 -->
<div style="width: auto; margin-left: auto; margin-right: auto">
<img src="https://i.imgur.com/EBdldam.jpg" alt="TheBlokeAI" style="width: 100%; min-width: 400px; display: block; margin: auto;">
</div>
<div style="display: flex; justify-content: space-between; width: 100%;">
    <div style="display: flex; flex-direction: column; align-items: flex-start;">
        <p style="margin-top: 0.5em; margin-bottom: 0em;"><a href="https://discord.gg/theblokeai">Chat & support: TheBloke's Discord server</a></p>
    </div>
    <div style="display: flex; flex-direction: column; align-items: flex-end;">
        <p style="margin-top: 0.5em; margin-bottom: 0em;"><a href="https://www.patreon.com/TheBlokeAI">Want to contribute? TheBloke's Patreon page</a></p>
    </div>
</div>
<div style="text-align:center; margin-top: 0em; margin-bottom: 0em"><p style="margin-top: 0.25em; margin-bottom: 0em;">TheBloke's LLM work is generously supported by a grant from <a href="https://a16z.com">andreessen horowitz (a16z)</a></p></div>
<hr style="margin-top: 1.0em; margin-bottom: 1.0em;">
<!-- header end -->

# Kimiko Mistral 7B - FP16
- Model creator: [nRuaif](https://huggingface.co/nRuaif)
- Original model: [Kimiko Mistral 7B](https://huggingface.co/nRuaif/Kimiko-Mistral-7B)

<!-- description start -->
## Description

This repo contains pytorch format fp16 model files for [nRuaif's Kimiko Mistral 7B](https://huggingface.co/nRuaif/Kimiko-Mistral-7B).

It is the result of either merging a LoRA or converting the source repository to float16.

<!-- description end -->
<!-- repositories-available start -->
## Repositories available

* [AWQ model(s) for GPU inference.](https://huggingface.co/TheBloke/Kimiko-Mistral-7B-AWQ)
* [GPTQ models for GPU inference, with multiple quantisation parameter options.](https://huggingface.co/TheBloke/Kimiko-Mistral-7B-GPTQ)
* [2, 3, 4, 5, 6 and 8-bit GGUF models for CPU+GPU inference](https://huggingface.co/TheBloke/Kimiko-Mistral-7B-GGUF)
* [Unquantised fp16 model in pytorch format, for GPU inference and for further conversions](https://huggingface.co/TheBloke/Kimiko-Mistral-7B-fp16)
* [nRuaif's original LoRA adapter, which can be merged onto the base model.](https://huggingface.co/nRuaif/Kimiko-Mistral-7B)

<!-- repositories-available end -->

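As a quick sanity check that the fp16 files load, here is a minimal sketch using 🤗 Transformers. The `torch_dtype` and `device_map` choices are illustrative, and assume a GPU with roughly 15 GB of free VRAM for a 7B model in float16:

```python
# Minimal sketch: load the unquantised fp16 weights with Transformers.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "TheBloke/Kimiko-Mistral-7B-fp16"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # weights are shipped in fp16
    device_map="auto",          # place layers on available GPU(s); needs the accelerate package
)
```
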
<!-- prompt-template start -->
## Prompt template: Vicuna-Short

```
You are a helpful AI assistant.

USER: {prompt}
ASSISTANT:

```

<!-- prompt-template end -->
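
For example, here is a short sketch of filling this template and generating with the model and tokenizer loaded above; the sampling settings are illustrative, not tuned recommendations:

```python
# Fill the Vicuna-Short template and generate a reply.
prompt = "What is 4x8?"
text = (
    "You are a helpful AI assistant.\n"
    "\n"
    f"USER: {prompt}\n"
    "ASSISTANT:"
)

inputs = tokenizer(text, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=128, do_sample=True, temperature=0.7)
# Decode only the newly generated tokens, not the echoed prompt.
print(tokenizer.decode(output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```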


<!-- footer start -->
<!-- 200823 -->
## Discord

For further support, and discussions on these models and AI in general, join us at:

[TheBloke AI's Discord server](https://discord.gg/theblokeai)

## Thanks, and how to contribute

Thanks to the [chirper.ai](https://chirper.ai) team!

Thanks to Clay from [gpus.llm-utils.org](https://gpus.llm-utils.org)!

I've had a lot of people ask if they can contribute. I enjoy providing models and helping people, and would love to be able to spend even more time doing it, as well as expanding into new projects like fine tuning/training.

If you're able and willing to contribute it will be most gratefully received and will help me to keep providing more models, and to start work on new AI projects.

Donators will get priority support on any and all AI/LLM/model questions and requests, access to a private Discord room, plus other benefits.

* Patreon: https://patreon.com/TheBlokeAI
* Ko-Fi: https://ko-fi.com/TheBlokeAI

**Special thanks to**: Aemon Algiz.

**Patreon special mentions**: Pierre Kircher, Stanislav Ovsiannikov, Michael Levine, Eugene Pentland, Andrey, 준교 김, Randy H, Fred von Graf, Artur Olbinski, Caitlyn Gatomon, terasurfer, Jeff Scroggin, James Bentley, Vadim, Gabriel Puliatti, Harry Royden McLaughlin, Sean Connelly, Dan Guido, Edmond Seymore, Alicia Loh, subjectnull, AzureBlack, Manuel Alberto Morcote, Thomas Belote, Lone Striker, Chris Smitley, Vitor Caleffi, Johann-Peter Hartmann, Clay Pascal, biorpg, Brandon Frisco, sidney chen, transmissions 11, Pedro Madruga, jinyuan sun, Ajan Kanaga, Emad Mostaque, Trenton Dambrowitz, Jonathan Leane, Iucharbius, usrbinkat, vamX, George Stoitzev, Luke Pendergrass, theTransient, Olakabola, Swaroop Kallakuri, Cap'n Zoog, Brandon Phillips, Michael Dempsey, Nikolai Manek, danny, Matthew Berman, Gabriel Tamborski, alfie_i, Raymond Fosdick, Tom X Nguyen, Raven Klaugh, LangChain4j, Magnesian, Illia Dulskyi, David Ziegler, Mano Prime, Luis Javier Navarrete Lozano, Erik Bjäreholt, 阿明, Nathan Dryer, Alex, Rainer Wilmers, zynix, TL, Joseph William Delisle, John Villwock, Nathan LeClaire, Willem Michiel, Joguhyik, GodLy, OG, Alps Aficionado, Jeffrey Morgan, ReadyPlayerEmma, Tiffany J. Kim, Sebastain Graf, Spencer Kim, Michael Davis, webtim, Talal Aujan, knownsqashed, John Detwiler, Imad Khwaja, Deo Leter, Jerry Meng, Elijah Stavena, Rooh Singh, Pieter, SuperWojo, Alexandros Triantafyllidis, Stephen Murray, Ai Maven, ya boyyy, Enrico Ros, Ken Nordquist, Deep Realms, Nicholas, Spiking Neurons AB, Elle, Will Dee, Jack West, RoA, Luke @flexchar, Viktor Bowallius, Derek Yates, Subspace Studios, jjj, Toran Billups, Asp the Wyvern, Fen Risland, Ilya, NimbleBox.ai, Chadd, Nitin Borwankar, Emre, Mandus, Leonard Tan, Kalila, K, Trailburnt, S_X, Cory Kujawski


Thank you to all my generous patrons and donators!

And thank you again to a16z for their generous grant.

<!-- footer end -->

# Original model card: nRuaif's Kimiko Mistral 7B


<!-- This model card has been generated automatically according to the information the Trainer had access to. You
should probably proofread and complete it, then remove this comment. -->

[<img src="https://raw.githubusercontent.com/OpenAccess-AI-Collective/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/OpenAccess-AI-Collective/axolotl)
# Kimiko-Mistral-7B

This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the Kimiko dataset.
It achieves the following results on the evaluation set:
- Loss: 2.1173

## Model description

Same dataset as Kimiko-v2, but trained on a new base model. THIS IS NOT TRAINED ON THE V3 DATASET.

## Intended uses & limitations

This is a fine-tuning experiment on a new 7B model. You can use it for roleplay or as an assistant.

# Prompt Template Structure
```
This is a chat between ASSISTANT and USER
USER: What is 4x8?
ASSISTANT:

```


### Training hyperparameters

The following hyperparameters were used during training:
- learning_rate: 0.00005
- train_batch_size: 4
- eval_batch_size: 4
- seed: 42
- gradient_accumulation_steps: 16
- total_train_batch_size: 64
- optimizer: Adam with betas=(0.9, 0.95) and epsilon=1e-05
- lr_scheduler_type: cosine
- lr_scheduler_warmup_steps: 10
- num_epochs: 2
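For orientation, these settings map roughly onto 🤗 Transformers `TrainingArguments` as sketched below. This is an assumption-laden reconstruction, not the actual Axolotl config used for training; the effective batch size of 64 follows from 4 (per-device) × 16 (accumulation steps), assuming a single GPU, and `output_dir` is illustrative.

```python
# Rough TrainingArguments equivalent of the hyperparameters above.
# Sketch only: the model was actually trained with Axolotl, whose
# config format differs.
from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="kimiko-mistral-7b",  # illustrative path
    learning_rate=5e-5,
    per_device_train_batch_size=4,
    per_device_eval_batch_size=4,
    seed=42,
    gradient_accumulation_steps=16,  # 4 x 16 = 64 effective batch (1 GPU)
    adam_beta1=0.9,
    adam_beta2=0.95,
    adam_epsilon=1e-5,
    lr_scheduler_type="cosine",
    warmup_steps=10,
    num_train_epochs=2,
)
```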

### Training results

| Training Loss | Epoch | Step | Validation Loss |
|:-------------:|:-----:|:----:|:---------------:|
| 1.5675        | 0.47  | 25   | 2.1323          |
| 1.4721        | 0.95  | 50   | 2.1209          |
| 1.472         | 1.42  | 75   | 2.1177          |
| 1.5445        | 1.9   | 100  | 2.1173          |


### Framework versions

- Transformers 4.34.0.dev0
- Pytorch 2.0.1+cu118
- Datasets 2.14.5
- Tokenizers 0.14.0