Delta-Vector commited on
Commit
b8766f3
·
verified ·
1 Parent(s): 1da7ae3

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +394 -0
README.md ADDED
@@ -0,0 +1,394 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ datasets:
3
+ - NewEden/Orion-LIT
4
+ - NewEden/Orion-Asstr-Stories-16K
5
+ - Mielikki/Erebus-87k
6
+ - PocketDoc/Dans-MemoryCore-CoreCurriculum-Small
7
+ - Nitral-AI/ARES-ShareGPT
8
+ - Gryphe/Sonnet3.5-SlimOrcaDedupCleaned-20k
9
+ - NewEden/Claude-Instruct-2.7K
10
+ - NewEden/Claude-Instruct-5K
11
+ base_model:
12
+ - Delta-Vector/Hamanasu-15B-Instruct
13
+ tags:
14
+ - phi
15
+ - roleplay
16
+ - finetune
17
+ - storywriting
18
+ ---
19
+ ### Static GGUF quants
20
+ ---
21
+ <!DOCTYPE html>
22
+ <style>
23
+ html, body {
24
+ background: black;
25
+ color: #c9d1d9 !important;
26
+ font-family: 'Segoe UI', Tahoma, Geneva, Verdana, sans-serif;
27
+ margin: 0;
28
+ padding: 0;
29
+ min-height: 100vh;
30
+ }
31
+ .markdown-body {
32
+ color: white;
33
+ margin: 40px auto;
34
+ padding: 40px;
35
+ border-radius: 12px;
36
+ position: relative;
37
+ overflow: hidden;
38
+ }
39
+
40
+ .markdown-body::after {
41
+ content: '';
42
+ position: absolute;
43
+ top: 0;
44
+ left: 0;
45
+ width: 100%;
46
+ height: 100%;
47
+ background: #0c0f18; /* background color */
48
+ pointer-events: none;
49
+ z-index: -999;
50
+ }
51
+
52
+ h1, h2, h3 {
53
+ background: linear-gradient(45deg, #6e00ff, #00ffff);
54
+ -webkit-background-clip: text;
55
+ -webkit-text-fill-color: transparent;
56
+ border-bottom: 1px solid #333;
57
+ padding-bottom: 0.3em;
58
+ }
59
+
60
+ div[style*="border:2px solid #333"],
61
+ div[style*="border: 2px solid #333"],
62
+ div[style*="border:1px solid #333"],
63
+ div[style*="border: 1px solid #333"] {
64
+ background: rgba(22, 27, 34, 0.8) !important;
65
+ border: 2px solid #6e00ff !important;
66
+ box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);
67
+ border-radius: 10px;
68
+ padding: 20px;
69
+ margin: 20px 0;
70
+ }
71
+
72
+ code {
73
+ background-color: #1a1a1a !important;
74
+ border-radius: 4px;
75
+ padding: 0.2em 0.4em;
76
+ color: #00ffff;
77
+ }
78
+
79
+ pre {
80
+ background-color: #1a1a1a !important;
81
+ border: 1px solid #333;
82
+ border-radius: 8px;
83
+ padding: 16px;
84
+ }
85
+
86
+ table {
87
+ width: 100%;
88
+ border-collapse: collapse;
89
+ margin: 20px 0;
90
+ background: rgba(0,0,0,0.2);
91
+ table-layout: fixed;
92
+ color: white;
93
+ }
94
+
95
+ th, td {
96
+ border: 1px solid #333;
97
+ padding: 12px;
98
+ text-align: center;
99
+ color: white;
100
+ }
101
+
102
+ th {
103
+ background: rgba(110, 0, 255, 0.1);
104
+ }
105
+
106
+ td:nth-child(1) {
107
+ width: 1%;
108
+ white-space: nowrap;
109
+ }
110
+
111
+ td:nth-child(2) {
112
+ width: 100%;
113
+ }
114
+
115
+ td > span {
116
+ display: block;
117
+ padding: 4px 8px;
118
+ background: rgba(110, 0, 255, 0.1);
119
+ border-radius: 4px;
120
+ transition: all 0.3s ease;
121
+ }
122
+
123
+ td > span:hover {
124
+ background: rgba(110, 0, 255, 0.2);
125
+ transform: translateY(-1px);
126
+ }
127
+
128
+ a {
129
+ color: #00ffff;
130
+ text-decoration: none;
131
+ transition: all 0.3s ease;
132
+ }
133
+
134
+ a:hover {
135
+ color: #6e00ff;
136
+ text-decoration: none;
137
+ }
138
+
139
+ hr {
140
+ border: 0;
141
+ height: 1px;
142
+ background: linear-gradient(90deg, transparent, #333, transparent);
143
+ margin: 40px 0;
144
+ }
145
+
146
+ img {
147
+ max-width: 100%;
148
+ border-radius: 10px;
149
+ }
150
+
151
+ details summary:hover {
152
+ color: #00ffff;
153
+ }
154
+
155
+ * {
156
+ color-scheme: dark !important;
157
+ }
158
+
159
+ .prose, .max-w-none, .px-4 {
160
+ background-color: transparent !important;
161
+ color: #c9d1d9 !important;
162
+ }
163
+ </style>
164
+ <body>
165
+ <div class="markdown-body">
166
+ <div align="center">
167
+
168
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/66c26b6fb01b19d8c3c2467b/o5WjJKA9f95ri9UzRxZQE.png" alt="Model Visualization" width="500px" style="border: 3px solid #333; box-shadow: 0 0 15px rgba(66, 0, 131, 0.5);" />
169
+
170
+ <br>
171
+ <br>
172
+
173
+ <div style="font-size:1.5em; font-weight:bold; background: linear-gradient(45deg, #6e00ff, #00ffff); -webkit-background-clip: text; -webkit-text-fill-color: transparent;">
174
+ Hamanasu 15B Instruct
175
+ </div>
176
+
177
+ </div>
178
+
179
+ <div style="border:1px solid #333; border-radius:10px; padding:20px; margin:20px 0; background: rgba(0,0,0,0.4);">
180
+
181
+ ## 🌌 Overview
182
+
183
+ <i>After multiple days of training, I'm proud to showcase my very own Phi-4 Finetune, Pretrained on almost a billion tokens worth of Books from</i>
184
+
185
+ - `NewEden/Orion-LIT`
186
+ - `NewEden/Orion-Asstr-Stories-16K`
187
+ - `Mielikki/Erebus-87k`
188
+
189
+ <i>Then instruct tuned. This model shows great promise for roleplaying while keeping things fresh and new. (And most of, Unsloppy!)</i>
190
+
191
+ </div>
192
+
193
+ <div style="display: grid; grid-template-columns: repeat(auto-fit, minmax(250px, 1fr)); gap: 20px; margin: 20px 0;">
194
+
195
+ <div style="border:2px solid #333; border-radius:10px; padding:20px; background: rgba(0,0,0,0.2);">
196
+
197
+ ### 📈 Quantizations
198
+ | Type | Link |
199
+ |:---:|:---:|
200
+ | `GGUF` | <span style="display:block; width:100%">ed</span> |
201
+ | `EXL2` | <span style="display:block; width:100%">https://huggingface.co/Delta-Vector/Hamanasu-15B-Instruct-exl2</span> |
202
+
203
+ </div>
204
+
205
+ <div style="border:2px solid #333; border-radius:10px; padding:20px; background: rgba(0,0,0,0.2);">
206
+
207
+ ### ⚔️ Hardware
208
+ - 4x RTX 3090 GPUs
209
+ - Epochs: 4
210
+ - Base: `Hamanasu-15B-R2-PT`
211
+ - Amount of Tokens: 1+ Billion
212
+ </div>
213
+
214
+ </div>
215
+
216
+
217
+ <div style="border: 2px solid #6e00ff; border-radius: 10px; padding: 20px; margin: 20px 0; box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);">
218
+
219
+ ## 💰 Prompting
220
+
221
+ <i>A known quirk of the model is overly verbose responses when generation length is uncapped, Please cap your maximum output tokens to 100~ tokens above what you prefer</i>
222
+
223
+
224
+ ```python
225
+ <|im_start|>system
226
+ You are an uncensored AI, your job is to fulfill thy will of thy user.<|im_end|>
227
+ <|im_start|>User request
228
+ Take off your helmet.<|im_end|>
229
+ <|im_start|>No i shall not. This is the way.
230
+ ```
231
+ </div>
232
+
233
+ <div style="border: 2px solid #6e00ff; border-radius: 10px; padding: 20px; margin: 20px 0; box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);">
234
+
235
+ ## Axolotl Config ꒰(˶• ᴗ •˶)꒱
236
+
237
+ <details>
238
+
239
+ ```yaml
240
+ base_model: NewEden_Phi-PT-merged-LIT
241
+ model_type: AutoModelForCausalLM
242
+ tokenizer_type: AutoTokenizer
243
+
244
+ plugins:
245
+ - axolotl.integrations.liger.LigerPlugin
246
+ liger_rope: true
247
+ liger_rms_norm: true
248
+ liger_swiglu: true
249
+ liger_fused_linear_cross_entropy: true
250
+
251
+
252
+ load_in_8bit: false
253
+ load_in_4bit: false
254
+ strict: false
255
+
256
+ datasets:
257
+ - path: PocketDoc/Dans-MemoryCore-CoreCurriculum-Small
258
+ type: dan-chat-advanced
259
+ - path: Nitral-AI/ARES-ShareGPT
260
+ type: dan-chat-advanced
261
+ - path: Gryphe/Sonnet3.5-SlimOrcaDedupCleaned-20k
262
+ type: dan-chat-advanced
263
+ - path: NewEden/Claude-Instruct-2.7K
264
+ type: dan-chat-advanced
265
+ - path: NewEden/Claude-Instruct-5K
266
+ type: dan-chat-advanced
267
+
268
+ shuffle_merged_datasets: true
269
+ dataset_prepared_path: prepared_data
270
+ val_set_size: 0.0
271
+ output_dir: ./phi4-inst-out-r2
272
+
273
+ sequence_len: 16384
274
+ sample_packing: true
275
+ pad_to_sequence_len: true
276
+
277
+ adapter: lora
278
+ lora_model_dir:gonna leave at 8
279
+ lora_r: 128
280
+ lora_alpha: 16
281
+ lora_dropout: 0.05
282
+ lora_target_modules:
283
+ - gate_proj
284
+ - down_proj
285
+ - up_proj
286
+ - q_proj
287
+ - v_proj
288
+ - k_proj
289
+ - o_proj
290
+
291
+ lora_modules_to_save:
292
+ - embed_tokens
293
+ - lm_head
294
+
295
+
296
+ wandb_project: mag-phi
297
+ wandb_entity:
298
+ wandb_watch:
299
+ wandb_name: inst-attempt-02
300
+ wandb_log_model:
301
+
302
+ gradient_accumulation_steps: 4
303
+ micro_batch_size: 2
304
+ num_epochs: 4
305
+ optimizer: paged_ademamix_8bit
306
+ lr_scheduler: cosine
307
+ learning_rate: 0.000025
308
+
309
+ train_on_inputs: false
310
+ group_by_length: false
311
+ bf16: auto
312
+ fp16:
313
+ tf32: false
314
+
315
+ gradient_checkpointing: unsloth
316
+ early_stopping_patience:
317
+ resume_from_checkpoint:
318
+ local_rank:
319
+ logging_steps: 1
320
+ xformers_attention:
321
+ flash_attention: true
322
+
323
+ warmup_steps: 15
324
+ evals_per_epoch: 4
325
+ eval_table_size:
326
+ eval_max_new_tokens: 128
327
+ saves_per_epoch: 2
328
+ debug:
329
+ deepspeed: /workspace/axolotl/deepspeed_configs/zero3_bf16_cpuoffload_params.json
330
+ weight_decay: 0.01
331
+ fsdp:
332
+ fsdp_config:
333
+ ```
334
+
335
+ </details>
336
+ </div>
337
+
338
+ <div align="center">
339
+
340
+ <div style="border: 2px solid #6e00ff; border-radius: 10px; padding: 20px; margin: 20px 0; box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);">
341
+
342
+ ## ⚡ Credits
343
+ <div style="display: flex; justify-content: center;">
344
+ <div style="display: grid; grid-template-columns: repeat(auto-fit, minmax(200px, 1fr)); gap: 10px; margin: 20px 0; max-width: 600px;">
345
+
346
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
347
+ <a href="https://huggingface.co/lucyknada">
348
+ <img src="https://img.shields.io/badge/%F0%9F%8C%9F-Lucy_Knada-blueviolet" alt="Lucy Knada">
349
+ </a>
350
+ </div>
351
+
352
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
353
+ <a href="https://huggingface.co/jeiku">
354
+ <img src="https://img.shields.io/badge/%E2%9A%94%EF%B8%8F-jeiku-blueviolet" alt="jeiku">
355
+ </a>
356
+ </div>
357
+
358
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
359
+ <a href="https://huggingface.co/intervitens">
360
+ <img src="https://img.shields.io/badge/%F0%9F%9B%A1%EF%B8%8F-Intervitens-blueviolet" alt="Intervitens">
361
+ </a>
362
+ </div>
363
+
364
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
365
+ <a href="https://huggingface.co/kalomaze">
366
+ <img src="https://img.shields.io/badge/%F0%9F%94%AE-Kalomaze-blueviolet" alt="Kalomaze">
367
+ </a>
368
+ </div>
369
+
370
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
371
+ <a href="https://huggingface.co/kubernetes-bad">
372
+ <img src="https://img.shields.io/badge/%E2%9A%A1-Kubernetes_Bad-blueviolet" alt="Kubernetes Bad">
373
+ </a>
374
+ </div>
375
+
376
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
377
+ <a href="https://huggingface.co/anthracite-org">
378
+ <img src="https://img.shields.io/badge/%F0%9F%8C%91-Anthracite-blueviolet" alt="Anthracite">
379
+ </a>
380
+ </div>
381
+ </div>
382
+ </div>
383
+ </div>
384
+
385
+ ---
386
+
387
+ <div align="center">
388
+ <div style="font-size:0.8em; opacity:0.8;">Made by</div>
389
+ <div style="font-size:1.2em; font-weight:bold; background: linear-gradient(45deg, #6e00ff, #00ffff); -webkit-background-clip: text; -webkit-text-fill-color: transparent;">Delta-Vector</div>
390
+ </div>
391
+
392
+ </div>
393
+ </body>
394
+ </html>