munish0838 committed on
Commit 161c7f0
1 Parent(s): 4b4a190

Create README.md

  1. README.md +118 -0
README.md ADDED
@@ -0,0 +1,118 @@
+ ---
+ license: apache-2.0
+ datasets:
+ - NobodyExistsOnTheInternet/ToxicQAFinal
+ library_name: transformers
+ pipeline_tag: text-generation
+ base_model: fearlessdots/Alpha-Orionis-v0.1
+ ---
+
+ # Alpha-Orionis-v0.1-GGUF
+ This is a quantized version of [fearlessdots/Alpha-Orionis-v0.1](https://huggingface.co/fearlessdots/Alpha-Orionis-v0.1), created using llama.cpp.
+
+ ---
+
+ ## Disclaimer
+
+ **Note:** All models and LoRAs from the **Orion** series were created with the sole purpose of research. The usage of this model and/or its related LoRA implies agreement with the following terms:
+
+ - The user is responsible for what they might do with it, including how the output of the model is interpreted and used;
+ - The user should not use the model and its outputs for any illegal purposes;
+ - The user is solely responsible for any misuse or negative consequences from using this model and/or its related LoRA.
+
+ I do not endorse any particular perspectives presented in the training data.
+
+ ---
+
+ ## Orion Series
+
+ This series aims to develop highly uncensored Large Language Models (LLMs) with the following focuses:
+
+ - Science, Technology, Engineering, and Mathematics (STEM)
+ - Computer Science (including programming)
+ - Social Sciences
+
+ And several key cognitive skills, including but not limited to:
+
+ - Reasoning and logical deduction
+ - Critical thinking
+ - Analysis
+
+ While maintaining strong overall knowledge and expertise, the models will undergo refinement through:
+
+ - Fine-tuning processes
+ - Model merging techniques including Mixture of Experts (MoE)
+
+ Please note that these models are experimental and may demonstrate varied levels of effectiveness. Your feedback, critique, or queries are most welcome for improvement purposes.
+
+ ## Base
+
+ This model and its related LoRA were fine-tuned from [https://huggingface.co/fearlessdots/WizardLM-2-7B-abliterated](https://huggingface.co/fearlessdots/WizardLM-2-7B-abliterated).
+
+ ## LoRA
+
+ The LoRA merged with the base model is available at [https://huggingface.co/fearlessdots/Alpha-Orionis-v0.1-LoRA](https://huggingface.co/fearlessdots/Alpha-Orionis-v0.1-LoRA).
+
+ ## GGUF
+
+ I provide some GGUF files here: [https://huggingface.co/fearlessdots/Alpha-Orionis-v0.1-GGUF](https://huggingface.co/fearlessdots/Alpha-Orionis-v0.1-GGUF).
+
+ ## Datasets
+
+ - [https://huggingface.co/datasets/NobodyExistsOnTheInternet/ToxicQAFinal](https://huggingface.co/datasets/NobodyExistsOnTheInternet/ToxicQAFinal)
+
+ ## Fine Tuning
+
+ ### - Quantization Configuration
+
+ - load_in_4bit=True
+ - bnb_4bit_quant_type="fp4"
+ - bnb_4bit_compute_dtype=compute_dtype
+ - bnb_4bit_use_double_quant=False
+
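The option names above match keyword arguments of `transformers.BitsAndBytesConfig`, so the settings were presumably passed to that class, although the README does not show the call. The sketch below only collects them in a plain dictionary and estimates the memory needed for the weights at 4 bits per parameter. The parameter count (about 7.24 billion, typical of a Mistral-7B-class model) and the value behind `compute_dtype` are assumptions, since neither is stated here.

```python
# Settings as listed above. compute_dtype is not given in the README,
# so it is kept as a placeholder string (an assumption, not a real dtype).
quant_config = {
    "load_in_4bit": True,
    "bnb_4bit_quant_type": "fp4",
    "bnb_4bit_compute_dtype": "compute_dtype (unspecified)",
    "bnb_4bit_use_double_quant": False,
}

# Assumed parameter count for a Mistral-7B-class base model (not from the README).
ASSUMED_PARAMS = 7.24e9


def weight_memory_gib(num_params: float, bits_per_param: float) -> float:
    """Approximate memory for the weights alone, in GiB.

    Ignores the per-block quantization constants, activations,
    optimizer state, and the LoRA adapter itself.
    """
    return num_params * bits_per_param / 8 / 2**30


if __name__ == "__main__":
    for label, bits in [("16-bit", 16), ("4-bit", 4)]:
        print(f"{label}: ~{weight_memory_gib(ASSUMED_PARAMS, bits):.1f} GiB")
```

Under these assumptions the 4-bit weights need roughly a quarter of the 16-bit footprint, which is the main reason to load the base model this way before attaching a LoRA.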
+ ### - PEFT Parameters
+
+ - lora_alpha=64
+ - lora_dropout=0.05
+ - r=128
+ - bias="none"
+
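For reference, standard LoRA scales its low-rank update by `lora_alpha / r`, so the values above give a factor of 0.5. The sketch below computes that factor and the number of trainable parameters LoRA adds to a single linear layer. The 4096 x 4096 layer shape is hypothetical: the README does not list the target modules or their sizes.

```python
LORA_ALPHA = 64
LORA_R = 128


def lora_scaling(alpha: int, r: int) -> float:
    """Scaling applied to the low-rank update B @ A in standard LoRA."""
    return alpha / r


def lora_params_per_layer(d_in: int, d_out: int, r: int) -> int:
    """Trainable parameters added to one linear layer.

    A has shape (r, d_in) and B has shape (d_out, r).
    """
    return r * (d_in + d_out)


if __name__ == "__main__":
    print("scaling:", lora_scaling(LORA_ALPHA, LORA_R))
    # Hypothetical 4096 x 4096 projection; the real target modules are not stated.
    print("params per layer:", lora_params_per_layer(4096, 4096, LORA_R))
```

A rank of 128 is fairly large for LoRA, so each adapted layer carries about a million extra trainable parameters under the hypothetical shape used here.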
+ ### - Training Arguments
+
+ - num_train_epochs=1
+ - per_device_train_batch_size=1
+ - gradient_accumulation_steps=4
+ - optim="adamw_bnb_8bit"
+ - save_steps=25
+ - logging_steps=25
+ - learning_rate=2e-4
+ - weight_decay=0.001
+ - fp16=False
+ - bf16=False
+ - max_grad_norm=0.3
+ - max_steps=-1
+ - warmup_ratio=0.03
+ - group_by_length=True
+ - lr_scheduler_type="constant"
+
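Two consequences of these arguments are easy to work out by hand. With `per_device_train_batch_size=1` and `gradient_accumulation_steps=4`, each optimizer step sees 4 examples per device, and with `max_steps=-1` the run length is set by `num_train_epochs`. The sketch below assumes a single device and takes the dataset size as a hypothetical input, since neither is stated in the README.

```python
import math

PER_DEVICE_BATCH = 1
GRAD_ACCUM_STEPS = 4
NUM_EPOCHS = 1


def effective_batch_size(num_devices: int = 1) -> int:
    """Examples consumed per optimizer step across all devices."""
    return PER_DEVICE_BATCH * GRAD_ACCUM_STEPS * num_devices


def optimizer_steps(num_examples: int, num_devices: int = 1) -> int:
    """Approximate optimizer steps for the whole run.

    Rounds the last partial batch up; a real trainer may round differently.
    """
    return math.ceil(num_examples / effective_batch_size(num_devices)) * NUM_EPOCHS


if __name__ == "__main__":
    # 1000 is a hypothetical dataset size used only for illustration.
    print("effective batch size:", effective_batch_size())
    print("optimizer steps:", optimizer_steps(1000))
```

Since `save_steps=25` and `logging_steps=25` count optimizer steps, a checkpoint and a log entry would be written every 100 examples on one device. Note also that, as far as I know, the plain `constant` schedule in the Hugging Face `Trainer` applies no warmup (that is `constant_with_warmup`), so `warmup_ratio=0.03` likely has no effect here.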
+ ## Credits
+
+ - The Wizard team: for creating the incredible base model;
+ - Hugging Face: for hosting this model and for creating the fine-tuning tools used;
+ - failspy ([https://huggingface.co/failspy](https://huggingface.co/failspy)): for the orthogonalization implementation;
+ - NobodyExistsOnTheInternet ([https://huggingface.co/NobodyExistsOnTheInternet](https://huggingface.co/NobodyExistsOnTheInternet)): for the incredible dataset;
+ - Undi95 ([https://huggingface.co/Undi95](https://huggingface.co/Undi95)) and Sao10K ([https://huggingface.co/Sao10K](https://huggingface.co/Sao10K)): my main inspirations for making these models =]
+
+ A huge thank you to all of them ☺️
+
+ ## About Alpha Orionis
+
+ **Alpha Orionis**, commonly known as Betelgeuse, is a red supergiant star located in the constellation **Orion**. With an apparent magnitude ranging from +0.0 to +1.6, it is the second-brightest star in the constellation and usually the tenth-brightest in the night sky. It appears distinctly reddish and is classified as a semi-regular variable star because of this wide range in brightness. At near-infrared wavelengths, it becomes the brightest star in the night sky.
+
+ **Alpha Orionis** has a radius approximately 760 times that of the Sun, meaning it would extend far past the orbit of Mars if placed at the center of our solar system. Estimates suggest it has a mass between 10 and 20 times that of the Sun. Although it is relatively close to us, with recent measurements placing it around 400 to 600 light-years away, significant uncertainty remains regarding its exact distance.
+
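The claim about the orbit of Mars can be checked with a short unit conversion. The sketch below converts a radius of 760 solar radii into astronomical units and compares it with planetary orbits. The solar radius, the astronomical unit, and the orbital semi-major axes are standard reference values added here; they do not come from the README.

```python
SOLAR_RADIUS_KM = 695_700.0   # nominal solar radius (reference value)
AU_KM = 149_597_870.7         # astronomical unit in kilometers
ORBIT_AU = {"Mars": 1.524, "Jupiter": 5.203}  # orbital semi-major axes in AU


def radius_in_au(radius_solar: float) -> float:
    """Convert a stellar radius from solar radii to astronomical units."""
    return radius_solar * SOLAR_RADIUS_KM / AU_KM


if __name__ == "__main__":
    r = radius_in_au(760)
    print(f"radius: {r:.2f} AU")
    for planet, orbit in ORBIT_AU.items():
        position = "inside" if orbit < r else "outside"
        print(f"{planet}'s orbit would lie {position} the star")
```

With these values the star's surface would sit at roughly 3.5 AU, beyond the orbit of Mars but short of Jupiter's.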
+ This young stellar giant, less than 10 million years old, has already exhausted much of its nuclear fuel and will eventually explode in a spectacular supernova, potentially within the next 100,000 years. Such an event could cause it to outshine even the Moon for several months, though it would pose no threat to life on Earth. Because of its high velocity through the interstellar medium (approximately 30 kilometers per second), it creates a massive bow shock in space, extending up to four light-years across.
+
+ In addition to these remarkable features, **Alpha Orionis** holds the distinction of being the first extrasolar star whose photospheric angular size was measured, back in 1920. Modern observations reveal an average angular diameter of 0.048 arcseconds, one of the largest apparent diameters of any star in the night sky. It is also surrounded by a vast, irregular envelope roughly 250 times the size of the star, the result of substantial mass loss throughout its lifetime. These combined characteristics place Alpha Orionis among the most fascinating and intriguing celestial bodies observable from Earth.
+
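As a rough consistency check, the angular diameter quoted above can be combined with the 400 to 600 light-year distance range to estimate a physical radius. The sketch uses the small-angle approximation (diameter = angle in radians x distance); the conversion constants are standard reference values, not figures from the README.

```python
import math

ARCSEC_TO_RAD = math.pi / (180 * 3600)            # one arcsecond in radians
AU_PER_LIGHT_YEAR = 63_241.077                    # astronomical units per light-year
SOLAR_RADII_PER_AU = 149_597_870.7 / 695_700.0    # solar radii per astronomical unit


def radius_solar(angular_diameter_arcsec: float, distance_ly: float) -> float:
    """Estimate a stellar radius in solar radii via the small-angle approximation."""
    diameter_au = angular_diameter_arcsec * ARCSEC_TO_RAD * distance_ly * AU_PER_LIGHT_YEAR
    return diameter_au / 2 * SOLAR_RADII_PER_AU


if __name__ == "__main__":
    for distance in (400, 600):
        print(f"{distance} ly: ~{radius_solar(0.048, distance):.0f} solar radii")
```

The resulting range brackets the radius of about 760 solar radii quoted earlier, and shows how directly the distance uncertainty carries over into the size estimate.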
+ **Source:** retrieved from [https://en.wikipedia.org/wiki/Betelgeuse](https://en.wikipedia.org/wiki/Betelgeuse) and processed with [https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1).