grimulkan committed
Commit 516cb3e
1 Parent(s): a305045

Update README.md

Files changed (1):
  1. README.md +19 -19

README.md CHANGED
@@ -8,6 +8,17 @@ license: unknown
  * Greatly minimizes "chatGPTisms". No more feeling empowered by the shared bonds of friendship with renewed determination for challenges to come.
  * Increased diversity of creative prose.

+ ### Examples
+
+ Examples were generated with `Mirostat tau = 2`.
+
+ * **Multi-Round Story Writing**: [Sci-Fi Story]()
+ * **Oneshot Story Writing**: [Crime Story]() Generating >2K tokens of meaningful content in a single response (without multi-round) is challenging; this took a few tries. Smoke and mirrors.
+ * **Multi-Round Story Planning/Brainstorming**: [Adventure Story Brainstorming]()
+ * **Document Q&A and Summarization**: [Wikipedia example]()
+ * **Roleplaying (RP)**: [RP example]()
+ * **Interactive World Exploration**: [Explore a cyberpunk world]()
+
  ### Details (same as alpha)

  * Base model: [llama2_70b_longlora_fp16_32k_ROPE8](https://huggingface.co/grimulkan/llama2_70b_longlora_fp16_32k_ROPE8) (no base instruction tuning)
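
The `Mirostat tau = 2` setting used to generate the examples in the hunk above is a llama.cpp-family sampler option. Below is a minimal sketch of passing it through llama-cpp-python against one of the GGUF quantizations added later in this commit; the file path, RoPE override, and prompt are illustrative assumptions, not part of the card:

```python
# Minimal sketch, assuming llama-cpp-python and a local GGUF download;
# the file name below is hypothetical.
from llama_cpp import Llama

llm = Llama(
    model_path="aurelian-v0.5-70b-rope8-32K.Q4_K_M.gguf",  # hypothetical path
    n_ctx=32768,            # the model targets 32K context
    rope_freq_scale=0.125,  # linear RoPE scale 1/8 (rope8); may be redundant
                            # if the GGUF metadata already encodes it
)

out = llm.create_completion(
    "Write the opening scene of a sci-fi story.",
    max_tokens=512,
    mirostat_mode=2,   # Mirostat v2
    mirostat_tau=2.0,  # tau = 2, matching the examples above
    mirostat_eta=0.1,  # llama.cpp default learning rate
)
print(out["choices"][0]["text"])
```

Other backends generally expose the same tau/eta knobs, though the parameter names vary by client.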
@@ -18,25 +29,6 @@ license: unknown
  * Intended to be used in instruct mode (rather than notebook mode/completions).
  * **This model is not censored, and is capable of producing offensive and NSFW content. Please use this model with caution, and do not use it if you are offended by such content.**

- ## Available Quantizations
-
- * [bfloat16](https://huggingface.co/grimulkan/aurelian-v0.5-70b-rope8-32K-fp16)
- * [EXL2 2.4bit]() fits in 1x24GB using Exllamav2 & 8-bit cache @ 10K context
- * [EXL2 4bit]() fits in 2x24GB (19/24) using Exllamav2 @ 16K context
- * [EXL2 6bit](https://huggingface.co/grimulkan/aurelian-v0.5-70b-rope8-32K-6bpw_h8_exl2) fits in 48GB+24GB (36/24 split) or 3x24GB (16/17/20 split) using Exllamav2 @ 32k context
- * [GGUF]()
-
- ### Examples
-
- Examples here are SFW & non-cherry picked, generated with `Mirostat tau = 2`.
-
- * **Multi-Round Story Writing**: [Sci-Fi Story]()
- * **Oneshot Story-writing**: [Fantasy Story]()
- * **Story Brainstorming/Speculation**: [Harry Potter Brainstorming]()
- * **Document Q&A and Summarization**: [Wikipedia example]()
- * **Roleplaying (RP)**: [RP example]()
- * **Interactive World Exploration**: [Explore a cyberpunk world]()
-
  ## Tips

  * Treat the first prompt like you normally would the system prompt.
@@ -47,6 +39,14 @@ Examples here are SFW & non-cherry picked, generated with `Mirostat tau = 2`.
  * Statements like `Respond briefly` would bias it shorter.
  * Explain clearly if you want the content to be SFW or NSFW in the first prompt as well. However, **there are no guarantees that the model won't generate NSFW content**.

+ ## Available Quantizations
+
+ * [bfloat16](https://huggingface.co/grimulkan/aurelian-v0.5-70b-rope8-32K-fp16)
+ * [EXL2 2.4bit](https://huggingface.co/grimulkan/aurelian-v0.5-70b-rope8-32K-2.4bpw_h6_exl2) fits in 1x24GB using Exllamav2 & 8-bit cache @ 10K context
+ * [EXL2 4bit](https://huggingface.co/grimulkan/aurelian-v0.5-70b-rope8-32K-4.65bpw_h6_exl2) fits in 2x24GB (19/24 split) using Exllamav2 @ 16K context
+ * [EXL2 6bit](https://huggingface.co/grimulkan/aurelian-v0.5-70b-rope8-32K-6bpw_h8_exl2) fits in 48GB+24GB (36/24 split) or 3x24GB (16/17/20 split) using Exllamav2 @ 32K context
+ * [GGUF](https://huggingface.co/grimulkan/aurelian-v0.5-70b-rope8-32K_GGUF) in Q4_K_M, Q5_K_M, and Q6_K variants
+
  ### Training Data

  85% of the training data was human-generated output with synthetic input; the remaining 15% was from GPT-4.
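
The GPU splits quoted in the quantization list above (19/24, 36/24, 16/17/20) are GB-per-card allocations of the kind Exllamav2 takes via its `gpu_split` load argument. A minimal loading sketch, assuming the exllamav2 Python package with the 4.65bpw EXL2 weights downloaded locally; the directory name and generation call are illustrative:

```python
# Minimal sketch, assuming the exllamav2 package; the split mirrors the
# 2x24GB (19/24) entry in the quantization list above.
from exllamav2 import (
    ExLlamaV2,
    ExLlamaV2Cache,
    ExLlamaV2Config,
    ExLlamaV2Tokenizer,
)
from exllamav2.generator import ExLlamaV2BaseGenerator, ExLlamaV2Sampler

config = ExLlamaV2Config()
config.model_dir = "aurelian-v0.5-70b-rope8-32K-4.65bpw_h6_exl2"  # local download
config.prepare()
config.max_seq_len = 16384  # 16K context fits the 2x24GB split per the list

model = ExLlamaV2(config)
model.load(gpu_split=[19, 24])  # GB per GPU, matching the 19/24 split

tokenizer = ExLlamaV2Tokenizer(config)
cache = ExLlamaV2Cache(model)
generator = ExLlamaV2BaseGenerator(model, cache, tokenizer)

settings = ExLlamaV2Sampler.Settings()  # defaults; mirostat can be set here too
print(generator.generate_simple("Once upon a time,", settings, 64))
```

For the 1x24GB 2.4bpw configuration, swapping `ExLlamaV2Cache` for `ExLlamaV2Cache_8bit` roughly halves the cache footprint, which is the 8-bit cache trick the list refers to.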
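
Because the Tips section treats the first prompt as the de facto system prompt, the role, length bias, and SFW/NSFW intent all belong in that opening turn. A hypothetical first prompt combining those tips, offered as an illustration rather than a prescribed template:

```
You are my co-writer for a multi-round sci-fi story. Write detailed scenes
of roughly 800 words in third person past tense; I will say "respond
briefly" when I want shorter replies. Keep all content SFW.

Scene 1: a generation ship loses contact with Earth mid-voyage.
```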