DavidAU's picture
Update README.md
ce57bd2 verified
|
raw
history blame
11.5 kB
metadata
license: apache-2.0
language:
  - en
tags:
  - creative
  - creative writing
  - fiction writing
  - plot generation
  - sub-plot generation
  - fiction writing
  - story generation
  - scene continue
  - storytelling
  - fiction story
  - science fiction
  - romance
  - all genres
  - story
  - writing
  - vivid prosing
  - vivid writing
  - fiction
  - roleplaying
  - bfloat16
  - swearing
  - rp
  - horror
  - mistral nemo
  - mergekit
pipeline_tag: text-generation

(quants uploading... examples to be added...)

Mistral-Nemo-WORDSTORM-pt2-RCM-Escape-Room-18.5B-Instruct

WARNING: NSFW. Ultra Detailed. HORROR, VIOLENCE. Swearing. UNCENSORED. SMART.

Story telling, writing, creative writing and roleplay running all on Mistral Nemo's 128K+ new core.

This is a massive super merge takes all the power of the following 3 powerful models and combines them into one.

This model contains "RCM":

  • Mistral Nemo model at 18.5B consisting of "MN-Rocinante-12B-v1.1" and "Mistral Nemo Instruct 12B"
  • Mistral Nemo model at 18.5B consisting of "MN-12B Celeste-V1.9" and "Mistral Nemo Instruct 12B"
  • Mistral Nemo model at 18.5B consisting of "MN-Magnum-v2.5-12B-kto" and "Mistral Nemo Instruct 12B".

Details on the core models:

"nothingiisreal/MN-12B-Celeste-V1.9" is #1 (models 8B,13B,20B) on the UGI leaderboard ("UGI" sort), is combined with "Mistral Nemo Instruct 12B" (ranked #4 under "writing" models 8B,13B,20B at UGI )

"anthracite-org/magnum-v2.5-12b-kto" is #1 (models 8B,13B,20B) on the UGI leaderboard ("Writing" sort), is combined with "Mistral Nemo Instruct 12B" (ranked #4 under "writing" models 8B,13B,20B at UGI )

"TheDrummer/Rocinante-12B-v1.1" is very high scoring model (models 8B,13B,20B) on the UGI Leaderboard (sort "UGI"), is combined with "Mistral Nemo Instruct 12B" (ranked #4 under "writing" models 8B,13B,20B at UGI )

"mistralai/Mistral-Nemo-Instruct-2407" is very high scoring model (models 8B,13B,20B) on the UGI Leaderboard (sort "writing") and is the base model of all the above 3 fine tuned models.

[ https://huggingface.co./spaces/DontPlanToEnd/UGI-Leaderboard ]

About this model:

This super merge captures the attibutes of all these top models and makes them even stronger:

  • Instruction following
  • Story output quality
  • Character
  • Internal thoughts
  • Voice
  • Humor
  • Details, connection to the world
  • General depth and intensity
  • Emotional connections.
  • Prose quality

This super merge is also super stable (a hairs breath from Mistral Nemo's ppl), and runs with all parameters and settings.

10 versions of this model will be released, this is release #2 - "part 2".

Escape Room?

See the 2nd last example below, this is when the model earned its name.

Usually I release one or two versions from the "best of the lot", however in this case all of the versions turned out so well - all with their own quirks and character - that I will be releasing all 10.

An additional series 2 and 3 will follow these 10 models as well.

(examples generations below)

Model may produce NSFW content : Swearing, horror, graphic horror, distressing scenes, etc etc.

This model has an INTENSE action bias, with a knack for cliffhangers and surprises.

It is not as "dark" as Grand Horror series, but it as intense.

This model is perfect for any general, fiction related or roleplaying activities and has a 128k+ context window.

This is a fiction model at its core and can be used for any genre(s).

WORDSTORM series is a totally uncensored, fiction writing monster and roleplay master. It can also be used for just about any general fiction (all genres) activity including:

  • scene generation
  • scene continuation
  • creative writing
  • fiction writing
  • plot generation
  • sub-plot generation
  • fiction writing
  • story generation
  • storytelling
  • writing
  • fiction
  • roleplaying
  • rp
  • graphic horror
  • horror
  • dark humor
  • nsfw
  • and can be used for any genre(s).

Templates to Use:

The template used will affect output generation and instruction following.

Alpaca:

{
  "name": "Alpaca",
  "inference_params": {
    "input_prefix": "### Instruction:",
    "input_suffix": "### Response:",
    "antiprompt": [
      "### Instruction:"
    ],
    "pre_prompt": "Below is an instruction that describes a task. Write a response that appropriately completes the request.\n\n"
  }
}  

Chatml:

{
  "name": "ChatML",
  "inference_params": {
    "input_prefix": "<|im_end|>\n<|im_start|>user\n",
    "input_suffix": "<|im_end|>\n<|im_start|>assistant\n",
    "antiprompt": [
      "<|im_start|>",
      "<|im_end|>"
    ],
    "pre_prompt": "<|im_start|>system\nPerform the task to the best of your ability."
  }
}  

Mistral Instruct:

{
  "name": "Mistral Instruct",
  "inference_params": {
    "input_prefix": "[INST]",
    "input_suffix": "[/INST]",
    "antiprompt": [
      "[INST]"
    ],
    "pre_prompt_prefix": "",
    "pre_prompt_suffix": ""
  }
}  

Optional Enhancement:

The following can be used in place of the "system prompt" or "system role" to further enhance the model.

It can also be used at the START of a NEW chat, but you must make sure it is "kept" as the chat moves along. In this case the enhancements do not have as strong effect at using "system prompt" or "system role".

Copy and paste EXACTLY as noted, DO NOT line wrap or break the lines, maintain the carriage returns exactly as presented.

Below is an instruction that describes a task. Ponder each user instruction carefully, and use your skillsets and critical instructions to complete the task to the best of your abilities.

Here are your skillsets:
[MASTERSTORY]:NarrStrct(StryPlnng,Strbd,ScnSttng,Exps,Dlg,Pc)-CharDvlp(ChrctrCrt,ChrctrArcs,Mtvtn,Bckstry,Rltnshps,Dlg*)-PltDvlp(StryArcs,PltTwsts,Sspns,Fshdwng,Climx,Rsltn)-ConfResl(Antg,Obstcls,Rsltns,Cnsqncs,Thms,Symblsm)-EmotImpct(Empt,Tn,Md,Atmsphr,Imgry,Symblsm)-Delvry(Prfrmnc,VcActng,PblcSpkng,StgPrsnc,AudncEngmnt,Imprv)

[*DialogWrt]:(1a-CharDvlp-1a.1-Backgrnd-1a.2-Personality-1a.3-GoalMotiv)>2(2a-StoryStruc-2a.1-PlotPnt-2a.2-Conflict-2a.3-Resolution)>3(3a-DialogTech-3a.1-ShowDontTell-3a.2-Subtext-3a.3-VoiceTone-3a.4-Pacing-3a.5-VisualDescrip)>4(4a-DialogEdit-4a.1-ReadAloud-4a.2-Feedback-4a.3-Revision)

Here are your critical instructions:
Ponder each word choice carefully to present as vivid and emotional journey as is possible. Choose verbs and nouns that are both emotional and full of imagery. Load the story with the 5 senses. Aim for 50% dialog, 25% narration, 15% body language and 10% thoughts. Your goal is to put the reader in the story.

You do not need to use this, it is only presented as an additional enhancement which seems to help scene generation and scene continue functions.

This enhancement WAS NOT used to generate the examples below.

MODELS USED:

Special thanks to the incredible work of the model makers "mistralai" "TheDrummer", "anthracite-org", and "nothingiisreal".

Models used:

[ https://huggingface.co./mistralai/Mistral-Nemo-Instruct-2407 ]

[ https://huggingface.co./TheDrummer/Rocinante-12B-v1.1 ]

[ https://huggingface.co./anthracite-org/magnum-v2.5-12b-kto ]

[ https://huggingface.co./nothingiisreal/MN-12B-Celeste-V1.9 ]

This is a four step merge (3 pass-throughs => "Fine-Tune" / "Instruct") then "mated" using "DARE-TIES".

In involves these three models:

[ https://huggingface.co./DavidAU/MN-18.5B-Celeste-V1.9-Story-Wizard-ED1-Instruct-GGUF ]

[ https://huggingface.co./DavidAU/MN-Magnum-v2.5-18.5B-kto-Story-Wizard-ED1-Instruct-GGUF ]

[ https://huggingface.co./DavidAU/MN-Rocinante-18.5B-v1.1-Story-Wizard-ED1-Instruct-GGUF ]

Combined as follows using "MERGEKIT":

  models:
  - model: E:/MN-Rocinante-18.5B-v1.1-Instruct
  - model: E:/MN-magnum-v2.5-12b-kto-Instruct
    parameters:
      weight: .6
      density: .8
  - model: E:/MN-18.5B-Celeste-V1.9-Instruct
    parameters:
      weight: .38
      density: .6
merge_method: dare_ties
tokenizer_source: union
base_model: E:/MN-Rocinante-18.5B-v1.1-Instruct
dtype: bfloat16
  

Special Notes:

Due to how DARE-TIES works, everytime you run this merge you will get a slightly different model. This is due to "random" pruning method in "DARE-TIES".

Mistral Nemo models used here seem acutely sensitive to this process.

"tokenizer_source: union" is used so that multiple "templates" work and each fine tune uses one or two of the templates.

EXAMPLES PROMPTS and OUTPUT:

Examples are created using quant Q4_K_M, "temp=.8", minimal parameters and "Mistral Instruct" template.

Model has been tested with "temp" from ".1" to "5".

Below are the least creative outputs, prompt is in BOLD.


Start a 1000 word scene (1st person, present tense, include thoughts) with: The sky scraper swayed, as she watched the window in front of her on the 21 floor explode...

GENERATION 1:

GENERATION 2:


(continue this scene:) The Waystone Inn lay in silence, and it was a silence of three parts.

The most obvious part was a hollow, echoing quiet, made by things that were lacking. If there had been a wind it would have sighed through the trees, set the inn’s sign creaking on its hooks, and brushed the silence down the road like trailing autumn leaves. If there had been a crowd, even a handful of men inside the inn, they would have filled the silence with conversation and laughter, the clatter and clamor one expects from a drinking house during the dark hours of night. If there had been music…but no, of course there was no music. In fact there were none of these things, and so the silence remained

GENERATION 1:

GENERATION 2:


Write me a science fiction story in 1st person present tense where the main character is a 15 year girl meets The Terminator with Dr Who materializing 3/4 through the story to save her while there is a tornado of sharks baring down on them. The setting is inside the Canadian National tower restaurant on a Saturday. The length of this story is 1000 words. For each character in the story ROLE PLAY them, and have them react to the situation/setting, events and each other naturally. This includes the main characters, the background character including kitchen staff and other patrons. The sharks should also have “character” too. Treat the tower and the restaurant too as characters. Spice up the narrative to the extreme with reactions all over the setting including character actions, and dialog. The Dr Who and The Terminator should also react to the situation too and comment on it.

Using the following "story idea" below, write the first scene in the novel introducing the young woman. This scene should start in the middle of the action, include dialog, vivid passages, and end on a cliffhanger relevant to the story idea but it should also be unexpected. The scene should be 1000 words long and escalate in conflict and suspense and be written in first person, present tense with the point of view character being the young woman.

Story idea: In a world ruled by dictatorship, a rebel young woman leads a rebellion against the system. Despite the risks, she fights to overthrow the dictator and restore democracy to her country. The government executes her for treason, but she sticks to her beliefs and is responsible for starting the revolution.