--- language: - en - fr - de - es - it - pt - ru - zh - ja license: cc-by-nc-4.0 library_name: transformers tags: - nsfw - Visual novel - roleplay - mergekit - merge base_model: - mistralai/Mistral-Small-Instruct-2409 pipeline_tag: text-generation --- # Model Card for Model ID ![image](./cover.png) Merged model using [mergekit](https://github.com/arcee-ai/mergekit/tree/main/mergekit) This model aimed to act like visual novel character. ## Merge Format ```yaml models: - model: mistralai/Mistral-Small-Instruct-2409_SFT layer_range: [0, 56] - model: mistralai/Mistral-Small-Instruct-2409 layer_range: [0, 56] merge_method: slerp base_model: mistralai/Mistral-Small-Instruct-2409_SFT parameters: t: - filter: self_attn value: [0, 0.5, 0.3, 0.7, 1] - filter: mlp value: [1, 0.5, 0.7, 0.3, 0] - value: 0.5 # fallback for rest of tensors dtype: bfloat16 ``` # WaifuModel Collections - [TTS](https://huggingface.co./spow12/visual_novel_tts) - [Chat](https://huggingface.co./spow12/ChatWaifu_22B_v2.0) - [ASR](https://huggingface.co./spow12/Visual-novel-transcriptor) # Unified demo [WaifuAssistant](https://github.com/yw0nam/WaifuAssistant) # Update 2.0 - 2024.09.23 Update 22B, Ver 2.0 ## Model Details ### Model Description - **Developed by:** spow12(yw_nam) - **Shared by :** spow12(yw_nam) - **Model type:** CausalLM - **Language(s) (NLP):** japanese. English - **Finetuned from model :** [mistralai/Mistral-Small-Instruct-2409](https://huggingface.co./mistralai/Mistral-Small-Instruct-2409) Currently, chatbot has below personality. character | visual_novel | --- | --- | ムラサメ | Senren*Banka | 茉子 | Senren*Banka | 芳乃 | Senren*Banka | レナ | Senren*Banka | 千咲 | Senren*Banka | 芦花 | Senren*Banka | 愛衣 | Café Stella and the Reaper's Butterflies | 栞那 | Café Stella and the Reaper's Butterflies | ナツメ | Café Stella and the Reaper's Butterflies | 希 | Café Stella and the Reaper's Butterflies | 涼音 | Café Stella and the Reaper's Butterflies | あやせ | Riddle Joker | 七海 | Riddle Joker | 羽月 | Riddle Joker | 茉優 | Riddle Joker | 小春 | Riddle Joker | ### Feature - Fluent Chat performance - Reduce repetition problem when generate with many turn(over 20~30) - Zero Shot character persona using description of character. - 128k context window - Memory ability that does not forget even after long-context generation ## Demo You can use Demo in google colab. Check [Here](https://colab.research.google.com/drive/194_FN28reEPTwS51dwpLLBBwEfeoBjP9?usp=sharing) ## Bias, Risks, and Limitations This model trained by japanese dataset included visual novel which contain nsfw content. So, The model may generate NSFW content. ## Use & Credit This model is currently available for non-commercial & Research purpose only. Also, since I'm not detailed in licensing, I hope you use it responsibly. By sharing this model, I hope to contribute to the research efforts of our community (the open-source community and anime persons). This repository can use Visual novel-based RAG, but i will not distribute it yet because i'm not sure if it is permissible to release the data publicly. ## Citation ```bibtex @misc {ChatWaifu_22B_v2.0 author = { YoungWoo Nam }, title = { ChatWaifu_22B_v2.0 }, year = 2024, url = { https://huggingface.co./spow12/ChatWaifu_22B_v2.0 }, publisher = { Hugging Face } } ```