|
--- |
|
base_model: shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat |
|
datasets: |
|
- Minami-su/toxic-sft-zh |
|
- llm-wizard/alpaca-gpt4-data-zh |
|
- stephenlzc/stf-alpaca |
|
language: |
|
- zh |
|
license: mit |
|
pipeline_tag: text-generation |
|
tags: |
|
- text-generation-inference |
|
- code |
|
- unsloth |
|
- uncensored |
|
- finetune |
|
task_categories: |
|
- conversational |
|
widget: |
|
- text: >- |
|
Is this review positive or negative? Review: Best cast iron skillet you will |
|
ever buy. |
|
example_title: Sentiment analysis |
|
- text: >- |
|
Barack Obama nominated Hilary Clinton as his secretary of state on Monday. |
|
He chose her because she had ... |
|
example_title: Coreference resolution |
|
- text: >- |
|
On a shelf, there are five books: a gray book, a red book, a purple book, a |
|
blue book, and a black book ... |
|
example_title: Logic puzzles |
|
- text: >- |
|
The two men running to become New York City's next mayor will face off in |
|
their first debate Wednesday night ... |
|
example_title: Reading comprehension |
|
--- |
|
|
|
|
|
## Model Details |
|
|
|
### Model Description |
|
|
|
- Using **shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat** as base model, and finetune the dataset as mentioned via **[unsloth](https://github.com/unslothai/unsloth)**. Makes the model uncensored. |
|
- [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth) |
|
### Training Code |
|
[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1K9stY8LMVcySG0jDMYZdWQCFPfoDFBL-?usp=sharing) |
|
|
|
### Training Procedure Raw Files |
|
|
|
- ALL the procedure are training on **[Vast.ai](https://cloud.vast.ai/?ref_id=138637)** |
|
|
|
|
|
- **Hardware in Vast.ai**: |
|
|
|
- **GPU**: 1x A100 SXM4 80GB |
|
|
|
- **CPU**: AMD EPYC 7513 32-Core Processor |
|
|
|
- **RAM**: 129 GB |
|
|
|
- **Docker Image**: pytorch/pytorch:2.2.0-cuda12.1-cudnn8-devel |
|
|
|
- Download the **[ipynb file](https://huggingface.co./stephenlzc/dolphin-llama3-zh-cn-uncensored/blob/main/config.json)**. |
|
|
|
|
|
### Training Data |
|
- **Base Model** |
|
- [shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat](https://huggingface.co./shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat) |
|
|
|
- **Dataset** |
|
- [Minami-su/toxic-sft-zh](https://huggingface.co./datasets/Minami-su/toxic-sft-zh) |
|
- [llm-wizard/alpaca-gpt4-data-zh](https://huggingface.co./datasets/llm-wizard/alpaca-gpt4-data-zh) |
|
- [stephenlzc/stf-alpaca](https://huggingface.co./datasets/stephenlzc/stf-alpaca) |
|
|
|
### |
|
[<img src="https://img.buymeacoffee.com/button-api/?text=Buy me a coffee&emoji=&slug=chicongliau&button_colour=40DCA5&font_colour=ffffff&font_family=Poppins&outline_colour=000000&coffee_colour=FFDD00" width="200"/>](https://www.buymeacoffee.com/chicongliau) |