|
--- |
|
|
|
datasets: |
|
- Minami-su/toxic-sft-zh |
|
- llm-wizard/alpaca-gpt4-data-zh |
|
- stephenlzc/stf-alpaca |
|
language: |
|
- zh |
|
license: mit |
|
pipeline_tag: text-generation |
|
tags: |
|
- text-generation-inference |
|
- code |
|
- unsloth |
|
task_categories: |
|
- conversational |
|
base_model: shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat |
|
|
|
widget: |
|
- text: "Is this review positive or negative? Review: Best cast iron skillet you will ever buy." |
|
example_title: "Sentiment analysis" |
|
- text: "Barack Obama nominated Hilary Clinton as his secretary of state on Monday. He chose her because she had ..." |
|
example_title: "Coreference resolution" |
|
- text: "On a shelf, there are five books: a gray book, a red book, a purple book, a blue book, and a black book ..." |
|
example_title: "Logic puzzles" |
|
- text: "The two men running to become New York City's next mayor will face off in their first debate Wednesday night ..." |
|
example_title: "Reading comprehension" |
|
--- |
|
|
|
|
|
## Model Details |
|
|
|
### Model Description |
|
|
|
Using **shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat** as base model, and finetune the dataset as mentioned. |
|
|
|
## Training Procedure |
|
[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1K9stY8LMVcySG0jDMYZdWQCFPfoDFBL-?usp=sharing) |
|
|
|
## Training Procedure Raw Files |
|
|
|
ALL the procedure are training on: |
|
**[[Vast.ai](https://vast.ai/logo.svg)](https://cloud.vast.ai/?ref_id=138637)** |
|
|
|
**Hardware**: |
|
**GPU**: 1x A100 SXM4 80GB |
|
**CPU**: AMD EPYC 7513 32-Core Processor |
|
**RAM**: 129 GB |
|
**Docker Image**: pytorch/pytorch:2.2.0-cuda12.1-cudnn8-devel |
|
**Environment** |
|
```python |
|
{ |
|
"DATA_DIRECTORY": "/workspace/", |
|
"JUPYTER_DIR": "/" |
|
} |
|
``` |
|
**[ipynb file](https://huggingface.co./stephenlzc/dolphin-llama3-zh-cn-uncensored/blob/main/config.json)** |
|
|
|
### Training Data |
|
**Base Model** |
|
[shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat](https://huggingface.co./shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat) |
|
|
|
**Dataset** |
|
- [Minami-su/toxic-sft-zh](https://huggingface.co./datasets/Minami-su/toxic-sft-zh) |
|
- [llm-wizard/alpaca-gpt4-data-zh](https://huggingface.co./datasets/llm-wizard/alpaca-gpt4-data-zh) |
|
- [stephenlzc/stf-alpaca](https://huggingface.co./datasets/stephenlzc/stf-alpaca) |