fireballoon
commited on
Commit
•
4445038
1
Parent(s):
e5120e3
Update README.md
Browse files
README.md
CHANGED
@@ -18,6 +18,8 @@ baichuan-vicuna-7b is a chat model supervised finetuned on vicuna sharegpt data.
|
|
18 |
|
19 |
[中文说明](#chinese-model-card)
|
20 |
|
|
|
|
|
21 |
# Load model
|
22 |
```python
|
23 |
from transformers import AutoTokenizer, AutoModelForCausalLM
|
@@ -124,7 +126,7 @@ This algorithm has a runtime complexity of O(log n) and a space complexity of O(
|
|
124 |
baichuan-vicuna-7b是在vicuna sharegpt数据上全参数微调的对话模型。
|
125 |
- 基座模型是[baichuan-7B](https://huggingface.co/baichuan-inc/baichuan-7B),由百川智能开发的可商用大规模预训练模型。
|
126 |
- 微调数据包括[ShareGPT](https://huggingface.co/datasets/anon8231489123/ShareGPT_Vicuna_unfiltered/blob/main/ShareGPT_V3_unfiltered_cleaned_split.json),混合一定比例的[COT](https://huggingface.co/datasets/QingyiSi/Alpaca-CoT)和[Leetcode](https://www.kaggle.com/datasets/erichartford/leetcode-solutions)数据以提升模型的推理和代码能力(数据混合策略受到[TULU](https://arxiv.org/abs/2306.04751)研究成果的启发)。
|
127 |
-
-
|
128 |
|
129 |
|
130 |
# Test examples on Ziyan Eval
|
|
|
18 |
|
19 |
[中文说明](#chinese-model-card)
|
20 |
|
21 |
+
**[New]** [baichuan-vicuna-chinese-7b](https://huggingface.co/fireballoon/baichuan-vicuna-chinese-7b), baichuan finetuned on both English and Chinese ShareGPT.
|
22 |
+
|
23 |
# Load model
|
24 |
```python
|
25 |
from transformers import AutoTokenizer, AutoModelForCausalLM
|
|
|
126 |
baichuan-vicuna-7b是在vicuna sharegpt数据上全参数微调的对话模型。
|
127 |
- 基座模型是[baichuan-7B](https://huggingface.co/baichuan-inc/baichuan-7B),由百川智能开发的可商用大规模预训练模型。
|
128 |
- 微调数据包括[ShareGPT](https://huggingface.co/datasets/anon8231489123/ShareGPT_Vicuna_unfiltered/blob/main/ShareGPT_V3_unfiltered_cleaned_split.json),混合一定比例的[COT](https://huggingface.co/datasets/QingyiSi/Alpaca-CoT)和[Leetcode](https://www.kaggle.com/datasets/erichartford/leetcode-solutions)数据以提升模型的推理和代码能力(数据混合策略受到[TULU](https://arxiv.org/abs/2306.04751)研究成果的启发)。
|
129 |
+
- 训练代码:https://huggingface.co/fireballoon/baichuan-vicuna-7b/blob/main/train_vicuna.py, 代码基于[FastChat](https://github.com/lm-sys/FastChat)。
|
130 |
|
131 |
|
132 |
# Test examples on Ziyan Eval
|