Upload README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,125 @@
|
|
1 |
-
|
2 |
-
|
3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
This repository contains the full weights and LoRA weights for Zh-MT-LLM v1.0 which fine-tuned with ChatGLM3-6b-base.
|
2 |
+
|
3 |
+
### Zh-MT-LLM
|
4 |
+
|
5 |
+
Zheng He Maritime Large Language Model (Zh-MT-LLM) is a vertical domain maritime Large Language Model developed by the Intelligent Technology Laboratory of Dalian Maritime University for practitioners, trainers and students in the maritime field, providing questions and answers on maritime laws and regulations, maritime education and training, and questions and answers on maritime expertise.
|
6 |
+
|
7 |
+
Corresponding to the above three segments, our model has the following three main characteristics:
|
8 |
+
|
9 |
+
- Maritime Laws and Regulations Q&A:
|
10 |
+
|
11 |
+
The model is trained on a wide range of maritime laws and regulations, providing consulting services for those in the maritime field.
|
12 |
+
|
13 |
+
- Maritime Education and Training:
|
14 |
+
|
15 |
+
The model learns from maritime professional test questions, vocational examination syllabi, and high-quality crew common Q&A to provide training knowledge.
|
16 |
+
|
17 |
+
- Maritime Expertise Q&A:
|
18 |
+
|
19 |
+
The model covers ship maintenance, safety management, port operations, maritime logistics, navigation technology, marine environmental protection, and scientific research to answer questions for maritime industry practitioners.
|
20 |
+
|
21 |
+
|
22 |
+
|
23 |
+
### Zh-MT-SFT Dataset
|
24 |
+
|
25 |
+
The specific statistics of the dataset used for the above training are as follows:
|
26 |
+
|
27 |
+
<table>
|
28 |
+
<tbody style="text-align: center;">
|
29 |
+
<tr>
|
30 |
+
<th>Services</th>
|
31 |
+
<th>Subtasks</th>
|
32 |
+
<th>Data sets</th>
|
33 |
+
<th>Data volume</th>
|
34 |
+
</tr>
|
35 |
+
<tr>
|
36 |
+
<td rowspan="4">Maritime Laws and Regulations Q&A</td>
|
37 |
+
<td rowspan="2">Maritime Legal Advice</td>
|
38 |
+
<td>CrimeKgAssitant </td>
|
39 |
+
<td>18,279</td>
|
40 |
+
</tr>
|
41 |
+
<tr>
|
42 |
+
<td>Zh-law-qa</td>
|
43 |
+
<td>59,244</td>
|
44 |
+
</tr>
|
45 |
+
<tr>
|
46 |
+
<td>The Court held</td>
|
47 |
+
<td>Zh-law-court</td>
|
48 |
+
<td>2,684</td>
|
49 |
+
</tr>
|
50 |
+
<tr>
|
51 |
+
<td>Sentence projections</td>
|
52 |
+
<td>Zh-law-predict</td>
|
53 |
+
<td>3,004</td>
|
54 |
+
</tr>
|
55 |
+
<tr>
|
56 |
+
<td rowspan="2">Maritime education and training</td>
|
57 |
+
<td>Maritime Education Counseling</td>
|
58 |
+
<td>Zh-edu-qa</td>
|
59 |
+
<td>41,052</td>
|
60 |
+
</tr>
|
61 |
+
<tr>
|
62 |
+
<td>Maritime Specialization Question Bank</td>
|
63 |
+
<td>Zh-edu-qb</td>
|
64 |
+
<td>23,531</td>
|
65 |
+
</tr>
|
66 |
+
<tr>
|
67 |
+
<td rowspan="4">Maritime Expertise Q&A</td>
|
68 |
+
<td>Ship Knowledge</td>
|
69 |
+
<td rowspan="4">Zh-mt-qa</td>
|
70 |
+
<td rowspan="4">46,759</td>
|
71 |
+
</tr>
|
72 |
+
<tr>
|
73 |
+
<td>Navigational Knowledge</td>
|
74 |
+
</tr>
|
75 |
+
<tr>
|
76 |
+
<td>Port knowledge</td>
|
77 |
+
</tr>
|
78 |
+
<tr>
|
79 |
+
<td>Marine knowledge</td>
|
80 |
+
</tr>
|
81 |
+
<tr>
|
82 |
+
<td rowspan="1">Generic Dialogue</td>
|
83 |
+
<td></td>
|
84 |
+
<td>moss-003-sft-data</td>
|
85 |
+
<td>300,000</td>
|
86 |
+
</tr>
|
87 |
+
<tr>
|
88 |
+
<td>Total</td>
|
89 |
+
<td colspan="3">494,553</td>
|
90 |
+
</tr>
|
91 |
+
</tbody>
|
92 |
+
</table>
|
93 |
+
|
94 |
+
### Code Usage
|
95 |
+
|
96 |
+
You can create a conversation using the Zh-MT-LLM model using the following codes:
|
97 |
+
|
98 |
+
```python
|
99 |
+
>>>from transformers import AutoTokenizer, AutoModel
|
100 |
+
>>>tokenizer = AutoTokenizer.from_pretrained("ZhangFuXi/Zh-MT-LLM",trust_remote_code=True)
|
101 |
+
>>>model = AutoModel.from_pretrained("ZhangFuXi/Zh-MT-LLM", trust_remote_code=True).half().cuda()
|
102 |
+
>>>model = model.eval()
|
103 |
+
>>>response, history = model.chat(tokenizer, "你好", history=[])
|
104 |
+
>>>print(response)
|
105 |
+
```
|
106 |
+
|
107 |
+
|
108 |
+
|
109 |
+
### Declaration
|
110 |
+
|
111 |
+
Due to factors such as the limitation of the number of model parameters and the degree of cleaning of the training data, the model open source in this project may have the following limitations:
|
112 |
+
|
113 |
+
- Because it has not been harmlessly fine-tuned, it may result in discriminatory, harmful and unethical statements.
|
114 |
+
- Lacking an accurate understanding of the real world, the model may produce hallucinatory responses that mislead the user.
|
115 |
+
- The model's training data may contain biased data, and users should be cautious about potential bias in model responses.
|
116 |
+
|
117 |
+
- Due to the limited number of model parameters, it may not be possible to cover all areas of knowledge, resulting in less accurate or complete responses on some topics.
|
118 |
+
|
119 |
+
- When dealing with factual knowledge in a specific domain, models may provide incorrect answers due to insufficient information or misinterpretation, leading to misinformation or confusion.
|
120 |
+
|
121 |
+
Given the limitations of the above model, we request that the code, data, and model of this project not be used for socially harmful purposes and must follow the [MODEL_LICENCE](https://github.com/THUDM/ChatGLM3/blob/main/MODEL_LICENSE) of the base model. We are not responsible for any problems, risks, or adverse consequences arising from the use of Zh-MT-LLM.
|
122 |
+
|
123 |
+
### Licenses
|
124 |
+
|
125 |
+
The use of the source code in this repository complies with the Apache 2.0 License.
|