jxke-MBA-M3-13i commited on
Commit
fef7ac9
1 Parent(s): 9b7f617

Convert Qwen1.5-4B-Chat to RKLLM format for Rockchip devices

Browse files
Files changed (1) hide show
  1. README.md +63 -0
README.md ADDED
@@ -0,0 +1,63 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - zh
4
+ - en
5
+ pipeline_tag: text-generation
6
+ tags:
7
+ - qwen
8
+ - Qwen1.5-4B-Chat
9
+ - rkllm
10
+ - rk3588
11
+ ---
12
+
13
+ # Qwen1.5-4B-Chat-rkllm
14
+ This is a conversion from [Qwen/Qwen1.5-4B-Chat](https://huggingface.co/Qwen/Qwen1.5-4B-Chat)
15
+ to the RKLLM format for chat in Rockchip devices.
16
+
17
+
18
+ - [Qwen1.5-4B-Chat-rkllm](#qwen15-4b-chat-rkllm)
19
+ - [Support Devices](#support-devices)
20
+ - [Convert tools](#convert-tools)
21
+ - [Converted with RKLLM runtime](#converted-with-rkllm-runtime)
22
+ - [License](#license)
23
+ - [Trouble shot](#trouble-shot)
24
+ - [Reference](#reference)
25
+
26
+
27
+ ## Support Devices
28
+ * RK3588/RK3588s
29
+
30
+ ## Convert tools
31
+ To Converting LLMs for Rockchip's NPUs, please see the artical<sup>1,2</sup> for model details.
32
+
33
+ ## Converted with RKLLM runtime
34
+ * RKLLM runtime `1.0.1`
35
+
36
+ ## License
37
+ Same as the original [Qwen/Qwen1.5-4B-Chat](https://huggingface.co/Qwen/Qwen1.5-4B-Chat)
38
+
39
+
40
+ ## Trouble shot
41
+ * `E RKNN: [10:48:59.683] failed to allocate handle, ret: -1, errno: 12, errstr: Cannot allocate memory`
42
+ ```shell
43
+ firefly@firefly:~/Documents/rknn-llm$ rkllm ./chatglm3-6b.rkllm
44
+ rkllm init start
45
+ rkllm-runtime version: 1.0.1, rknpu driver version: 0.8.2, platform: RK3588
46
+ Warning: Your rknpu driver version is too low, please upgrade to 0.9.6.
47
+ E RKNN: [10:48:59.683] failed to allocate handle, ret: -1, errno: 12, errstr: Cannot allocate memory
48
+
49
+ can not create weight memory for domain1
50
+ E RKNN: [10:49:00.480] failed to allocate handle, ret: -1, errno: 12, errstr: Cannot allocate memory
51
+
52
+ can not create weight memory for domain2
53
+ E RKNN: [10:49:05.216] failed to convert handle(1020) to fd, ret: -1, errno: 24, errstr: Too many open files
54
+
55
+ # Solution
56
+ firefly@firefly:~/Documents/rknn-llm$ ulimit -n 102400
57
+ ```
58
+
59
+ ## Reference
60
+ 1. [airockchip/rknn-llm](https://github.com/airockchip/rknn-llm)
61
+ 1. [Pelochus/ezrknn-llm](https://github.com/Pelochus/ezrknn-llm)
62
+ 2. [Qwen/Qwen1.5-4B-Chat](https://huggingface.co/Qwen/Qwen1.5-4B-Chat)
63
+ 3. [跑大模型遇到问题 #62](https://github.com/airockchip/rknn-llm/issues/62)