Commit History
Update README.md
826ca34
verified
Merge branch 'main' of https://huggingface.co./THUDM/chatglm-6b-int4
6c5205c
duzx16
commited on
Update license
bb09de3
duzx16
commited on
Upload pytorch_model.bin
02a065c
Update slack link
e214c5b
Update decode method in tokenizer
d8a6cfc
Add support for parallel quantization on Mac
f6b88da
Remove assert in load_cpu_kernel
63d66b0
Sync with chatglm-6b
f55a108
Remove pytorch_model.bin.index.json
e02ba89
Update slack link
6498797
Add pytorch_model.bin.index.json
1e40d96
Add assertion when loading cpu and cuda kernel fails
630d0ef
songxxzp
commited on
Add assertion when loading cpu and cuda kernel fails
bcc35f0
songxxzp
commited on
Merge branch 'dev'
fe0674f
songxxzp
commited on
Update CPU kernel loading method
c7d8998
songxxzp
commited on