chatglm-6b-int8 / quantization.py

Commit History

Add support for parallel quantization on Mac
a697125

zxdu20 committed on

Remove assert in load_cpu_kernel
3218e92

zxdu20 committed on

Sync with chatglm-6b
216185d

zxdu20 committed on

Init commit
fb85b4d

zxdu20 committed on