arxiv:2407.02068
Kaixin Xu
kartmannXu
AI & ML interests
Neural Network Compression, Efficient AI
Organizations
None yet
Papers
3
models
12
kartmannXu/MiniCPM-2B-128k-pruned-0.3-onnx
Updated
•
8
kartmannXu/MiniCPM-2B-128k-q4f16_1_mlc
Updated
kartmannXu/MiniCPM-2B-128k-prune-ch-0.3-q0f16-mlc
Updated
kartmannXu/MiniCPM-2B-128k-prune-ch-0.25-q0f16-mlc
Updated
kartmannXu/MiniCPM-2B-128k-prune-ch-0.3-q3f16_1-mlc
Updated
kartmannXu/MiniCPM-2B-128k-prune-ch-0.3-q4f16_2-mlc
Updated
kartmannXu/MiniCPM-2B-128k-prune-ch-0.25-q3f16_1-mlc
Updated
kartmannXu/MiniCPM-2B-128k-prune-ch-0.25-q4f16_2-mlc
Updated
kartmannXu/MiniCPM-2B-128k-prune-bl-0.3-q3f16_1-mlc
Updated
kartmannXu/MiniCPM-2B-128k-q4f16_2_mlc
Updated
datasets
None public yet