SmolLM2-1.7B-Instruct / onnx /model_quantized.onnx

Commit History

Fix q8 weights (use uint8 for q8; int8 produces poor results) (#18)
b75eb65
verified

Xenova HF staff commited on

Upload optimized ONNX weights (deduplicated) (#17)
b36fc77
verified

Xenova HF staff commited on