Optimum
The Optimum library supports quantization for Intel, Furiosa, ONNX Runtime, GPTQ, and lower-level PyTorch quantization functions. Consider using Optimum for quantization if you're targeting specific optimized hardware such as Intel CPUs or Furiosa NPUs, or a model accelerator such as ONNX Runtime.
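As a rough illustration of the ONNX Runtime path, the sketch below exports a checkpoint to ONNX and applies dynamic int8 quantization with Optimum. It assumes `optimum[onnxruntime]` is installed; the checkpoint name, the output directory, and the choice of the AVX512-VNNI configuration are illustrative assumptions rather than anything this page prescribes, and the helper `quantize_dynamic_int8` is a hypothetical wrapper, not part of Optimum.

```python
# Assumed example values; swap in your own checkpoint and output directory.
DEFAULT_MODEL_ID = "distilbert-base-uncased-finetuned-sst-2-english"
DEFAULT_SAVE_DIR = "quantized_model"


def quantize_dynamic_int8(model_id=DEFAULT_MODEL_ID, save_dir=DEFAULT_SAVE_DIR):
    """Export `model_id` to ONNX and apply dynamic int8 quantization.

    Hypothetical helper for illustration. Returns whatever
    `ORTQuantizer.quantize` returns (the location of the quantized model).
    """
    # Imported lazily so this file can be imported without Optimum installed.
    from optimum.onnxruntime import ORTModelForSequenceClassification, ORTQuantizer
    from optimum.onnxruntime.configuration import AutoQuantizationConfig

    # Export the PyTorch checkpoint to ONNX while loading it.
    onnx_model = ORTModelForSequenceClassification.from_pretrained(model_id, export=True)

    # Build a quantizer from the exported model.
    quantizer = ORTQuantizer.from_pretrained(onnx_model)

    # Dynamic quantization needs no calibration dataset. The AVX512-VNNI
    # preset is an assumption about the target CPU; other presets exist.
    qconfig = AutoQuantizationConfig.avx512_vnni(is_static=False, per_channel=False)

    return quantizer.quantize(save_dir=save_dir, quantization_config=qconfig)


if __name__ == "__main__":
    # Downloads the checkpoint on first run.
    print(quantize_dynamic_int8())
```

Dynamic quantization is used here only because it avoids a calibration step; static quantization or a different hardware preset may suit your target better.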