Quantization

#3
by laelhalawani - opened

Hi, are there any quantization strategies / libs compatible with this architecture?
