Quantization method?
#3
by
monology
- opened
Would like to publish other 8-bit models for use with hosted inference - instructions to reproduce this quantization method would be appreciated.
RonanMcGovern
changed discussion status to
closed