Quantization method?

#3
by monology - opened

Would like to publish other 8-bit models for use with hosted inference - instructions to reproduce this quantization method would be appreciated.

RonanMcGovern changed discussion status to closed

Sign up or log in to comment