add example for 4bit inference?

#11
by ct-2 - opened

There seems to be an explanation to finetune the model in 4bit, would it be possible to provide more info on 4bit inference? Thanks!

Sign up or log in to comment