Commit History

QKV fused and all linear layers sym
3cee2a6
verified

GiusFra committed on

QKV fused and all linear layers sym
cf48f0f
verified

GiusFra committed on

QKV fused and sym
6f44cfb
verified

GiusFra committed on

QKV fused and sym
b175bf6
verified

GiusFra committed on

Fused QKV quant_params.json with zp
7e99883
verified

GiusFra committed on

Added vae weights with FP16 fix.
2de7ba8

nickfraser committed on

Fused QKV safetensor with zp
0339659
verified

GiusFra committed on

Fused QKV safetensor
348012d
verified

GiusFra committed on

Fused QKV quant_params.json
a793c5a
verified

GiusFra committed on

Fix model loading
7f81513
verified

GiusFra committed on

Update quant params structure (#2)
6b62ce4
verified

nickfraser committed on

Reference inputs
17638f5
verified

GiusFra committed on

Updated quant_params
fb3aa3b
verified

GiusFra committed on

Updated params.safetensors
36c8b73
verified

GiusFra committed on

Output reference tensors
6e61570
verified

GiusFra committed on

Quantization script
ecec5b7
verified

GiusFra committed on

Remove potential overflow / saturation error.
161df88

nickfraser committed on

Added comments - highlight possible overflow situation
3f5851c

nickfraser committed on

Updated math model to target int8 x int8 kernels.
4024f9d

nickfraser committed on

Updated QOp model to fuse SmoothQuant scales with input quantization
dca9b6e

nickfraser committed on

Output reference tensors
8e3c05a
verified

GiusFra committed on

Add config.json from stable-diffusion-xl-base-1.0/unet
54be8be

Stella Laurenzo committed on

Upload params.safetensors with huggingface_hub
1dad0d1
verified

GiusFra committed on

add missing smoothquant factors
99e9d19
verified

GiusFra committed on

update quant_params with correct shapes
d6a388a
verified

GiusFra committed on

Fix: set `keepdim=True`
9ab1060

nickfraser committed on

[test] Fixed shapes to match new `quant_param.json`
673c9f2

nickfraser committed on

[math_model/test] Added "QOp" implementation and basic tests.
eb5a5f6

nickfraser committed on

Upload quant_param.json with huggingface_hub
d67ece3
verified

GiusFra committed on

Upload quant_param.json with huggingface_hub
bcd05a6
verified

GiusFra committed on

Upload math_model.py with huggingface_hub
049c65f
verified

GiusFra committed on

Upload params.safetensors with huggingface_hub
742c3ad
verified

GiusFra committed on

Upload params.safetensors with huggingface_hub
76a91d8
verified

GiusFra committed on

Upload quant_param.json with huggingface_hub
01fc5a5
verified

GiusFra committed on

Upload math_model.py with huggingface_hub
d5dfd96
verified

GiusFra committed on

Upload quant_param.json with huggingface_hub
88730c2
verified

GiusFra committed on