Commits · amd-shark/sdxl-quant-int8

Feat (math model/tests): Updated math model and tests to match use format

1b07a9d

nickfraser committed on Jun 28

Quantization script

ecec5b7
verified

GiusFra committed on Jun 21

Remove potential overflow / saturation error.

161df88

nickfraser committed on Jun 19

Added comments - highlight possible overflow situation

3f5851c

nickfraser committed on Jun 19

Updated math model to target int8 x int8 kernels.

4024f9d

nickfraser committed on Jun 19

Updated QOp model to fuse SmoothQuant scales with input quantization

dca9b6e

nickfraser committed on Jun 18

Output reference tensors

8e3c05a
verified

GiusFra committed on Jun 14

Add config.json from stable-diffusion-xl-base-1.0/unet

54be8be

Stella Laurenzo committed on Jun 12

Upload params.safetensors with huggingface_hub

1dad0d1
verified

GiusFra committed on Jun 12

add missing smoothquant factors

99e9d19
verified

GiusFra committed on Jun 12

update quant_params with correct shapes

d6a388a
verified

GiusFra committed on Jun 11

Fix: set `keepdim=True`

9ab1060

nickfraser committed on Jun 11

[test] Fixed shapes to match new `quant_param.json`

673c9f2

nickfraser committed on Jun 11

[math_model/test] Added "QOp" implementation and basic tests.

eb5a5f6

nickfraser committed on Jun 11

Upload quant_param.json with huggingface_hub

d67ece3
verified

GiusFra committed on Jun 7

Upload quant_param.json with huggingface_hub

bcd05a6
verified

GiusFra committed on Jun 7

Upload math_model.py with huggingface_hub

049c65f
verified

GiusFra committed on Jun 7

Upload params.safetensors with huggingface_hub

742c3ad
verified

GiusFra committed on Jun 7

Upload params.safetensors with huggingface_hub

76a91d8
verified

GiusFra committed on Jun 7

Upload quant_param.json with huggingface_hub

01fc5a5
verified

GiusFra committed on Jun 7

Upload math_model.py with huggingface_hub

d5dfd96
verified

GiusFra committed on Jun 7

Upload quant_param.json with huggingface_hub

88730c2
verified

GiusFra committed on Jun 6

initial commit

af8fc68
verified

stellaraccident committed on Jun 5
