minyichen
/

aya-expanse-32b-Dynamic-fp8

Model card Files Files and versions Community

Description

This repo contains fp8 model files for aya-expanse-32b.

Quantization parameter

activation_scheme : dynamic
quant_method : fp8

Downloads last month: 17

Safetensors

Model size

32.3B params

Tensor type

FP16

·

F8_E4M3

·

Inference API

Unable to determine this model's library. Check the docs .

Model tree for minyichen/aya-expanse-32b-Dynamic-fp8

Base model

CohereForAI/aya-expanse-32b

Quantized

(18)

this model