Edit model card

omost-llama-3-8b-4bits is Omost's llama-3 model with 8k context length in nf4.

Downloads last month
4,905
Safetensors
Model size
4.65B params
Tensor type
BF16
F32
U8
Inference Examples
Inference API (serverless) is not available, repository is disabled.