Model Card for Llama-2-7b-chat-hf-AWQ

Model Details

This model is a AWQ quantized version of the meta-llama/Llama-2-7b-chat-hf model.

Developed by: Ted Whooley
Library: Transformers, AWQ
Model type: llama
Model name: Llama-2-7b-chat-hf-AWQ
Pipeline tag: text-generation
Qunatized by: twhoool02
Language(s) (NLP): en
License: other

Downloads last month: 74

Safetensors

Model size

1.13B params

Tensor type

I32

FP16

Inference Providers NEW

Text Generation

This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for twhoool02/Llama-2-7b-chat-hf-AWQ

Base model

meta-llama/Llama-2-7b-chat-hf

Quantized

(60)

this model