Model Card for Llama-2-7b-chat-hf-AWQ

Model Details

This model is a AWQ quantized version of the meta-llama/Llama-2-7b-chat-hf model.

  • Developed by: Ted Whooley
  • Library: Transformers, AWQ
  • Model type: llama
  • Model name: Llama-2-7b-chat-hf-AWQ
  • Pipeline tag: text-generation
  • Qunatized by: twhoool02
  • Language(s) (NLP): en
  • License: other
Downloads last month
74
Safetensors
Model size
1.13B params
Tensor type
I32
·
FP16
·
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for twhoool02/Llama-2-7b-chat-hf-AWQ

Quantized
(60)
this model