Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Trelis
/
Llama-2-7b-chat-hf-hosted-inference-8bit
like
7
Follow
Trelis
98
Text Generation
Transformers
PyTorch
Safetensors
English
llama
facebook
meta
llama-2
hosted inference
8 bit
8bit
8-bit precision
text-generation-inference
Inference Endpoints
arxiv:
2307.09288
Model card
Files
Files and versions
Community
3
Train
Deploy
Use this model
1244faf
Llama-2-7b-chat-hf-hosted-inference-8bit
Commit History
Adding `safetensors` variant of this model
1244faf
SFconvertbot
commited on
Sep 7, 2023
update notes on inference
0b6cac6
RonanMcGovern
commited on
Aug 14, 2023
Upload LlamaForCausalLM
5344658
RonanMcGovern
commited on
Aug 12, 2023
Delete pytorch_model-00002-of-00002.bin
01cc865
RonanMcGovern
commited on
Aug 12, 2023
Delete pytorch_model-00001-of-00002.bin
bac28cc
RonanMcGovern
commited on
Aug 12, 2023
Upload tokenizer
4f8f95e
RonanMcGovern
commited on
Aug 12, 2023
Upload LlamaForCausalLM
4609063
RonanMcGovern
commited on
Aug 12, 2023
commit 8-bit model
c78a248
RonanMcGovern
commited on
Aug 12, 2023
initial commit
4e110b0
RonanMcGovern
commited on
Aug 12, 2023