neuralmagic/Meta-Llama-3.1-405B-Instruct-quantized.w4a16
Text Generation
Safetensors
8 languages
llama
int4
vllm
conversational
compressed-tensors
arxiv: 2210.17323
License: llama3.1
main
Meta-Llama-3.1-405B-Instruct-quantized.w4a16/README.md
Commit History
Update README.md
9db6306
verified
alexmarques
committed on
Oct 10
Update README.md
6b753ea
verified
alexmarques
committed on
Sep 30
Update README.md
7d1f72d
verified
alexmarques
committed on
Aug 13
Update README.md
bb83fe4
verified
abhinavnmagic
committed on
Aug 13
Update README.md
91a872b
verified
abhinavnmagic
committed on
Aug 12
Update README.md
a8c9e50
verified
abhinavnmagic
committed on
Aug 9
Create README.md
2abcd4a
verified
abhinavnmagic
committed on
Aug 9