Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
neuralmagic
/
DeepSeek-R1-Distill-Llama-70B-quantized.w8a8
like
1
Follow
Neural Magic
350
Text Generation
Transformers
Safetensors
llama
deepseek
int8
vllm
llmcompressor
conversational
text-generation-inference
Inference Endpoints
8-bit precision
compressed-tensors
arxiv:
2210.17323
License:
mit
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
DeepSeek-R1-Distill-Llama-70B-quantized.w8a8
Commit History
Add reasoning evals
2fb57dc
verified
nm-research
commited on
2 days ago
Update README.md
9037a40
verified
nm-research
commited on
4 days ago
update tokenizer configs
1e8c6bc
nm-research
commited on
6 days ago
Update README.md
d03570d
verified
nm-research
commited on
9 days ago
Update README.md
46cdd00
verified
nm-research
commited on
9 days ago
Update README.md
49529f0
verified
nm-research
commited on
9 days ago
Update README.md
81dc57c
verified
nm-research
commited on
17 days ago
Update README.md
8b2e640
verified
nm-research
commited on
18 days ago
Create README.md
0c1ec85
verified
nm-research
commited on
18 days ago
Upload folder using huggingface_hub
6461fb3
verified
nm-research
commited on
25 days ago
initial commit
b2d55b3
verified
nm-research
commited on
25 days ago