unsloth
/

DeepSeek-R1-GGUF

Text Generation

Inference Endpoints

Model card Files Files and versions Community

Resources

View closed (3)

md5 / sha256 hashes please

#18 opened 2 days ago by

Is there a model removing non-shared MoE experts?

#17 opened 2 days ago by

A Step-by-step deployment guide with ollama

#16 opened 3 days ago by

No think tokens visible

#15 opened 4 days ago by

Over 2 tok/sec agg backed by NVMe SSD on 96GB RAM + 24GB VRAM AM5 rig with llama.cpp

#13 opened 4 days ago by

Running the model with vLLM does not actually work

#12 opened 5 days ago by

DeepSeek-R1-GGUF on LMStudio not available

#11 opened 5 days ago by

Where did the BF16 come from?

#10 opened 5 days ago by

Inference speed

#9 opened 5 days ago by

Running this model using vLLM Docker

#8 opened 6 days ago by

UD-IQ1_M models for distilled R1 versions?

#6 opened 6 days ago by

Llama.cpp server chat template

#4 opened 9 days ago by

Are the Q4 and Q5 models R1 or R1-Zero

#2 opened 13 days ago by

What is the VRAM requirement to run this ?

#1 opened 14 days ago by