bartowski
/

DeepSeek-R1-GGUF

Text Generation

Inference Endpoints

Model card Files Files and versions Community

Resources

View closed (0)

Deploy model A800

#4 opened 12 days ago by

Inference error: The current context does not support K-shift

#3 opened 26 days ago by

Tested Q6, uses 567Gb Ram

#2 opened about 1 month ago by

Using -ctk q4_0 -ctv q4_0 with llama.cpp server throws flash_attn error

#1 opened about 1 month ago by