Deploy model A800
#4 opened 12 days ago
by
czqqq
Inference error: The current context does not support K-shift
#3 opened 26 days ago
by
lollmaolol
Tested Q6, uses 567Gb Ram
7
#2 opened about 1 month ago
by
krustik
Using -ctk q4_0 -ctv q4_0 with llama.cpp server throws flash_attn error
#1 opened about 1 month ago
by
softwareweaver
