请问这个4bit模型支持vllm启动吗?
#11 opened 8 months ago
by
SongXiaoMao

AWQ quantised?
#8 opened 10 months ago
by
epignatelli
Possible to do inference on long contexts with limited VRAM?
1
#6 opened 10 months ago
by
danabo
Excellant model, fine tuning resources
#5 opened 11 months ago
by
ewre324
Running on 3x24 GB RAM?
3
#3 opened 11 months ago
by
Marcophono