[AUTOMATED] Model Memory Requirements
#6 opened 6 months ago by model-sizer-bot
Why is there no Qwen2-57B-A14B-Instruct-GPTQ-Int8?
#5 opened 6 months ago by Vaccummer
When using vLLM, I get the error: CUDA out of memory
#3 opened 7 months ago by zhaoyang0618
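For the CUDA out-of-memory report in #3, a common first step with vLLM is to lower the fraction of GPU memory the engine reserves, cap the context length to shrink the KV cache, or shard the model across GPUs. A hypothetical invocation as a sketch only; the model path and all values are assumptions, not settings confirmed in any of the threads above:

```
# Sketch: reduce vLLM's memory footprint (illustrative values).
vllm serve Qwen/Qwen2-57B-A14B-Instruct-GPTQ-Int4 \
  --quantization gptq \
  --gpu-memory-utilization 0.85 \
  --max-model-len 4096 \
  --tensor-parallel-size 2
```

`--gpu-memory-utilization` bounds how much of each GPU vLLM claims, `--max-model-len` limits the KV-cache size, and `--tensor-parallel-size` splits the weights across GPUs when a single card is too small for a 57B-parameter model.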