Modified llama.cpp to generate GGUFs for Llama-3_1-Nemotron-51 (#22, opened 21 days ago by ymcki)
Documentation about the linear attention used in some layers of this model? (#21, opened 27 days ago by ymcki)
Comparison to the 70B model? (#20, opened about 1 month ago by AIGUYCONTENT, 1 reply)
Update README.md (#11, opened 3 months ago by Vlad748283847)
vLLM compatible? (#10, opened 3 months ago by nickandbro, 3 replies)
AttributeError: 'DeciLMConfig' (#9, opened 3 months ago by bluenevus, 2 replies)
fp8 / int8 inference - use bitsandbytes or awq (#8, opened 3 months ago by dtanow; see the quantized-loading sketch after this list)
GGUF possible? (#5, opened 3 months ago by gopi87, 2 replies)
fine-tuning (#1, opened 3 months ago by kzmaker)
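
Thread #8 above asks about fp8/int8 inference through bitsandbytes or AWQ. Purely as a point of reference, here is a minimal sketch of the bitsandbytes 8-bit route via transformers. The repository id (nvidia/Llama-3_1-Nemotron-51B-Instruct) and the need for trust_remote_code with the custom DeciLM-derived configuration are assumptions, and whether 8-bit quantization behaves correctly with this architecture is exactly what the thread is asking.

```python
# Sketch only: assumes the model loads through transformers with
# trust_remote_code, and that bitsandbytes LLM.int8() quantization works
# for this DeciLM-derived architecture (unverified, see thread #8).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

model_id = "nvidia/Llama-3_1-Nemotron-51B-Instruct"  # assumed repo id

quant_config = BitsAndBytesConfig(load_in_8bit=True)  # LLM.int8() via bitsandbytes

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=quant_config,
    device_map="auto",           # spread layers across available GPUs
    torch_dtype=torch.bfloat16,  # dtype for the non-quantized modules
    trust_remote_code=True,      # custom DeciLM config/model code
)

prompt = "Summarize the difference between int8 and fp8 inference."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

The AWQ and vLLM paths raised in threads #8 and #10 would look different; this sketch only covers the bitsandbytes option named in the title.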