ibnzterrell/Nvidia-Llama-3.1-Nemotron-70B-Instruct-HF-AWQ-INT4 Text Generation • Updated Dec 7, 2024 • 160 • 2
bartowski/Llama-3.1-Nemotron-70B-Instruct-HF-GGUF Text Generation • Updated Oct 16, 2024 • 9.26k • 88
lmstudio-community/Llama-3.1-Nemotron-70B-Instruct-HF-GGUF Text Generation • Updated Oct 15, 2024 • 412 • 37
mlx-community/nvidia_Llama-3.1-Nemotron-70B-Instruct-HF_4bit Text Generation • Updated Oct 16, 2024 • 159k • 11
mlx-community/Llama-3.1-Nemotron-70B-Instruct-HF-8bit Text Generation • Updated Oct 17, 2024 • 25 • 1
neuralmagic/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic Text Generation • Updated Oct 17, 2024 • 37.4k • 14
DevQuasar/nvidia.Llama-3.1-Nemotron-70B-Instruct-HF-GGUF Text Generation • Updated Dec 4, 2024 • 175 • 1
mlx-community/Llama-3.1-Nemotron-70B-Instruct-HF-4bit Text Generation • Updated Oct 17, 2024 • 29 • 2
joshmiller656/Llama-3.1-Nemotron-70B-Instruct-AWQ-INT4 Text Generation • Updated Nov 5, 2024 • 476 • 2