Models: 43

Active filters: nvidia/Llama-3.1-Nemotron-70B-Instruct-HF

unsloth/Llama-3.1-Nemotron-70B-Instruct-bnb-4bit

Text Generation • Updated Oct 17, 2024 • 685 • 19

ibnzterrell/Nvidia-Llama-3.1-Nemotron-70B-Instruct-HF-AWQ-INT4

Text Generation • Updated Dec 7, 2024 • 160 • 2

bartowski/Llama-3.1-Nemotron-70B-Instruct-HF-GGUF

Text Generation • Updated Oct 16, 2024 • 9.26k • 88

lmstudio-community/Llama-3.1-Nemotron-70B-Instruct-HF-GGUF

Text Generation • Updated Oct 15, 2024 • 412 • 37

mlx-community/nvidia_Llama-3.1-Nemotron-70B-Instruct-HF_4bit

Text Generation • Updated Oct 16, 2024 • 159k • 11

XelotX/Llama-3.1-Nemotron-70B-Instruct-HF-GGUF

Text Generation • Updated Oct 16, 2024 • 259 • 1

mlx-community/Llama-3.1-Nemotron-70B-Instruct-HF-8bit

Text Generation • Updated Oct 17, 2024 • 25 • 1

mradermacher/Llama-3.1-Nemotron-70B-Instruct-HF-GGUF

Updated Oct 17, 2024 • 86 • 3

neuralmagic/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic

Text Generation • Updated Oct 17, 2024 • 37.4k • 14

mradermacher/Llama-3.1-Nemotron-70B-Instruct-HF-i1-GGUF

Updated 17 days ago • 1k • 4

DevQuasar/nvidia.Llama-3.1-Nemotron-70B-Instruct-HF-GGUF

Text Generation • Updated Dec 4, 2024 • 175 • 1

mlx-community/Llama-3.1-Nemotron-70B-Instruct-HF-4bit

Text Generation • Updated Oct 17, 2024 • 29 • 2

second-state/Llama-3.1-Nemotron-70B-Instruct-HF-GGUF

Text Generation • Updated Oct 18, 2024 • 79

gaianet/Llama-3.1-Nemotron-70B-Instruct-HF-GGUF

Text Generation • Updated Oct 18, 2024 • 116

unsloth/Llama-3.1-Nemotron-70B-Instruct-GGUF

Text Generation • Updated Oct 17, 2024 • 41 • 1

win28703/Llama-3.1-Nemotron-70B-Instruct-HF-Q8-mlx

Text Generation • Updated Oct 21, 2024 • 19

VPTQ-community/Llama-3.1-Nemotron-70B-Instruct-HF-v8-k65536-256-woft

Updated Nov 18, 2024 • 6

VPTQ-community/Llama-3.1-Nemotron-70B-Instruct-HF-v16-k65536-65536-woft

Updated Nov 18, 2024 • 2

VPTQ-community/Llama-3.1-Nemotron-70B-Instruct-HF-v8-k65536-65536-woft

Updated Nov 18, 2024 • 3 • 4

VPTQ-community/Llama-3.1-Nemotron-70B-Instruct-HF-v16-k65536-1024-woft

Updated Nov 18, 2024 • 7

VPTQ-community/Llama-3.1-Nemotron-70B-Instruct-HF-v8-k65536-0-woft

Updated Nov 18, 2024 • 3

VPTQ-community/Llama-3.1-Nemotron-70B-Instruct-HF-v16-k65536-16384-woft

Updated Nov 18, 2024

VPTQ-community/Llama-3.1-Nemotron-70B-Instruct-HF-v16-k65536-256-woft

Updated Nov 18, 2024 • 1 • 1

xmadai/Llama-3.1-Nemotron-70B-Instruct-xMADai-INT4

Text Generation • Updated Oct 30, 2024 • 392 • 4

joshmiller656/Llama-3.1-Nemotron-70B-Instruct-AWQ-INT4

Text Generation • Updated Nov 5, 2024 • 476 • 2

RohitPoreddy/Llama-3.1-Nemotron-70B-Instruct-HF-Q4-mlx

Text Generation • Updated Nov 7, 2024 • 15

m7alek/MathQA

Text Generation • Updated Nov 15, 2024 • 204

elitexp/Llama-3.1-Nemotron-70B-Instruct-HF-Q4-mlx

Text Generation • Updated Nov 13, 2024 • 9

mav23/Llama-3.1-Nemotron-92B-Instruct-HF-early-GGUF

Updated Nov 18, 2024 • 8

mav23/Llama-3.1-Nemotron-92B-Instruct-HF-late-GGUF

Updated Nov 18, 2024 • 15