Running 371 371 LLM Model VRAM Calculator ๐ Calculate VRAM requirements for running large language models
nvidia/Llama-3.1-Nemotron-70B-Instruct-HF Text Generation โข Updated Oct 25, 2024 โข 203k โข โข 2.01k