neuralmagic/Sparse-Llama-3.1-8B-gsm8k-2of4-FP8-dynamic Text Generation • Updated Dec 19, 2024 • 174 • 1
neuralmagic/Meta-Llama-3.1-405B-Instruct-quantized.w8a8 Text Generation • Updated Dec 3, 2024 • 312 • 2