neuralmagic/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-FP8-dynamic Text Generation • Updated Dec 19, 2024 • 47 • 1
neuralmagic/Sparse-Llama-3.1-8B-evolcodealpaca-2of4-FP8-dynamic Text Generation • Updated Dec 19, 2024 • 21
neuralmagic/Sparse-Llama-3.1-8B-gsm8k-2of4-FP8-dynamic Text Generation • Updated Dec 19, 2024 • 189 • 1
neuralmagic/Sparse-Llama-3.1-8B-gsm8k-2of4-FP8-dynamic Text Generation • Updated Dec 19, 2024 • 189 • 1
neuralmagic/Sparse-Llama-3.1-8B-gsm8k-2of4-quantized.w4a16 Text Generation • Updated Dec 19, 2024 • 41
neuralmagic/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-quantized.w4a16 Text Generation • Updated Dec 19, 2024 • 118 • 3
neuralmagic/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-FP8-dynamic Text Generation • Updated Dec 19, 2024 • 47 • 1
neuralmagic/Sparse-Llama-3.1-8B-evolcodealpaca-2of4-FP8-dynamic Text Generation • Updated Dec 19, 2024 • 21
neuralmagic/Sparse-Llama-3.1-8B-evolcodealpaca-2of4-quantized.w4a16 Text Generation • Updated Dec 19, 2024 • 13
neuralmagic/Meta-Llama-3.1-8B-Instruct-quantized.w4a16 Text Generation • Updated Dec 17, 2024 • 10.1k • 23
Sparse-Llama-3.1-2of4 Collection 2:4 sparse versions of Llama-3.1, including transfer learning • 10 items • Updated Dec 18, 2024 • 4