neuralmagic/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-FP8-dynamic
Text Generation
•
Updated
•
37
•
1
LLMs, optimization, compression, sparsification, quantization, pruning, distillation, NLP, CV