nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A8_tensor_weight_static_per_tensor_act-e2e Text Generation • Updated about 12 hours ago • 436
nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A8_channel_weight_static_per_tensor-e2e Text Generation • Updated about 13 hours ago • 192
nm-testing/TinyLlama-1.1B-Chat-v1.0-FP8A16_tensor-e2e Text Generation • Updated about 23 hours ago • 88
nm-testing/TinyLlama-1.1B-Chat-v1.0-FP8A16_channel-e2e Text Generation • Updated about 23 hours ago • 155
nm-testing/TinyLlama-1.1B-Chat-v1.0-FP8_DYNAMIC-e2e Text Generation • Updated about 23 hours ago • 233
neuralmagic/Mistral-Small-24B-Instruct-2501-FP8-Dynamic Text Generation • Updated 14 days ago • 11.5k • 5