riczhou
/

Llama-3-70B-Instruct-awq-int8-kv-cache-trt-llm

Inference Endpoints

Model card Files Files and versions Community

Llama-3-70B-Instruct-awq-int8-kv-cache-trt-llm / config.json

Commit History

Upload folder using huggingface_hub

0fdf245
verified

riczhou commited on May 22, 2024