QuantFactory/Meta-Llama-3-8B-Instruct-GGUF-v2
Text Generation
GGUF
PyTorch
English
facebook
meta
llama
llama-3
Inference Endpoints
conversational
License: llama3
Community (2)
Method for loading with transformers without dequantizing
#2 opened 3 months ago by ddphys
I'm experiencing the same endless loop with this version as well
#1 opened 6 months ago by alexcardo
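
Background for discussion #2: transformers' GGUF support (the gguf_file argument of from_pretrained) dequantizes the weights when loading, so GGUF quants like the ones in this repo are usually run with a llama.cpp-based runtime that keeps them quantized in memory. The sketch below uses llama-cpp-python rather than transformers and is not the method asked about in the thread; the filename pattern is a placeholder to be matched against the files actually present in this repository.

```python
# Minimal sketch (assumption, not from the discussion): load a GGUF quant
# directly with llama-cpp-python, which keeps the weights quantized in memory.
from llama_cpp import Llama

llm = Llama.from_pretrained(
    repo_id="QuantFactory/Meta-Llama-3-8B-Instruct-GGUF-v2",
    filename="*Q4_K_M.gguf",  # placeholder glob; pick a quant listed in the repo
    n_ctx=8192,               # Llama 3 context length
    n_gpu_layers=-1,          # offload all layers to GPU if one is available
)

# OpenAI-style chat completion against the quantized model
out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Say hello in one sentence."}],
    max_tokens=64,
)
print(out["choices"][0]["message"]["content"])
```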