shadowlilac/limax-UE-awq
Low-latency Large Language Model (LLM) optimized for fast response times. Key Features:
- Speed: Designed for minimal latency.
- Quantized: Utilizes AWQ (Adaptive Weight Quantization) for efficiency.
License: All rights reserved. Contact for commercial use inquiries. Only use for non-commercial personal use cases/studying