lilax-UE-awq / README.md
shadowlilac's picture
Update README.md
75641ca verified
|
raw
history blame
No virus
351 Bytes

shadowlilac/limax-UE-awq

Low-latency Large Language Model (LLM) optimized for fast response times. Key Features:

  • Speed: Designed for minimal latency.
  • Quantized: Utilizes AWQ (Adaptive Weight Quantization) for efficiency.

License: All rights reserved. Contact for commercial use inquiries. Only use for non-commercial personal use cases/studying