This is AIDC-ai-business/Marcoroni-70B-v1 quantized to LMDeploy 4bit AWQ with the following config:
python3 -m lmdeploy.lite.apis.auto_awq \
--model ./Marcoroni-70B-v1 \
--w_bits 4 \
--w_group_size 128 \
--work_dir ./quant
Original Model Card:
Marcoroni-70B
Model Details
- Trained by: trained by AIDC AI-Business.
- Model type: Marcoroni-70B is an auto-regressive language model based on the Llama 2 transformer architecture.
- Language(s): English
- License for Marcoroni-70B base weights: Non-Commercial Creative Commons license (CC BY-NC-4.0)
Prompting
Prompt Template for alpaca style
### Instruction:
<prompt> (without the <>)
### Response:
- Downloads last month
- 19
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.