|
--- |
|
library_name: transformers |
|
tags: [] |
|
--- |
|
|
|
# gemma2-mitra-it-int8 |
|
This is an 8int quantized version of gemma-2-mitra-it: https://huggingface.co./buddhist-nlp/gemma-2-mitra-it |
|
The quantization was done with llm compressor: https://github.com/vllm-project/llm-compressor |
|
|
|
The template for prompting the model is this: |
|
``` |
|
Please translate into <target_language>: <input_sentence> 🔽 Translation:: |
|
``` |
|
Line breaks in this model should be replaced with the '🔽' character before running the generation. '#' is used as a stop token. |
|
## Model Details |
|
For details on how to run this please see the gemma2-9b repository: https://huggingface.co./google/gemma-2-9b |
|
|