---
language:
- ru
base_model:
- t-tech/T-lite-it-1.0
tags:
- llama-cpp
---

# T-lite-it-1.0-Q8_0-GGUF

**🚨 T-lite is designed for further fine-tuning and is not intended as a ready-to-use conversational assistant. Users are advised to exercise caution and are responsible for any additional training and oversight required to ensure the model's responses meet acceptable ethical and safety standards. The responsibility for incorporating this model into industrial or commercial solutions lies entirely with those who choose to deploy it.**

## Description

This repository contains the [`T-lite-it-1.0`](https://huggingface.co/t-tech/T-lite-it-1.0/) model quantized to the GGUF format with [`llama.cpp`](https://github.com/ggerganov/llama.cpp).
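
The quantized file can also be fetched directly with the Hugging Face CLI before running the local examples below. A minimal sketch, using the repository and filename from the usage examples in this card:

```bash
# Download the GGUF file from this repository into the current directory
huggingface-cli download t-tech/T-lite-it-1.0-Q8_0-GGUF t-lite-it-1.0-q8_0.gguf --local-dir .
```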

## 📊 Benchmarks

| Benchmark | T-lite-it-1.0 | T-lite-it-1.0-Q8_0 | Qwen-2.5-7B-Instruct | GigaChat Pro 1.0.26.15 | RuAdapt-Qwen-7B-Instruct-v1 | gemma-2-9b-it |
|---------------|:-------------:|:------------------:|:--------------------:|:----------------------:|:---------------------------:|:-------------:|
| Arena-Hard-Ru | **metric** | metric | 54.29 | - | 52.77 | 47.83 |

## Llama.cpp usage

### Server

From HF:

```bash
llama-server --hf-repo t-tech/T-lite-it-1.0-Q8_0-GGUF --hf-file t-lite-it-1.0-q8_0.gguf -c 8192
```

Or locally:

```bash
./build/bin/llama-server -m t-lite-it-1.0-q8_0.gguf -c 8192
```
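
For a quick one-off check without running a server, llama.cpp also ships a `llama-cli` binary that accepts the same `--hf-repo`/`--hf-file` flags. A hedged sketch, assuming a recent llama.cpp build:

```bash
# Conversation mode (-cnv) applies the chat template bundled in the GGUF metadata
llama-cli --hf-repo t-tech/T-lite-it-1.0-Q8_0-GGUF --hf-file t-lite-it-1.0-q8_0.gguf -c 8192 -cnv
```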

### POST

```bash
curl --request POST \
    --url http://localhost:8080/completion \
    --header "Content-Type: application/json" \
    --data '{
        "prompt": "<|im_start|>user\nРасскажи мне чем отличается Python от C++?\n<|im_end|>\n<|im_start|>assistant\n",
        "n_predict": 256
    }'
```
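
Besides `/completion`, `llama-server` exposes an OpenAI-compatible endpoint that applies the chat template server-side, so the `<|im_start|>` markers do not need to be written by hand. A sketch against the same server:

```bash
curl --request POST \
    --url http://localhost:8080/v1/chat/completions \
    --header "Content-Type: application/json" \
    --data '{
        "messages": [
            {"role": "user", "content": "Расскажи мне чем отличается Python от C++?"}
        ],
        "max_tokens": 256
    }'
```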

## Ollama usage

### Serve

```bash
ollama serve
```

### Run

From HF:

```bash
ollama run hf.co/t-tech/T-lite-it-1.0-Q8_0-GGUF
```

Or locally:

```bash
ollama create example -f Modelfile
ollama run example "Расскажи мне про отличия C++ и Python"
```

where `Modelfile` is

```bash
FROM ./t-lite-it-1.0-q8_0.gguf
```
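
The one-line `Modelfile` leaves the prompt format implicit. To pin it down explicitly, Ollama's `TEMPLATE` and `PARAMETER` directives can be added; a hedged sketch, where the template text is an assumption based on the ChatML-style markers in the POST example above, not taken from the model card:

```bash
FROM ./t-lite-it-1.0-q8_0.gguf

# ChatML-style template matching the <|im_start|>/<|im_end|> markers above (assumed)
TEMPLATE """{{ if .System }}<|im_start|>system
{{ .System }}<|im_end|>
{{ end }}<|im_start|>user
{{ .Prompt }}<|im_end|>
<|im_start|>assistant
"""
# Stop generation at the end-of-turn marker
PARAMETER stop "<|im_end|>"
```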