metadata
base_model: jpacifico/Chocolatine-3B-Instruct-DPO-v1.2
datasets:
- jpacifico/french-orca-dpo-pairs-revised
language:
- fr
- en
library_name: transformers
license: mit
pipeline_tag: text-generation
tags:
- french
- chocolatine
- llama-cpp
Chocolatine-3B-Instruct-DPO-v1.2-Q4_K_M-GGUF
Quantized q4_k_m GGUF version of the original model jpacifico/Chocolatine-3B-Instruct-DPO-v1.2
can be used on a CPU device, compatible llama.cpp
now supported architecture by LM Studio.
Also ready for Raspberry Pi 5 8Gb.
The model supports 128K context length.
Ollama
Usage:
ollama run jpacifico/chocolatine-3b
Ollama Modelfile example :
FROM ./chocolatine-3b-instruct-dpo-v1.2-q4_k_m.gguf
TEMPLATE """{{ if .System }}<|system|>
{{ .System }}<|end|>
{{ end }}{{ if .Prompt }}<|user|>
{{ .Prompt }}<|end|>
{{ end }}<|assistant|>
{{ .Response }}<|end|>
"""
PARAMETER stop """{"stop": ["<|end|>","<|user|>","<|assistant|>"]}"""
SYSTEM """You are a friendly assistant called Chocolatine."""
Limitations
The Chocolatine model is a quick demonstration that a base model can be easily fine-tuned to achieve compelling performance.
It does not have any moderation mechanism.
- Developed by: Jonathan Pacifico, 2024
- Model type: LLM
- Language(s) (NLP): French, English
- License: MIT