QuantFactory/HelpingAI-Lite-GGUF

This is quantized version of OEvortex/HelpingAI-Lite created using llama.cpp

Original Model Card

HelpingAI-Lite

Subscribe to my YouTube channel

Subscribe

GGUF version here

HelpingAI-Lite is a lite version of the HelpingAI model that can assist with coding tasks. It's trained on a diverse range of datasets and fine-tuned to provide accurate and helpful responses.

License

This model is licensed under MIT.

Datasets

The model was trained on the following datasets:

  • cerebras/SlimPajama-627B
  • bigcode/starcoderdata
  • HuggingFaceH4/ultrachat_200k
  • HuggingFaceH4/ultrafeedback_binarized

Language

The model supports English language.

Usage

CPU and GPU code

from transformers import pipeline
from accelerate import Accelerator

# Initialize the accelerator
accelerator = Accelerator()

# Initialize the pipeline
pipe = pipeline("text-generation", model="OEvortex/HelpingAI-Lite", device=accelerator.device)

# Define the messages
messages = [
    {
        "role": "system",
        "content": "You are a chatbot who can help code!",
    },
    {
        "role": "user",
        "content": "Write me a function to calculate the first 10 digits of the fibonacci sequence in Python and print it out to the CLI.",
    },
]

# Prepare the prompt
prompt = pipe.tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)

# Generate predictions
outputs = pipe(prompt, max_new_tokens=256, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)

# Print the generated text
print(outputs[0]["generated_text"])
Downloads last month
74
GGUF
Model size
1.1B params
Architecture
llama

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

Inference API
Unable to determine this model’s pipeline type. Check the docs .

Datasets used to train QuantFactory/HelpingAI-Lite-GGUF

Evaluation results