base_model: unsloth/llama-3-8b-Instruct-bnb-4bit
language:
- en
license: apache-2.0
tags:
- text-generation-inference
- transformers
- unsloth
- llama
- gguf
Uploaded model
- Developed by: Deeokay
- License: apache-2.0
- Finetuned from model : unsloth/llama-3-8b-Instruct-bnb-4bit
This llama model was trained 2x faster with Unsloth and Hugging Face's TRL library.
README
This is a test model built on the following:
- a private dataset
- a slight customization of the llama3 template (no new tokens, no new configs)
- works with Ollama create using just "FROM path/to/model" as the Modelfile (the llama3 template works with no issues)
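As a minimal sketch of the one-line Modelfile mentioned above, the helper below writes that file from Python. The function name `write_modelfile` and the model name `my-test-model` are made up for illustration, and "path/to/model" is a placeholder for wherever the GGUF file actually lives.

```python
from pathlib import Path


def write_modelfile(gguf_path, out_dir="."):
    """Write a one-line Modelfile that points Ollama at a local GGUF file."""
    modelfile = Path(out_dir) / "Modelfile"
    # Only a FROM line is written; no TEMPLATE or SYSTEM entries are added.
    modelfile.write_text(f"FROM {gguf_path}\n", encoding="utf-8")
    return modelfile


if __name__ == "__main__":
    # Placeholder path; replace it with the location of the downloaded model.
    path = write_modelfile("path/to/model")
    print(path.read_text(encoding="utf-8"))
    # Then, from a shell (the model name is a hypothetical example):
    #   ollama create my-test-model -f Modelfile
```

Keeping the Modelfile to a single FROM line matches the note above that the standard llama3 template works without extra configuration.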
NOTE: DISCLAIMER
Please note that this model is not intended for production use; it is the result of fine-tuning done as a self-learning exercise.
The llama3 tokens were kept the same; however, the format was slightly customized using the available tokens.
I have left out the {{.System}} part, as this would be updated when converting to llama3.
I wanted to test whether the model would understand the additional headers I created to match what my dataset has:
- Analysis, Classification, Sentiment
This is the first pass through my ~30K personalized custom dataset.
If you would like to know how I started creating my dataset, you can check this link: Crafting GPT2 for Personalized AI-Preparing Data the Long Way (Part1)
As the data was created with custom special tokens, I had to convert it to the llama3 template.
However, I got creative again: the training data has the following template:
<|begin_of_text|> <|start_header_id|>user<|end_header_id|>
{{.Prompt}}<|eot_id|><|start_header_id|>analysis<|end_header_id|>
{{.Analysis}}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
{{.Response}}<|eot_id|><|start_header_id|>classification<|end_header_id|>
{{.Classification}}<|eot_id|><|start_header_id|>sentiment<|end_header_id|>
{{.Sentiment}}<|eot_id|> <|start_header_id|>user<|end_header_id|>
Thank you for your answer.<|eot_id|><|start_header_id|>assistant<|end_header_id|>
You're most welcome, what would like to know next?<|eot_id|>
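To make the layout above concrete, here is a rough sketch that renders one sample in that header order and splits such text back into its sections. The function names are hypothetical, and the exact whitespace (a single newline after each header, no separator between turns) is an assumption read off the template as displayed, not a confirmed detail of the training data.

```python
import re

# Header order taken from the template shown above.
SECTIONS = ("user", "analysis", "assistant", "classification", "sentiment")

_SECTION_RE = re.compile(
    r"<\|start_header_id\|>(\w+)<\|end_header_id\|>\s*(.*?)<\|eot_id\|>",
    re.DOTALL,
)


def render_sample(prompt, analysis, response, classification, sentiment):
    """Render one sample using the custom header layout."""
    values = (prompt, analysis, response, classification, sentiment)
    parts = ["<|begin_of_text|>"]
    for role, text in zip(SECTIONS, values):
        # Assumption: one newline between the header and its content.
        parts.append(
            f"<|start_header_id|>{role}<|end_header_id|>\n{text}<|eot_id|>"
        )
    return "".join(parts)


def split_sections(text):
    """Return (header, content) pairs in the order they appear in the text."""
    return [(role, body.strip()) for role, body in _SECTION_RE.findall(text)]


if __name__ == "__main__":
    # Illustrative values only; they are not drawn from the private dataset.
    sample = render_sample("Hi", "greeting", "Hello!", "smalltalk", "positive")
    for role, body in split_sections(sample):
        print(f"{role}: {body}")
```

Returning a list of pairs rather than a dict keeps repeated headers intact, such as the second user and assistant turn in the template.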
For now, the standard llama3 template seems to hold, and the model can be created in Ollama with the normal llama3 template.
I will be updating this monthly, as I have limited Colab resources.