WizardChatML 7B v0
I personally think ChatML is the best prompt format. It allows:
- Easier templating for generation
- Lower risk of inadvertently generating role tokens
- Better long-context performance and higher quality on quantized models
This model is an experiment attempting to extend WizardLM 2 7B to ChatML. It was trained on a small ChatML dataset, it probably isn't as good as WizardLM 2 Base, but it's an attempt.
Aside from using the ChatML prompt format, this model supports system prompts. In fact, it adheres very well to these prompts.
If you want to use this model for task-specific purposes, you should probably fine-tune it.
Capabilities & Challenges
- Seems ok-ish at writing
- Pretty good at math
- Sometimes calls itself ChatGPT/OpenAI
Risks
It has not been trained on guardrail data and may generate offensive content if prompted.
License
If you use this model, you must include the Apache 2.0 license AND the following notice:
I'm releasing this model under the Apache 2.0 license, with the added restriction that it cannot be used to compete with OpenAI (due to the nature of the training data). Additionally, this model was finetuned from the WizardLM 2 7B model, which was recently removed by Microsoft (it was Apache licensed, but may have been trained on NC-licensed data). You are responsible for the usage of this model. You are responsible for checking that your usage of this model is legal in your jurisdiction. Commercial use is not advised, as this model is finetuned from a model that may have been trained on NC-licensed data. Make sure to consult a lawyer before using in production or commercially.
- Downloads last month
- 11