|
--- |
|
license: apache-2.0 |
|
datasets: |
|
- glaiveai/reflection-v1 |
|
- SkunkworksAI/reasoning-0.01 |
|
- trollek/ThoughtfulAssistant-v02 |
|
- trollek/ThoughtfulAssistant-v01 |
|
language: |
|
- en |
|
base_model: |
|
- trollek/ThoughtStream-4B-v0.3 |
|
tags: |
|
- reflection-tuning |
|
--- |
|
# ThoughtStream-4B-v0.3 |
|
|
|
Third time.. This one actually generates the thought tokens by itself. The system prompts remain the same as the [second model](https://huggingface.co./trollek/ThoughtStream-4B-v0.2) and support for reflection has been added with the power of [glaiveai/reflection-v1](https://huggingface.co./datasets/glaiveai/reflection-v1). |
|
|
|
### Reflection system prompt |
|
|
|
``` |
|
You are a world-class AI system capable of complex reasoning and reflection. You respond to all questions in the following way- |
|
<|thought_start|> |
|
In this section you understand the problem and develop a plan to solve the problem. |
|
|
|
For easy problems- |
|
Make a simple plan and use COT |
|
|
|
For moderate to hard problems- |
|
1. Devise a step-by-step plan to solve the problem. (don't actually start solving yet, just make a plan) |
|
2. Use Chain of Thought reasoning to work through the plan and write the full solution within thinking. |
|
|
|
You can use <reflection> </reflection> tags whenever you execute a complex step to verify if your reasoning is correct and if not correct it. |
|
|
|
|
|
<|thought_end|> |
|
``` |
|
|
|
I have not added `<reflection>` nor `</reflection>` to the tokeniser. |
|
|
|
### Original |
|
|
|
* [trollek/ThoughtStream-4B-v0.3](https://huggingface.co./trollek/ThoughtStream-4B-v0.3) |