---
license: apache-2.0
---
# Tiger Model Card
## Model details
Tactic-guided reasoner (Tiger) is a language model that solves the *reasoning in the wild* task proposed in the paper [Can LLMs Reason in the Wild with Programs?](https://arxiv.org/abs/2406.13764).
It is trained by fine-tuning the LLaMA3-8B model on the [ReWild](https://huggingface.co/datasets/yuan-yang/ReWild) dataset.
**Model type:**
This repo contains the LoRA delta weights for `Tiger-Routing-8B`.
We also provide the delta weights of other versions:
- [Tiger-Routing-8B](https://huggingface.co/yuan-yang/Tiger-Routing-8B/)
- [Tiger-PJ-8B](https://huggingface.co/yuan-yang/Tiger-PJ-8B)
- [Tiger-IPJ-8B](https://huggingface.co/yuan-yang/Tiger-IPJ-8B)
**License:**
Apache License 2.0
## Using the model
Check out how to use the model on our project page: https://github.com/gblackout/Reason-in-the-Wild/
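Because the repos hold LoRA delta weights rather than full checkpoints, they have to be applied on top of a base model. The following is a minimal sketch of one way to do that, not the project's official loading code. It assumes the weights are stored in a format that the `peft` library can read and that the base checkpoint is `meta-llama/Meta-Llama-3-8B` (an assumed identifier, and access to it may require approval). The helper name `load_tiger` and the `TIGER_ADAPTERS` table are hypothetical; the project page remains the authoritative reference.

```python
# Sketch only: assumes PEFT-compatible LoRA weights and an assumed base model ID.
BASE_MODEL_ID = "meta-llama/Meta-Llama-3-8B"  # assumption, not stated in this card

# Adapter repos listed in this card, keyed by a short (hypothetical) variant name.
TIGER_ADAPTERS = {
    "routing": "yuan-yang/Tiger-Routing-8B",
    "pj": "yuan-yang/Tiger-PJ-8B",
    "ipj": "yuan-yang/Tiger-IPJ-8B",
}


def load_tiger(variant="routing", base_model_id=BASE_MODEL_ID):
    """Load the base model and attach the LoRA weights of one Tiger variant."""
    # Imported lazily so the module can be inspected without the heavy dependencies.
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    adapter_id = TIGER_ADAPTERS[variant]
    tokenizer = AutoTokenizer.from_pretrained(base_model_id)
    base_model = AutoModelForCausalLM.from_pretrained(base_model_id)
    # Wrap the base model with the LoRA adapter downloaded from the Hub.
    model = PeftModel.from_pretrained(base_model, adapter_id)
    return tokenizer, model
```

Calling `load_tiger("pj")` would attach the `Tiger-PJ-8B` weights instead. Prompt formats are not covered here, so follow the project page for those.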
**Primary intended uses:**
Tiger is intended to be used for research.
## Citation
```
@article{yang2024can,
title={Can LLMs Reason in the Wild with Programs?},
author={Yang, Yuan and Xiong, Siheng and Payani, Ali and Shareghi, Ehsan and Fekri, Faramarz},
journal={arXiv preprint arXiv:2406.13764},
year={2024}
}
```