File size: 1,199 Bytes
791baf4
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
---
license: apache-2.0
---

# Tiger Model Card

## Model details

Tactic-guided reasoner (Tiger) is a language model that solves *reasoning in the wild* task proposed in paper [Can LLMs Reason in the Wild with Programs](https://arxiv.org/abs/2406.13764).
It is trained by fine-tuning the LLaMA3-8B model on the [ReWild](https://huggingface.co./datasets/yuan-yang/ReWild) dataset.

**Model type:**
This repo contains the LoRA delta weights for `Tiger-Routing-8B`

We also provide the delta weights of other versions:
- [Tiger-Routing-8B](https://huggingface.co./yuan-yang/Tiger-Routing-8B/)
- [Tiger-PJ-8B](https://huggingface.co./yuan-yang/Tiger-PJ-8B)
- [Tiger-IPJ-8B](https://huggingface.co./yuan-yang/Tiger-IPJ-8B)

**License:**
Apache License 2.0

## Using the model

Check out how to use the model on our project page:  https://github.com/gblackout/Reason-in-the-Wild/


**Primary intended uses:**
Tiger is intended to be used for research.


## Citation

```
@article{yang2024can,
  title={Can LLMs Reason in the Wild with Programs?},
  author={Yang, Yuan and Xiong, Siheng and Payani, Ali and Shareghi, Ehsan and Fekri, Faramarz},
  journal={arXiv preprint arXiv:2406.13764},
  year={2024}
}
```