File size: 1,616 Bytes
0bff3c9 9fe3e58 0bff3c9 9fe3e58 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 |
---
license: apache-2.0
language:
- en
---
# LogicLLaMA Model Card
## Model details
LogicLLaMA is a language model that translates natural-language (NL) statements into first-order logic (FOL) rules.
It is trained by fine-tuning the LLaMA-7B model on the [MALLS](https://huggingface.co./datasets/yuan-yang/MALLS-v0) dataset.
**Model type:**
This repo contains the LoRA delta weights for naive correction LogicLLaMA, which, given a pair of the NL statement and a predicted FOL rule,
corrects the potential errors in the predicted FOL rule.
This is used as a downstream model together with ChatGPT, where ChatGPT does the "heavy lifting" by predicting the initial translated FOL rule
and then LogicLLaMA refines the rule by correcting potential errors.
In our [experiments](https://arxiv.org/abs/2305.15541), this mode yields better performance than ChatGPT and direction translation LogicLLaMA.
We also provide the delta weights for other modes:
- [direct translation LogicLLaMA ](https://huggingface.co./yuan-yang/LogicLLaMA-7b-direct-translate-delta-v0)
**License:**
Apache License 2.0
## Using the model
Check out how to use the model on our project page: https://github.com/gblackout/LogicLLaMA
**Primary intended uses:**
LogicLLaMA is intended to be used for research.
## Citation
```
@article{yang2023harnessing,
title={Harnessing the Power of Large Language Models for Natural Language to First-Order Logic Translation},
author={Yuan Yang and Siheng Xiong and Ali Payani and Ehsan Shareghi and Faramarz Fekri},
journal={arXiv preprint arXiv:2305.15541},
year={2023}
}
``` |