metadata
license: apache-2.0
Tiger Model Card
Model details
Tactic-guided reasoner (Tiger) is a language model that solves reasoning in the wild task proposed in paper Can LLMs Reason in the Wild with Programs. It is trained by fine-tuning the LLaMA3-8B model on the ReWild dataset.
Model type:
This repo contains the LoRA delta weights for Tiger-PJ-8B
We also provide the delta weights of other versions:
License: Apache License 2.0
Using the model
Check out how to use the model on our project page: https://github.com/gblackout/Reason-in-the-Wild/
Primary intended uses: Tiger is intended to be used for research.
Citation
@article{yang2024can,
title={Can LLMs Reason in the Wild with Programs?},
author={Yang, Yuan and Xiong, Siheng and Payani, Ali and Shareghi, Ehsan and Fekri, Faramarz},
journal={arXiv preprint arXiv:2406.13764},
year={2024}
}