datasets: | |
- gsm8k | |
language: | |
- en | |
The the language value model for GSM8k in | |
[Alphazero-like tree-search can guide large language model decoding and training](https://arxiv.org/abs/2309.17179), | |
ICML 2024 | |
``` | |
@article{feng2023alphazero, | |
title={Alphazero-like tree-search can guide large language model decoding and training}, | |
author={Feng, Xidong and Wan, Ziyu and Wen, Muning and Wen, Ying and Zhang, Weinan and Wang, Jun}, | |
journal={arXiv preprint arXiv:2309.17179}, | |
year={2023} | |
} | |
``` |