OhCherryFire commited on
Commit
df3a8e0
1 Parent(s): bdac55b

Create REAME.md

Browse files
Files changed (1) hide show
  1. REAME.md +12 -0
REAME.md ADDED
@@ -0,0 +1,12 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ The the language value model for GSM8k in
2
+ [Alphazero-like tree-search can guide large language model decoding and training](https://arxiv.org/abs/2309.17179),
3
+ ICML 2024
4
+
5
+ ```
6
+ @article{feng2023alphazero,
7
+ title={Alphazero-like tree-search can guide large language model decoding and training},
8
+ author={Feng, Xidong and Wan, Ziyu and Wen, Muning and Wen, Ying and Zhang, Weinan and Wang, Jun},
9
+ journal={arXiv preprint arXiv:2309.17179},
10
+ year={2023}
11
+ }
12
+ ```