philipp-zettl committed on
Commit
ed08b45
1 Parent(s): 094f3cb

Update README.md

  1. README.md +32 -3
README.md CHANGED
@@ -1,3 +1,32 @@
- ---
- license: cc0-1.0
- ---
+ ---
+ license: cc0-1.0
+ datasets:
+ - Lichess/standard-chess-games
+ pipeline_tag: text2text-generation
+ tags:
+ - chess
+ ---
+ # Model card for chessPT
+ A pretrained decoder-only transformer model for chess move prediction.
+
+ ## Intended use
+ Predict new moves in a chess game based on PGN tokens.
+
+ ## Implementation
+ The model implementation is based on Andrej Karpathy's [nanoGPT](https://github.com/karpathy/nanoGPT), following his "Zero to Hero" video series on [YouTube](https://www.youtube.com/playlist?list=PLAqhIrjkxbuWI23v9cThsA9GvCAUhRvKZ).
+
+ ## Training
+ You can find the training script in the repository's files under `train.py`.
+ It also contains the hyperparameters used:
+ ```python
+ context_size = 256
+ batch_size = 128
+ max_iters = 30_000
+ learning_rate = 3e-5
+ eval_interval = 100
+ eval_iters = 20
+ n_embed = 384
+ n_layer = 6
+ n_head = 6
+ dropout = 0.2
+ ```
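
To get a feel for the scale these hyperparameters imply, here is a rough back-of-the-envelope sketch. It is not taken from `train.py`. It assumes a standard GPT-style block (attention plus a 4x-wide MLP), one full batch per iteration, and it ignores embeddings, biases, and layer norms because the vocabulary size is not stated in the README. The derived names (`head_size`, `tokens_per_iter`, `total_tokens`, `block_params`) are illustrative only.

```python
# Hyperparameters copied from the README diff above.
context_size = 256
batch_size = 128
max_iters = 30_000
n_embed = 384
n_layer = 6
n_head = 6

# Each attention head works on an equal slice of the embedding dimension.
head_size = n_embed // n_head

# Tokens seen per iteration and over the whole run, assuming every
# iteration consumes exactly one full batch (an assumption, not from train.py).
tokens_per_iter = context_size * batch_size
total_tokens = tokens_per_iter * max_iters

# Rough weight count for the transformer blocks only, assuming the usual
# GPT layout: about 4*d^2 for attention and 8*d^2 for a 4x-wide MLP per layer.
# Biases, layer norms, and token/position embeddings are left out.
block_params = 12 * n_embed ** 2 * n_layer

print(f"head_size       = {head_size}")
print(f"tokens_per_iter = {tokens_per_iter:,}")
print(f"total_tokens    = {total_tokens:,}")
print(f"block_params    = {block_params:,}")
```

Under these assumptions the blocks hold roughly 10.6 million weights and the run processes just under one billion tokens. The real totals depend on the tokenizer and on details of `train.py` that the README does not show.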