avia-4x500m / base /config.json
shivendrra's picture
added train and model files
7f4e854 verified
raw
history blame contribute delete
158 Bytes
{
"batch_size": 10,
"block_size": 512,
"d_model": 512,
"n_heads": 8,
"n_layers": 8,
"dropout": 0.18,
"norm_eps": 1e-5,
"learning_rate": 3e-5
}