dm1024 / config.json
jqhoogland's picture
Upload final model (step 75000) and all checkpoints at 2024-10-18T06:07:54.614288
2e55553 verified
raw
history blame contribute delete
218 Bytes
{
"architectures": [
"HFHookedTransformer"
],
"hidden_size": 1024,
"num_attention_heads": 8,
"num_hidden_layers": 2,
"torch_dtype": "float32",
"transformers_version": "4.45.2",
"vocab_size": 5000
}