
jonas hallgrimsson gpt v2

Second version of a GPT model trained on the works of Jónas Hallgrímsson. Because the training data is small, the model began to overfit heavily, as the training metrics show. This model is therefore an early checkpoint from the training run, taken before the overfitting set in.
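The card does not say how the early checkpoint was chosen. One common approach, sketched here as an assumption rather than the author's actual procedure, is to keep the checkpoint with the lowest evaluation loss and stop looking once the loss has failed to improve for a few evaluations in a row. The function name `pick_early_checkpoint`, the `patience` parameter, and the loss values are all hypothetical and are not this model's real metrics.

```python
def pick_early_checkpoint(eval_losses, patience=2):
    """Return the index of the best checkpoint before overfitting.

    Scans evaluation losses in order, remembers the lowest one seen,
    and stops once the loss has not improved for `patience`
    consecutive evaluations (a simple early-stopping rule).
    """
    best_idx = 0
    best_loss = float("inf")
    evals_without_improvement = 0

    for idx, loss in enumerate(eval_losses):
        if loss < best_loss:
            # New best checkpoint: record it and reset the counter.
            best_loss = loss
            best_idx = idx
            evals_without_improvement = 0
        else:
            evals_without_improvement += 1
            if evals_without_improvement >= patience:
                # Evaluation loss keeps rising: treat this as overfitting.
                break

    return best_idx


# Illustrative, made-up evaluation losses (NOT this model's metrics).
example_losses = [3.9, 3.4, 3.1, 3.2, 3.5, 3.9]
print(pick_early_checkpoint(example_losses))  # prints 2
```

With a small corpus, training loss usually keeps falling while evaluation loss turns upward, so selecting on evaluation loss is what separates an early, still-generalizing checkpoint from a later, memorizing one.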

Format: Safetensors
Model size: 125M params
Tensor type: F32

Dataset used to train Sigurdur/jonas-hallgrimsson-gpt2