A wider Baby Berta model, trained using curriculum learning and layer stacking, for the Strict-Small track of the BabyLM Challenge.
