Training details
#19 opened 1 day ago
by
nleroux
EOS Token
1
#18 opened 3 months ago
by
darkmatter2222
Benchmark results
#17 opened 4 months ago
by
Bachstelze
Interview request: genAI evaluation & documentation
#16 opened 5 months ago
by
meggymuggy
Are there two identical embedding tensors, even though embeddings are shared?
#15 opened 5 months ago
by
graefics
About MMLU evaluation
1
#12 opened 6 months ago
by
ldwang
Intermediate checkpoints for research purposes
2
#11 opened 6 months ago
by
maveriq

ONNX generation
1
#9 opened 7 months ago
by
davesoma

Adding Evaluation Results
#8 opened 7 months ago
by
leaderboard-pr-bot

Google Colab for Guide to Use the Model
#7 opened 7 months ago
by
mesjavacca
Model code for training from sractch
4
#6 opened 7 months ago
by
Chrisneverdie
Multilingual Support
1
#5 opened 8 months ago
by
Mistsink
Trapezoidal scheduler with cooldown phase
3
#4 opened 8 months ago
by
maveriq
