GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection Paper • 2403.03507 • Published Mar 6 • 182 • 15
PersianMind: A Cross-Lingual Persian-English Large Language Model Paper • 2401.06466 • Published Jan 12 • 3 • 2
GRATH: Gradual Self-Truthifying for Large Language Models Paper • 2401.12292 • Published Jan 22 • 2 • 2