ShortGPT: Layers in Large Language Models are More Redundant Than You Expect Paper • 2403.03853 • Published Mar 6 • 62 • 21
RakutenAI-7B: Extending Large Language Models for Japanese Paper • 2403.15484 • Published Mar 21 • 12 • 2