Gemma release Collection Groups the Gemma models released by the Google team. • 40 items • Updated 12 days ago • 327
qwen-nekomata Collection The nekomata model series are based on the qwen series and have been continually pre-trained on Japanese-specific corpora. • 8 items • Updated 20 days ago • 5
Retentive Network: A Successor to Transformer for Large Language Models Paper • 2307.08621 • Published Jul 17, 2023 • 170