Gemma release Collection Groups the Gemma models released by the Google team. • 40 items • Updated Jul 31 • 325
qwen-nekomata Collection The nekomata model series are based on the qwen series and have been continually pre-trained on Japanese-specific corpora. • 8 items • Updated 3 days ago • 5
Retentive Network: A Successor to Transformer for Large Language Models Paper • 2307.08621 • Published Jul 17, 2023 • 170