Gemma release Collection Groups the Gemma models released by the Google team. โข 40 items โข Updated Dec 13, 2024 โข 330
qwen-nekomata Collection The nekomata model series are based on the qwen series and have been continually pre-trained on Japanese-specific corpora. โข 8 items โข Updated 11 days ago โข 5
Retentive Network: A Successor to Transformer for Large Language Models Paper โข 2307.08621 โข Published Jul 17, 2023 โข 170