view article Article Training and Finetuning Embedding Models with Sentence Transformers v3 May 28 • 158
view article Article Introducing RWKV — An RNN with the advantages of a transformer May 15, 2023 • 14
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model Paper • 2211.05100 • Published Nov 9, 2022 • 28
Retentive Network: A Successor to Transformer for Large Language Models Paper • 2307.08621 • Published Jul 17, 2023 • 170
Meta-Transformer: A Unified Framework for Multimodal Learning Paper • 2307.10802 • Published Jul 20, 2023 • 43