Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback Paper • 2307.16039 • Published Jul 29, 2023 • 4
Taipan: Efficient and Expressive State Space Language Models with Selective Attention Paper • 2410.18572 • Published Oct 24, 2024 • 16
OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs Paper • 2409.05152 • Published Sep 8, 2024 • 30
ULLME: A Unified Framework for Large Language Model Embeddings with Generation-Augmented Learning Paper • 2408.03402 • Published Aug 6, 2024 • 2
CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages Paper • 2309.09400 • Published Sep 17, 2023 • 84