Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions · 25 authors
OpenBA: An Open-sourced 15B Bilingual Asymmetric seq2seq Model Pre-trained from Scratch · 12 authors