Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model Paper • 2402.07827 • Published Feb 12 • 43
Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning Paper • 2402.06619 • Published Feb 9 • 47
AfriQA: Cross-lingual Open-Retrieval Question Answering for African Languages Paper • 2305.06897 • Published May 11, 2023 • 5
Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action Paper • 2312.17172 • Published Dec 28, 2023 • 24
From Base to Conversational: Japanese Instruction Dataset and Tuning Large Language Models Paper • 2309.03412 • Published Sep 7, 2023 • 1
Adapters: A Unified Library for Parameter-Efficient and Modular Transfer Learning Paper • 2311.11077 • Published Nov 18, 2023 • 24
Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2 Paper • 2311.10702 • Published Nov 17, 2023 • 17
Universal NER: A Gold-Standard Multilingual Named Entity Recognition Benchmark Paper • 2311.09122 • Published Nov 15, 2023 • 6
AutoAgents: A Framework for Automatic Agent Generation Paper • 2309.17288 • Published Sep 29, 2023 • 3
CoLaDa: A Collaborative Label Denoising Framework for Cross-lingual Named Entity Recognition Paper • 2305.14913 • Published May 24, 2023 • 1
TableGPT: Towards Unifying Tables, Nature Language and Commands into One GPT Paper • 2307.08674 • Published Jul 17, 2023 • 46
Augmenting CLIP with Improved Visio-Linguistic Reasoning Paper • 2307.09233 • Published Jul 18, 2023 • 7
LLMs as Workers in Human-Computational Algorithms? Replicating Crowdsourcing Pipelines with LLMs Paper • 2307.10168 • Published Jul 19, 2023 • 9
Challenges and Applications of Large Language Models Paper • 2307.10169 • Published Jul 19, 2023 • 46
SciBench: Evaluating College-Level Scientific Problem-Solving Abilities of Large Language Models Paper • 2307.10635 • Published Jul 20, 2023 • 6
Meta-Transformer: A Unified Framework for Multimodal Learning Paper • 2307.10802 • Published Jul 20, 2023 • 40
Question Decomposition Improves the Faithfulness of Model-Generated Reasoning Paper • 2307.11768 • Published Jul 17, 2023 • 11
A Real-World WebAgent with Planning, Long Context Understanding, and Program Synthesis Paper • 2307.12856 • Published Jul 24, 2023 • 34
ARB: Advanced Reasoning Benchmark for Large Language Models Paper • 2307.13692 • Published Jul 25, 2023 • 17
WebArena: A Realistic Web Environment for Building Autonomous Agents Paper • 2307.13854 • Published Jul 25, 2023 • 20
PromptStyler: Prompt-driven Style Generation for Source-free Domain Generalization Paper • 2307.15199 • Published Jul 27, 2023 • 10
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback Paper • 2307.15217 • Published Jul 27, 2023 • 34
SEED-Bench: Benchmarking Multimodal LLMs with Generative Comprehension Paper • 2307.16125 • Published Jul 30, 2023 • 5
UniVTG: Towards Unified Video-Language Temporal Grounding Paper • 2307.16715 • Published Jul 31, 2023 • 8
AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos? Paper • 2307.16368 • Published Jul 31, 2023 • 10
RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control Paper • 2307.15818 • Published Jul 28, 2023 • 25
SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning Paper • 2308.00436 • Published Aug 1, 2023 • 20
Skills-in-Context Prompting: Unlocking Compositionality in Large Language Models Paper • 2308.00304 • Published Aug 1, 2023 • 22
Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models Paper • 2308.00675 • Published Aug 1, 2023 • 34
Ambient Adventures: Teaching ChatGPT on Developing Complex Stories Paper • 2308.01734 • Published Aug 3, 2023 • 6
OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models Paper • 2308.01390 • Published Aug 2, 2023 • 30
DeepSpeed-Chat: Easy, Fast and Affordable RLHF Training of ChatGPT-like Models at All Scales Paper • 2308.01320 • Published Aug 2, 2023 • 42
TPTU: Task Planning and Tool Usage of Large Language Model-based AI Agents Paper • 2308.03427 • Published Aug 7, 2023 • 13
Enhancing Network Management Using Code Generated by Large Language Models Paper • 2308.06261 • Published Aug 11, 2023 • 4
BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents Paper • 2308.05960 • Published Aug 11, 2023 • 18
VisIT-Bench: A Benchmark for Vision-Language Instruction Following Inspired by Real-World Use Paper • 2308.06595 • Published Aug 12, 2023 • 4
The Devil is in the Errors: Leveraging Large Language Models for Fine-grained Machine Translation Evaluation Paper • 2308.07286 • Published Aug 14, 2023 • 5
Learning to Identify Critical States for Reinforcement Learning from Videos Paper • 2308.07795 • Published Aug 15, 2023 • 6
Teach LLMs to Personalize -- An Approach inspired by Writing Education Paper • 2308.07968 • Published Aug 15, 2023 • 24
Dataset and Baseline System for Multi-lingual Extraction and Normalization of Temporal and Numerical Expressions Paper • 2303.18103 • Published Mar 31, 2023 • 1
TIARA: Multi-grained Retrieval for Robust Question Answering over Large Knowledge Bases Paper • 2210.12925 • Published Oct 24, 2022 • 1
Chinese Open Instruction Generalist: A Preliminary Release Paper • 2304.07987 • Published Apr 17, 2023 • 2