arxiv:2311.11045

Orca 2: Teaching Small Language Models How to Reason

Published on Nov 18, 2023 · Featured in Daily Papers on Nov 21, 2023

Abstract

Orca 1 learns from rich signals, such as explanation traces, allowing it to outperform conventional instruction-tuned models on benchmarks like BigBench Hard and AGIEval. In Orca 2, we continue exploring how improved training signals can enhance smaller LMs' reasoning abilities. Research on training small LMs has often relied on imitation learning to replicate the output of more capable models. We contend that excessive emphasis on imitation may restrict the potential of smaller models. We seek to teach small LMs to employ different solution strategies for different tasks, potentially different from the one used by the larger model. For example, while larger models might provide a direct answer to a complex task, smaller models may not have the same capacity. In Orca 2, we teach the model various reasoning techniques (step-by-step, recall then generate, recall-reason-generate, direct answer, etc.). More crucially, we aim to help the model learn to determine the most effective solution strategy for each task. We evaluate Orca 2 using a comprehensive set of 15 diverse benchmarks (corresponding to approximately 100 tasks and over 36,000 unique prompts). Orca 2 significantly surpasses models of similar size and attains performance levels similar to or better than those of models 5-10x larger, as assessed on complex tasks that test advanced reasoning abilities in zero-shot settings. We open-source Orca 2 to encourage further research on the development, evaluation, and alignment of smaller LMs.
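The abstract's central claim, that a small model can be steered toward different solution strategies via its system message, is easiest to try against the released checkpoints. Below is a minimal sketch using Hugging Face Transformers; the model ID (microsoft/Orca-2-7b), the ChatML-style prompt template, and the example system message are assumptions about the public release rather than details taken from this abstract, so adjust them to match the model card if they differ.

```python
# Minimal sketch (not from the paper) of prompting a released Orca 2 checkpoint.
# Assumptions: Hub ID "microsoft/Orca-2-7b" and a ChatML-style chat format.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "microsoft/Orca-2-7b"  # assumed Hub ID for the smaller released checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_id, use_fast=False)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

# The system message is where the solution strategy is steered
# (e.g. step-by-step reasoning vs. a direct answer).
system = "You are a cautious assistant. Think step by step before answering."
user = (
    "A bat and a ball cost $1.10 in total. The bat costs $1.00 more than the ball. "
    "How much does the ball cost?"
)
prompt = (
    f"<|im_start|>system\n{system}<|im_end|>\n"
    f"<|im_start|>user\n{user}<|im_end|>\n"
    f"<|im_start|>assistant\n"
)

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=256, do_sample=False)
# Decode only the newly generated tokens, not the prompt.
print(tokenizer.decode(output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```

Swapping the system message for "Answer directly and concisely." is a quick way to observe whether the strategy actually changes with the instruction.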

Community

This method of adding reasoning to LLMs is DEEPLY flawed because it doesn't distinguish between correlation and causation (C&C). To do so, one requires causal DAGs, and I see none of those in this paper. There is a term for when you equate C&C: superstition. This AI will be superstitious. IMHO, superstitions are even more dangerous than hallucinations. Religion is a superstition that has been used as an excuse for war since time immemorial. I expand on this idea in the following essay: https://qbnets.wordpress.com/2023/10/30/yann-lecun-the-godfather-of-superstitious-ai/


All research is flawed. Until it isn't. It's like shining a torch into a dark room. If anyone is superstitious then it's you, because you already know what's in the dark room, without looking.


The paper does not answer the question that matters most to me: what are the reasoning strategies and their associated system instructions for each sub-task, and how is the strategy selected for each clustered sub-task? Manually, or through prompts that leverage an OpenAI model?

If they did the main task by hand, then this paper is not insightful at all.


Exactly! Where is the complete list of strategies along with their system instructions? It's really weird how these were left out of the paper while they seem to be the cornerstone of this paper!
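For readers trying to picture what the commenters are asking about, here is a hypothetical sketch of a strategy-to-sub-task mapping combined with prompt erasure (training the student on the teacher's answer while hiding the strategy-specific instruction). It is not the paper's actual procedure, which is exactly what the comments above say is missing; the cluster names, instructions, and selection rule are invented for illustration only.

```python
# Hypothetical illustration (not from the paper): map each clustered sub-task to a
# strategy-specific system instruction for the teacher, then "erase" it for the student
# so the student must learn to choose the strategy on its own.

STRATEGY_INSTRUCTIONS = {
    "multi_step_math": "Solve the problem step by step and show your reasoning before the final answer.",
    "knowledge_qa": "First recall the relevant facts, then use them to compose the answer.",
    "reading_comprehension": "Quote the relevant passage, reason over it, then answer.",
    "simple_lookup": "Answer directly and concisely.",
}

GENERIC_STUDENT_INSTRUCTION = "You are a helpful assistant. Answer the question as well as you can."


def build_training_pair(task_cluster: str, question: str, teacher_answer: str) -> dict:
    """Pair a strategy-specific teacher prompt with a generic student example.

    The teacher sees the detailed instruction chosen for the cluster; the student
    is trained on the same answer but sees only the generic instruction.
    """
    teacher_system = STRATEGY_INSTRUCTIONS.get(task_cluster, GENERIC_STUDENT_INSTRUCTION)
    return {
        "teacher_prompt": {"system": teacher_system, "user": question},
        "student_example": {
            "system": GENERIC_STUDENT_INSTRUCTION,
            "user": question,
            "target": teacher_answer,
        },
    }


# Example usage with an invented sub-task cluster label:
pair = build_training_pair(
    "multi_step_math",
    "If a train travels 60 km in 45 minutes, what is its average speed in km/h?",
    "45 minutes is 0.75 hours, so the speed is 60 / 0.75 = 80 km/h.",
)
print(pair["teacher_prompt"]["system"])
print(pair["student_example"]["system"])
```

How the cluster labels and instructions were actually chosen in Orca 2, by hand or via prompting a stronger model, is the open question raised in this thread.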


Models citing this paper: 28
Datasets citing this paper: 3
Spaces citing this paper: 72
Collections including this paper: 45