arxiv:2401.12246

Orion-14B: Open-source Multilingual Large Language Models

Published on Jan 20
· Featured in Daily Papers on Jan 24
Abstract

In this study, we introduce Orion-14B, a collection of multilingual large language models with 14 billion parameters. We utilize a data scheduling approach to train a foundational model on a diverse corpus of 2.5 trillion tokens, sourced from texts in English, Chinese, Japanese, Korean, and other languages. Additionally, we fine-tune a series of models tailored for conversational applications and other specific use cases. Our evaluation results demonstrate that Orion-14B achieves state-of-the-art performance across a broad spectrum of tasks. We make the Orion-14B model family and its associated code publicly accessible at https://github.com/OrionStarAI/Orion, aiming to inspire future research and practical applications in the field.

Community

Looks good, but the license is misleading. The GitHub repository says Apache, but read further and you'll see that only the code is Apache-licensed. The model weights require a commercial license, and that license is also revocable.

