File size: 3,335 Bytes
69c99d0
 
 
a291bf0
ccc62ae
a291bf0
af70757
a291bf0
 
 
 
 
af70757
a291bf0
 
af70757
a291bf0
 
 
 
 
 
 
 
 
 
616fc5a
a291bf0
616fc5a
 
a291bf0
616fc5a
 
 
 
 
 
 
704c6e4
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
af70757
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
704c6e4
af70757
 
704c6e4
af70757
 
704c6e4
af70757
 
 
616fc5a
 
 
 
 
af70757
22b0ac8
af70757
22b0ac8
af70757
22b0ac8
af70757
22b0ac8
af70757
22b0ac8
616fc5a
 
69c99d0
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
---
license: apache-2.0
---
<div align="center">

<picture> 
  <img src="https://raw.githubusercontent.com/01-ai/Yi/main/assets/img/Yi_logo_icon_light.svg" width="150px">
</picture>

</div>

<p align="center">
  <a href="https://github.com/01-ai">πŸ™ GitHub</a> β€’
  <a href="https://discord.gg/hYUwWddeAu">πŸ‘Ύ Discord</a> β€’
  <a href="https://twitter.com/01ai_yi">🐀 Twitter</a> β€’
  <a href="https://github.com/01-ai/Yi-1.5/issues/2">πŸ’¬ WeChat</a> 
  <br/>
  <a href="https://arxiv.org/abs/2403.04652">πŸ“ Paper</a> β€’
  <a href="https://github.com/01-ai/Yi/tree/main?tab=readme-ov-file#faq">πŸ™Œ FAQ</a> β€’
  <a href="https://github.com/01-ai/Yi/tree/main?tab=readme-ov-file#learning-hub">πŸ“— Learning Hub</a>
</p>

# Intro

Yi-1.5 is an upgraded version of Yi. It is continuously pre-trained on Yi with a high-quality corpus of 500B tokens and fine-tuned on 3M diverse fine-tuning samples. 

Compared with Yi, Yi-1.5 delivers stronger performance in coding, math, reasoning, and instruction-following capability, while still maintaining excellent capabilities in language understanding, commonsense reasoning, and reading comprehension.

<div align="center">
  
Model | Context Length | Pre-trained Tokens
| :------------: | :------------: | :------------: |
| Yi-1.5 | 4K | 3.6T

</div>

# Models

- Chat models

  <table>
  <thead>
    <tr>
      <th>Model</th>
      <th>Download</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td>Yi-1.5-34B-Chat</td>
      <td rowspan="3">β€’ <a href="https://huggingface.co/collections/01-ai/yi-15-2024-05-663f3ecab5f815a3eaca7ca8">πŸ€— Hugging Face</a> β€’ <a href="https://www.modelscope.cn/organization/01ai">πŸ€– ModelScope</a><br></td>
    </tr>
    <tr>
      <td>Yi-1.5-9B-Chat</td>
    </tr>
    <tr>
      <td>Yi-1.5-6B-Chat</td>
    </tr>
  </tbody>
  </table>

- Base models

  <table>
  <thead>
    <tr>
      <th>Model</th>
      <th>Download</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td>Yi-1.5-34B</td>
      <td rowspan="3">β€’ <a href="https://huggingface.co/collections/01-ai/yi-15-2024-05-663f3ecab5f815a3eaca7ca8">πŸ€— Hugging Face</a> β€’ <a href="https://www.modelscope.cn/organization/01ai">πŸ€– ModelScope</a><br></td>
    </tr>
    <tr>
      <td>Yi-1.5-9B</td>
    </tr>
    <tr>
      <td>Yi-1.5-6B</td>
    </tr>
  </tbody>
  </table>

---

<table>
<thead>
  <tr>
    <th>Model</th>
    <th>Name</th>
    <th>Download</th>
  </tr>
</thead>
<tbody>
  <tr>
    <td rowspan="3">Chat</td>
    <td>Yi-1.5-34B-Chat</td>
    <td rowspan="6">β€’ <a href="https://huggingface.co/collections/01-ai/yi-15-2024-05-663f3ecab5f815a3eaca7ca8">πŸ€— Hugging Face</a> β€’ <a href="https://www.modelscope.cn/organization/01ai/">πŸ€– ModelScope</a><br></td>
  </tr>
  <tr>
    <td>Yi-1.5-9B-Chat</td>
  </tr>
  <tr>
    <td>Yi-1.5-6B-Chat</td>
  </tr>
  <tr>
    <td rowspan="3">Base</td>
    <td>Yi-1.5-34B</td>
  </tr>
  <tr>
    <td>Yi-1.5-9B</td>
  </tr>
  <tr>
    <td>Yi-1.5-6B</td>
  </tr>
</tbody>
</table>

# Benchmarks

- Chat models

  Yi-1.5-34B-Chat is on par with or excels beyond larger models in most benchmarks.

  tbd

  Yi-1.5-9B-Chat is a strong performer among similarly sized open-source models.

  tbd

- Base models

# Quick Start

For getting up and running with Yi-1.5 models quickly, see [README](https://github.com/01-ai/Yi-1.5).