File size: 3,335 Bytes
69c99d0 a291bf0 ccc62ae a291bf0 af70757 a291bf0 af70757 a291bf0 af70757 a291bf0 616fc5a a291bf0 616fc5a a291bf0 616fc5a 704c6e4 af70757 704c6e4 af70757 704c6e4 af70757 704c6e4 af70757 616fc5a af70757 22b0ac8 af70757 22b0ac8 af70757 22b0ac8 af70757 22b0ac8 af70757 22b0ac8 616fc5a 69c99d0 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 |
---
license: apache-2.0
---
<div align="center">
<picture>
<img src="https://raw.githubusercontent.com/01-ai/Yi/main/assets/img/Yi_logo_icon_light.svg" width="150px">
</picture>
</div>
<p align="center">
<a href="https://github.com/01-ai">π GitHub</a> β’
<a href="https://discord.gg/hYUwWddeAu">πΎ Discord</a> β’
<a href="https://twitter.com/01ai_yi">π€ Twitter</a> β’
<a href="https://github.com/01-ai/Yi-1.5/issues/2">π¬ WeChat</a>
<br/>
<a href="https://arxiv.org/abs/2403.04652">π Paper</a> β’
<a href="https://github.com/01-ai/Yi/tree/main?tab=readme-ov-file#faq">π FAQ</a> β’
<a href="https://github.com/01-ai/Yi/tree/main?tab=readme-ov-file#learning-hub">π Learning Hub</a>
</p>
# Intro
Yi-1.5 is an upgraded version of Yi. It is continuously pre-trained on Yi with a high-quality corpus of 500B tokens and fine-tuned on 3M diverse fine-tuning samples.
Compared with Yi, Yi-1.5 delivers stronger performance in coding, math, reasoning, and instruction-following capability, while still maintaining excellent capabilities in language understanding, commonsense reasoning, and reading comprehension.
<div align="center">
Model | Context Length | Pre-trained Tokens
| :------------: | :------------: | :------------: |
| Yi-1.5 | 4K | 3.6T
</div>
# Models
- Chat models
<table>
<thead>
<tr>
<th>Model</th>
<th>Download</th>
</tr>
</thead>
<tbody>
<tr>
<td>Yi-1.5-34B-Chat</td>
<td rowspan="3">β’ <a href="https://huggingface.co/collections/01-ai/yi-15-2024-05-663f3ecab5f815a3eaca7ca8">π€ Hugging Face</a> β’ <a href="https://www.modelscope.cn/organization/01ai">π€ ModelScope</a><br></td>
</tr>
<tr>
<td>Yi-1.5-9B-Chat</td>
</tr>
<tr>
<td>Yi-1.5-6B-Chat</td>
</tr>
</tbody>
</table>
- Base models
<table>
<thead>
<tr>
<th>Model</th>
<th>Download</th>
</tr>
</thead>
<tbody>
<tr>
<td>Yi-1.5-34B</td>
<td rowspan="3">β’ <a href="https://huggingface.co/collections/01-ai/yi-15-2024-05-663f3ecab5f815a3eaca7ca8">π€ Hugging Face</a> β’ <a href="https://www.modelscope.cn/organization/01ai">π€ ModelScope</a><br></td>
</tr>
<tr>
<td>Yi-1.5-9B</td>
</tr>
<tr>
<td>Yi-1.5-6B</td>
</tr>
</tbody>
</table>
---
<table>
<thead>
<tr>
<th>Model</th>
<th>Name</th>
<th>Download</th>
</tr>
</thead>
<tbody>
<tr>
<td rowspan="3">Chat</td>
<td>Yi-1.5-34B-Chat</td>
<td rowspan="6">β’ <a href="https://huggingface.co/collections/01-ai/yi-15-2024-05-663f3ecab5f815a3eaca7ca8">π€ Hugging Face</a> β’ <a href="https://www.modelscope.cn/organization/01ai/">π€ ModelScope</a><br></td>
</tr>
<tr>
<td>Yi-1.5-9B-Chat</td>
</tr>
<tr>
<td>Yi-1.5-6B-Chat</td>
</tr>
<tr>
<td rowspan="3">Base</td>
<td>Yi-1.5-34B</td>
</tr>
<tr>
<td>Yi-1.5-9B</td>
</tr>
<tr>
<td>Yi-1.5-6B</td>
</tr>
</tbody>
</table>
# Benchmarks
- Chat models
Yi-1.5-34B-Chat is on par with or excels beyond larger models in most benchmarks.
tbd
Yi-1.5-9B-Chat is a strong performer among similarly sized open-source models.
tbd
- Base models
# Quick Start
For getting up and running with Yi-1.5 models quickly, see [README](https://github.com/01-ai/Yi-1.5). |