yi-01-ai committed
Commit: 95f9276
Parent(s): 41da68a
Auto Sync from git://github.com/01-ai/Yi.git/commit/6da0100d379e4446978ab09f10fd58c6cbdc452d
README.md
CHANGED
@@ -71,14 +71,14 @@ pipeline_tag: text-generation
 <details open>
 <summary></b>📕 Table of Contents</b></summary>
 
-- [What is Yi?](
-- [Introduction](
-- [Models](
+- [What is Yi?](#what-is-yi)
+- [Introduction](#introduction)
+- [Models](#models)
 - [Chat models](#chat-models)
 - [Base models](#base-models)
 - [Other info](#other-info)
-- [News](
-- [How to use Yi?](
+- [News](#news)
+- [How to use Yi?](#how-to-use-yi)
 - [Quick start](#quick-start)
 - [Choose your path](#choose-your-path)
 - [pip](#quick-start---pip)
@@ -90,30 +90,30 @@ pipeline_tag: text-generation
 - [Quantization](#quantization)
 - [Deployment](#deployment)
 - [Learning hub](#learning-hub)
-- [Why Yi?](
-- [Ecosystem](
-- [Upstream](
-- [Downstream](
-- [Serving](
-- [
-- [Fine-tuning](
+- [Why Yi?](#why-yi)
+- [Ecosystem](#ecosystem)
+- [Upstream](#upstream)
+- [Downstream](#downstream)
+- [Serving](#serving)
+- [Quantization](#quantization-1)
+- [Fine-tuning](#fine-tuning-1)
 - [API](#api)
-- [Benchmarks](
-- [Base model performance](
-- [Chat model performance](
-- [Who can use Yi?](
-- [Misc.](
+- [Benchmarks](#benchmarks)
+- [Base model performance](#base-model-performance)
+- [Chat model performance](#chat-model-performance)
+- [Who can use Yi?](#who-can-use-yi)
+- [Misc.](#misc)
 - [Acknowledgements](#acknowledgments)
-- [Disclaimer](
-- [License](
+- [Disclaimer](#disclaimer)
+- [License](#license)
 
 </details>
 
 <hr>
 
-#
+# What is Yi?
 
-##
+## Introduction
 
 - 🤖 The Yi series models are the next generation of open-source large language models trained from scratch by [01.AI](https://01.ai/).
 
@@ -149,7 +149,7 @@ pipeline_tag: text-generation
 <a href="#top">Back to top ⬆️ </a> ]
 </p>
 
-##
+## News
 
 <details open>
 <summary>🎯 <b>2024/03/06</b>: The Yi-9B is open-sourced and available to the public.</summary>
@@ -211,7 +211,7 @@ sequence length and can be extended to 32K during inference time.
 <a href="#top">Back to top ⬆️ </a> ]
 </p>
 
-##
+## Models
 
 Yi models come in multiple sizes and cater to different use cases. You can also fine-tune Yi models to meet your specific requirements.
 
@@ -272,7 +272,7 @@ Model | Intro | Default context window | Pretrained tokens | Training Data Date
 </p>
 
 
-#
+# How to use Yi?
 
 - [Quick start](#quick-start)
 - [Choose your path](#choose-your-path)
@@ -281,7 +281,7 @@ Model | Intro | Default context window | Pretrained tokens | Training Data Date
 - [conda-lock](#quick-start---conda-lock)
 - [llama.cpp](#quick-start---llamacpp)
 - [Web demo](#web-demo)
-- [Fine-tuning](#
+- [Fine-tuning](#fine-tuning)
 - [Quantization](#quantization)
 - [Deployment](#deployment)
 - [Learning hub](#learning-hub)
@@ -301,7 +301,7 @@ Select one of the following paths to begin your journey with Yi!
 If you prefer to deploy Yi models locally,
 
 - 🙋♀️ and you have **sufficient** resources (for example, NVIDIA A800 80GB), you can choose one of the following methods:
-- [pip](#pip)
+- [pip](#quick-start---pip)
 - [Docker](#quick-start---docker)
 - [conda-lock](#quick-start---conda-lock)
 
@@ -1012,31 +1012,31 @@ With all these resources at your fingertips, you're ready to start your exciting
 </details>
 
 
-#
+# Why Yi?
 
-- [
-- [
-- [
-- [
-- [
-- [
+- [Ecosystem](#ecosystem)
+- [Upstream](#upstream)
+- [Downstream](#downstream)
+- [Serving](#serving)
+- [Quantization](#quantization-1)
+- [Fine-tuning](#fine-tuning-1)
 - [API](#api)
-- [
-- [
-- [
+- [Benchmarks](#benchmarks)
+- [Chat model performance](#chat-model-performance)
+- [Base model performance](#base-model-performance)
 
-##
+## Ecosystem
 
 Yi has a comprehensive ecosystem, offering a range of tools, services, and models to enrich your experiences and maximize productivity.
 
-- [
-- [
-- [
-- [
-- [
+- [Upstream](#upstream)
+- [Downstream](#downstream)
+- [Serving](#serving)
+- [Quantitation](#️quantitation)
+- [Fine-tuning](#️fine-tuning)
 - [API](#api)
 
-###
+### Upstream
 
 The Yi series models follow the same model architecture as Llama. By choosing Yi, you can leverage existing tools, libraries, and resources within the Llama ecosystem, eliminating the need to create new tools and enhancing development efficiency.
 
@@ -1054,7 +1054,7 @@ model = AutoModelForCausalLM.from_pretrained("01-ai/Yi-34b", device_map="auto")
 <a href="#top">Back to top ⬆️ </a> ]
 </p>
 
-###
+### Downstream
 
 > 💡 Tip
 >
@@ -1062,7 +1062,7 @@ model = AutoModelForCausalLM.from_pretrained("01-ai/Yi-34b", device_map="auto")
 >
 > - To help others quickly understand your work, it is recommended to use the format of `<model-name>: <model-intro> + <model-highlights>`.
 
-####
+#### Serving
 
 If you want to get up with Yi in a few minutes, you can use the following services built upon Yi.
 
@@ -1074,7 +1074,7 @@ If you want to get up with Yi in a few minutes, you can use the following servic
 
 - [ScaleLLM](https://github.com/vectorch-ai/ScaleLLM#supported-models): you can use this service to run Yi models locally with added flexibility and customization.
 
-####
+#### Quantization
 
 If you have limited computational capabilities, you can use Yi's quantized models as follows.
 
@@ -1084,7 +1084,7 @@ These quantized models have reduced precision but offer increased efficiency, su
 - [TheBloke/Yi-34B-GGUF](https://huggingface.co/TheBloke/Yi-34B-GGUF)
 - [TheBloke/Yi-34B-AWQ](https://huggingface.co/TheBloke/Yi-34B-AWQ)
 
-####
+#### Fine-tuning
 
 If you're seeking to explore the diverse capabilities within Yi's thriving family, you can delve into Yi's fine-tuned models as below.
 
@@ -1110,12 +1110,12 @@ If you're seeking to explore the diverse capabilities within Yi's thriving famil
 <a href="#top">Back to top ⬆️ </a> ]
 </p>
 
-##
+## Benchmarks
 
-- [
-- [
+- [Chat model performance](#-chat-model-performance)
+- [Base model performance](#-base-model-performance)
 
-###
+### Chat model performance
 
 Yi-34B-Chat model demonstrates exceptional performance, ranking first among all existing open-source models in the benchmarks including MMLU, CMMLU, BBH, GSM8k, and more.
 
@@ -1132,7 +1132,7 @@ Yi-34B-Chat model demonstrates exceptional performance, ranking first among all
 <strong>*</strong>: C-Eval results are evaluated on the validation datasets
 </details>
 
-###
+### Base model performance
 
 #### Yi-34B and Yi-34B-200K
 
@@ -1158,7 +1158,7 @@ Yi-9B is almost the best among a range of similar-sized open-source models (incl
 
 ![Yi-9B benchmark - details](https://github.com/01-ai/Yi/blob/main/assets/img/Yi-9B_benchmark_details.png?raw=true)
 
-- In terms of **overall** ability (Mean-All), Yi-9B performs the best among similarly sized open-source models, surpassing DeepSeek-Coder, DeepSeek-Math, Mistral-7B, SOLAR-10.7B, and Gemma-7B.
+- In terms of **overall** ability (`Mean-All), Yi-9B performs the best among similarly sized open-source models, surpassing DeepSeek-Coder, DeepSeek-Math, Mistral-7B, SOLAR-10.7B, and Gemma-7B.
 
 ![Yi-9B benchmark - overall](https://github.com/01-ai/Yi/blob/main/assets/img/Yi-9B_benchmark_overall.png?raw=true)
 
@@ -1178,7 +1178,7 @@ Yi-9B is almost the best among a range of similar-sized open-source models (incl
 <a href="#top">Back to top ⬆️ </a> ]
 </p>
 
-#
+# Who can use Yi?
 
 Everyone! 🙌 ✅
 
@@ -1190,7 +1190,7 @@ Everyone! 🙌 ✅
 <a href="#top">Back to top ⬆️ </a> ]
 </p>
 
-#
+# Misc.
 
 ### Acknowledgments
 
@@ -1202,7 +1202,7 @@ A heartfelt thank you to each of you who have made contributions to the Yi commu
 <a href="#top">Back to top ⬆️ </a> ]
 </p>
 
-###
+### Disclaimer
 
 We use data compliance checking algorithms during the training process, to
 ensure the compliance of the trained model to the best of our ability. Due to
@@ -1217,7 +1217,7 @@ as well as any associated data security concerns.
 <a href="#top">Back to top ⬆️ </a> ]
 </p>
 
-###
+### License
 
 The source code in this repo is licensed under the [Apache 2.0
 license](https://github.com/01-ai/Yi/blob/main/LICENSE). The Yi series models are fully open for academic research and free for commercial use, with automatic permission granted upon application. All usage must adhere to the [Yi Series Models Community License Agreement 2.1](https://github.com/01-ai/Yi/blob/main/MODEL_LICENSE_AGREEMENT.txt).