Commit 00a5d91 by fireballoon
1 parent: 8a52c45

Update README.md

Files changed (1)
  1. README.md +3 -1
README.md CHANGED
@@ -4,6 +4,8 @@ baichuan-vicuna-7b is a chat model supervised finetuned on vicuna sharegpt data.
  - The finetuning data includes [ShareGPT](https://huggingface.co/datasets/anon8231489123/ShareGPT_Vicuna_unfiltered/blob/main/ShareGPT_V3_unfiltered_cleaned_split.json), mixed with [COT](https://huggingface.co/datasets/QingyiSi/Alpaca-CoT) and [Leetcode](https://www.kaggle.com/datasets/erichartford/leetcode-solutions), which are aimed to improve the model's reasoning and coding ability (the data mixing strategy is inspired by [TULU](https://arxiv.org/abs/2306.04751)).
  - The training code is based on [FastChat](https://github.com/lm-sys/FastChat), with the prompt format consisting of [Vicuna v1.1](https://huggingface.co/lmsys/vicuna-7b-delta-v1.1).
  
+ [中文说明](#chinese-model-card)
+
  # Inference with FastChat
  ```
  python3 -m fastchat.serve.cli --model-path fireballoon/baichuan-vicuna-7b
@@ -76,7 +78,7 @@ This algorithm has a runtime complexity of O(log n) and a space complexity of O(
  
  ---
  
- # baichuan-vicuna-7b
+ # Chinese model card
  
  baichuan-vicuna-7b是在vicuna sharegpt数据上全参数微调的对话模型。
  - 基座模型是[baichuan-7B](https://huggingface.co/baichuan-inc/baichuan-7B),由百川智能开发的可商用大规模预训练模型。