Update README.md
README.md
CHANGED
@@ -49,6 +49,7 @@ During the incremental training process, we used 160 A100s with a total of 40GB
Throughout the training process, we encountered various issues such as machine crashes, underlying framework bugs, and loss spikes. However, we ensured the stability of the incremental training by making rapid adjustments. We have also released the loss curve from the training process so that everyone can see the kinds of issues that may arise.

<img src="https://huggingface.co/datasets/suolyer/testb/resolve/main/loss.png" width=1000 height=600>
+
### 多任务有监督微调 Supervised finetuning

In the multi-task supervised finetuning stage, we adopted curriculum learning and incremental (continual) training strategies: a large model was used to help grade the difficulty of the existing data, and SFT training was then carried out in multiple stages in an "Easy To Hard" order.
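The Easy-To-Hard schedule above is only described at a high level. Below is a minimal sketch of such a staged curriculum, assuming a difficulty scorer (in practice backed by a large model, per the README) and a per-stage SFT runner; the names `score_difficulty` and `run_sft_stage`, the three-stage split, and the length-based toy scorer are illustrative assumptions, not code from this repository.

```python
"""Minimal sketch of an "Easy To Hard" staged SFT schedule.
All names and thresholds here are illustrative assumptions."""
from typing import Callable, Dict, List


def split_by_difficulty(
    samples: List[Dict],
    score_difficulty: Callable[[Dict], float],
    num_stages: int = 3,
) -> List[List[Dict]]:
    """Grade every SFT sample, then split the data into buckets from easiest to hardest."""
    scored = sorted(samples, key=score_difficulty)          # easy -> hard
    stage_size = max(1, len(scored) // num_stages)
    stages = [scored[i * stage_size:(i + 1) * stage_size] for i in range(num_stages - 1)]
    stages.append(scored[(num_stages - 1) * stage_size:])   # remainder goes to the hardest stage
    return stages


def curriculum_sft(
    samples: List[Dict],
    score_difficulty: Callable[[Dict], float],
    run_sft_stage: Callable[[str, List[Dict]], str],
    init_checkpoint: str,
    num_stages: int = 3,
) -> str:
    """Run SFT stage by stage, always resuming from the previous stage's
    checkpoint (continual training), moving from easy to hard data."""
    checkpoint = init_checkpoint
    for stage_id, stage_data in enumerate(split_by_difficulty(samples, score_difficulty, num_stages)):
        print(f"stage {stage_id}: {len(stage_data)} samples, resuming from {checkpoint}")
        checkpoint = run_sft_stage(checkpoint, stage_data)   # returns the new checkpoint path
    return checkpoint


if __name__ == "__main__":
    # Toy usage: difficulty is approximated by prompt length here; the README
    # instead uses a large model to grade difficulty.
    data = [{"prompt": "p" * n, "response": "r"} for n in (5, 50, 500, 20, 200)]
    final = curriculum_sft(
        data,
        score_difficulty=lambda ex: len(ex["prompt"]),
        run_sft_stage=lambda ckpt, batch: ckpt + "+1",       # stand-in for a real SFT run
        init_checkpoint="base",
    )
    print("final checkpoint:", final)
```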
@@ -76,7 +77,7 @@ We implemented the HFT training process on an internally developed framework, wh

### 效果评估 Performance

-
+<img src="" width=1000 height=600>


## 使用 Usage