Text Generation
Transformers
PyTorch
llama
text-generation-inference
Inference Endpoints
license: apache-2.0
datasets:
  - datajuicer/alpaca-cot-en-refined-by-data-juicer

This is a reference LLM from Data-Juicer.

The model uses the LLaMA-7B architecture and is initialized from the pre-trained LLaMA-7B checkpoint. It is fine-tuned on 40k English chat samples from Data-Juicer's refined Alpaca-CoT data. In GPT-4 evaluation, it outperforms LLaMA-7B fine-tuned on 52k Alpaca samples.
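As a minimal sketch of how a checkpoint like this might be loaded for text generation with the Transformers library: the helper name `generate`, its parameters, and the device and dtype choices below are illustrative assumptions, not part of this model card. The repository id is deliberately left as an argument, and this card does not specify a prompt template, so the prompt is passed through unchanged.

```python
def generate(model_id: str, prompt: str, max_new_tokens: int = 128) -> str:
    """Load a causal LM checkpoint and return the text generated after `prompt`.

    `model_id` is whatever Hub repository id or local path holds the weights;
    it is an argument here because this sketch does not assume a specific id.
    """
    # Imported lazily so the heavy dependencies are only needed when generating.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Assumption: use half precision on a GPU when available, else float32 on CPU.
    device = "cuda" if torch.cuda.is_available() else "cpu"
    dtype = torch.float16 if device == "cuda" else torch.float32

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=dtype)
    model.to(device)
    model.eval()

    inputs = tokenizer(prompt, return_tensors="pt").to(device)
    with torch.no_grad():
        output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Slice off the prompt tokens so only the newly generated text is returned.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `generate(model_id, "Explain what data deduplication is.")` with the repository id or local path of this checkpoint would return the completion as a string. A 7B model needs roughly 14 GB of memory for the weights in half precision, so a GPU is advisable.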

For more details, please refer to our paper.

[Figure: exp_llama]