Salesforce Introduces XGen-7B, a new 7B LLM trained on 8K seq. length for 1.5T tokens. #91
Results are comparable to or better than MPT, Falcon, LLaMA, and OpenLLaMA on text and code tasks.
Blog: http://blog.salesforceairesearch.com/xgen/
github: https://github.com/salesforce/xgen
On standard NLP benchmarks, XGen achieves comparable or better results when compared with state-of-the-art open-source LLMs (e.g., MPT, Falcon, LLaMA, RedPajama, OpenLLaMA) of similar model size.
Our targeted evaluation on long sequence modeling benchmarks shows benefits of our 8K-seq models over 2K- and 4K-seq models.
XGen-7B achieves equally strong results in both text (e.g., MMLU, QA) and code (HumanEval) tasks.
changed discussion status to closed