Text Generation
Transformers
PyTorch
English
llama
sft
Inference Endpoints
text-generation-inference
jordiclive committed on
Commit 2bc2c75
1 Parent(s): ccfe9f2

Update README.md

Files changed (1)
  1. README.md +7 -8
README.md CHANGED
@@ -69,11 +69,10 @@ This model was trained on:
  - [togethercomputer/RedPajama-Data-1T-Sample](https://huggingface.co/datasets/togethercomputer/RedPajama-Data-1T)
  - [atom-in-the-universe/fanfics-10k-50k](https://huggingface.co/datasets/atom-in-the-universe/fanfics-10k-50k)

- The dataset [shahules786/orca-chat](https://huggingface.co/datasets/shahules786/orca-chat) combines similar
- examples of the GPT-4 subset of [ehartford/dolphin](https://huggingface.co/datasets/ehartford/dolphin) to form longer conversations
- to improve long-context trainig.
+ The dataset [shahules786/orca-chat](https://huggingface.co/datasets/shahules786/orca-chat) combines similar examples of the GPT-4 subset of [ehartford/dolphin](https://huggingface.co/datasets/ehartford/dolphin) to form longer conversations
+ to improve long-context training.

- RedPajama and FanFics were additionally used for classic language modelling to fine-tune the RoPE scaling for 8k context size.
+ Additionally, RedPajama and FanFics were used for classic language modelling as an auxiliary task to improve the RoPE scaling for the 8k context size.


  ## Model Configuration
@@ -130,15 +129,15 @@ llama2_13b_orca_8k:
  # Developers

  - [shahules786](https://github.com/shahules786)
- - [jordicli](https://github.com/jordiclive)
+ - [jordiclive](https://github.com/jordiclive)
  - [andreaskoepf](https://github.com/andreaskoepf/)

  # Special Thanks

- We want to especially thank Eric Hardford who spared no expense in replicating ORCA and making it available at [ehartford/dolphin](https://huggingface.co/datasets/ehartford/dolphin)!
- Also shoutout to the whole team working on [LLongMA-2-13b](https://huggingface.co/conceptofmind/LLongMA-2-13b) & the [scaled-rope](https://github.com/jquesnelle/scaled-rope) repository for their awesome work: bloc97, jquesnelle & conceptofmind!
+ We want to especially thank Eric Hartford who spared no expense in replicating ORCA and making it available at [ehartford/dolphin](https://huggingface.co/datasets/ehartford/dolphin)!
+ Also, shoutout to the whole team working on [LLongMA-2-13b](https://huggingface.co/conceptofmind/LLongMA-2-13b) & the [scaled-rope](https://github.com/jquesnelle/scaled-rope) repository for their awesome work: bloc97, jquesnelle & conceptofmind!

- The whole Open-Assistant team is very grateful for the continued support of [Redmond.ai](https://redmond.ai/) who sponsored the training compute for this model.
+ The whole Open-Assistant team is very grateful for the continued support of [Redmond.ai](https://redmond.ai/) who sponsored the training compute required for this model.

  # License
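
For readers unfamiliar with the RoPE scaling referenced in the updated README text, the sketch below illustrates what linear position scaling does in general terms. It is not the training code from this repository; the scale factor of 2.0 (stretching LLaMA-2's native 4096-token context to 8192 tokens) and the head dimension of 128 are assumptions chosen purely for illustration.

```python
import torch

# Minimal sketch of linear RoPE position scaling (illustration only, not the
# repository's training code). Assumption: LLaMA-2's native 4096-token context
# is stretched to 8192 tokens, i.e. a scale factor of 2.0.
def scaled_rope_tables(seq_len: int, head_dim: int, base: float = 10000.0, scale: float = 2.0):
    # Standard rotary inverse frequencies, one per pair of channels.
    inv_freq = 1.0 / (base ** (torch.arange(0, head_dim, 2).float() / head_dim))
    # Linear scaling divides the position index by `scale`, so positions up to
    # 8191 are mapped back into the 0..4095 range seen during pre-training.
    positions = torch.arange(seq_len).float() / scale
    angles = torch.outer(positions, inv_freq)  # shape: (seq_len, head_dim // 2)
    return angles.cos(), angles.sin()          # tables applied to queries/keys

cos, sin = scaled_rope_tables(seq_len=8192, head_dim=128)
print(cos.shape)  # torch.Size([8192, 64])
```

Under this reading, the classic language-modelling pass over RedPajama and FanFics mentioned in the diff serves to adapt the model to these rescaled positions; the actual scaling configuration used for this model is the one in the README's Model Configuration section and the credited scaled-rope work, not this sketch.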