YX-Cerebras committed
Commit 94dd770 • 1 Parent(s): 0b6d290
Update README.md
README.md CHANGED
@@ -16,7 +16,7 @@ license: apache-2.0
 
 # BTLM-3B-8k-chat
 
-BTLM-3B-8k-chat is a chat version of the [BTLM-3B-8K](cerebras/btlm-3b-8k-base) model, trained using the [DPO](https://arxiv.org/abs/2305.18290) method on the [Anthropic-HH-RLHF](Anthropic/hh-rlhf) dataset. The model was specifically trained to align to human preferences and optimized for dialogue use cases.
+BTLM-3B-8k-chat is a chat version of the [BTLM-3B-8K-base](cerebras/btlm-3b-8k-base) model, trained using the [DPO](https://arxiv.org/abs/2305.18290) method on the [Anthropic-HH-RLHF](Anthropic/hh-rlhf) dataset. The model was specifically trained to align to human preferences and optimized for dialogue use cases.
 
 
 
@@ -107,7 +107,7 @@ Table 1: Detailed down-stream tasks comparisons. MMLU task performance is report
 - Lora r: 128
 - Lora alpha: 16
 - Beta: 0.05
-- Learn more: [BTLM-3B-8k-chat blog](
+- Learn more: [BTLM-3B-8k-chat blog](https://www.cerebras.net/blog/fine-tuning-language-models-using-direct-preference-optimization)
 
 
 ## Uses and Limitations
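The hyperparameters listed in the second hunk (LoRA r=128, LoRA alpha=16, beta=0.05) describe a standard LoRA-based DPO setup. As a point of reference, here is a minimal, hypothetical sketch of how those values would slot into an open-source DPO run with the TRL and PEFT libraries. It is not the commit author's or Cerebras' actual training code: only the base model name, the dataset, and the three reported hyperparameters come from the README, while the prompt-splitting heuristic, output directory, and library-version details are assumptions.

```python
# Hypothetical sketch only: NOT the authors' training code. It shows how the
# hyperparameters reported in the README (LoRA r=128, LoRA alpha=16,
# DPO beta=0.05) would map onto a standard TRL DPOTrainer run.
from datasets import load_dataset
from peft import LoraConfig
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer

model_name = "cerebras/btlm-3b-8k-base"
model = AutoModelForCausalLM.from_pretrained(model_name, trust_remote_code=True)
tokenizer = AutoTokenizer.from_pretrained(model_name)


def split_transcript(example):
    # hh-rlhf stores full transcripts; DPO needs prompt/chosen/rejected.
    # Treat everything up to the final assistant turn as the prompt
    # (a simplification for illustration only).
    marker = "\n\nAssistant:"
    cut = example["chosen"].rfind(marker) + len(marker)
    return {
        "prompt": example["chosen"][:cut],
        "chosen": example["chosen"][cut:],
        "rejected": example["rejected"][example["rejected"].rfind(marker) + len(marker):],
    }


dataset = load_dataset("Anthropic/hh-rlhf", split="train").map(split_transcript)

# LoRA adapter sized as reported in the README; custom architectures like
# BTLM may additionally need explicit target_modules.
peft_config = LoraConfig(r=128, lora_alpha=16, task_type="CAUSAL_LM")

# Beta is the DPO temperature; 0.05 is the value reported above.
training_args = DPOConfig(beta=0.05, output_dir="btlm-3b-8k-chat-dpo")

trainer = DPOTrainer(
    model=model,
    args=training_args,
    train_dataset=dataset,
    processing_class=tokenizer,  # `tokenizer=` on older TRL releases
    peft_config=peft_config,  # TRL derives the frozen reference model from PEFT
)
trainer.train()
```

One practical note on the LoRA-plus-DPO combination: when a `peft_config` is passed, TRL can use the base weights with adapters disabled as the implicit reference policy, avoiding a second full copy of the 3B model in memory.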