LnL-AI
/

dbrx-base-converted-v2

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Qubitium commited on Mar 31, 2024

Commit

a1307ee

·

verified ·

1 Parent(s): 103afef

Update README.md

Files changed (1) hide show

README.md +4 -5

README.md CHANGED Viewed

@@ -9,15 +9,14 @@ Special thanks to https://huggingface.co/fahadh4ilyas
 convert_v2.py
 ```
-Training Notes:
 ```
-# 1. dbrx trains like a much smaller model (~7B)
 # start with this as reference point and move up or down based on eval/train loss
 learning_rate = 1.5e-5
-# 2. due to BPE (tiktoken) nature, tokenizer expansion/resize is not very friendly to training
-# use text based special tokens if you need/use extra tokens to avoid bad train/eval losses
 ```
 Known Issues:

 convert_v2.py
 ```
+Training Notes/Observations:
+1. dbrx trains like a much smaller model (~7B)
 ```
 # start with this as reference point and move up or down based on eval/train loss
 learning_rate = 1.5e-5
 ```
+2. Due to nature of BPE (tiktoken), tokenizer expansion/resize is not very friendly to training. Use text based special tokens if you need/use extra tokens to avoid bad train/eval losses
 Known Issues: