tiiuae
/

falcon-7b-instruct

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Fix minor typo

#45

by pcuenq HF staff - opened Jun 23, 2023

base: refs/heads/main

←

from: refs/pr/45

Discussion Files changed

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -163,7 +163,7 @@ Falcon-7B is a causal decoder-only model trained on a causal language modeling t
 The architecture is broadly adapted from the GPT-3 paper ([Brown et al., 2020](https://arxiv.org/abs/2005.14165)), with the following differences:
-* **Positionnal embeddings:** rotary ([Su et al., 2021](https://arxiv.org/abs/2104.09864));
 * **Attention:** multiquery ([Shazeer et al., 2019](https://arxiv.org/abs/1911.02150)) and FlashAttention ([Dao et al., 2022](https://arxiv.org/abs/2205.14135));
 * **Decoder-block:** parallel attention/MLP with a single layer norm.

 The architecture is broadly adapted from the GPT-3 paper ([Brown et al., 2020](https://arxiv.org/abs/2005.14165)), with the following differences:
+* **Positional embeddings:** rotary ([Su et al., 2021](https://arxiv.org/abs/2104.09864));
 * **Attention:** multiquery ([Shazeer et al., 2019](https://arxiv.org/abs/1911.02150)) and FlashAttention ([Dao et al., 2022](https://arxiv.org/abs/2205.14135));
 * **Decoder-block:** parallel attention/MLP with a single layer norm.