protylopus / README.md
epinnock's picture
Update README.md
2e10fb9
|
raw
history blame
470 Bytes
metadata
license: bigcode-openrail-m
datasets:
  - tiiuae/falcon-refinedweb
language:
  - en

This a ~90m assistant model for cameloid models like LLama/Alpaca/Vicuna/Guanaco that use the llama tokenizer, allowing for speedups up to 3x with greed sampling. Its trained on 5.5 billion tokens of refinedweb and uses the GPTBigcode architecture and has a context window: 1024. To use please see this article on assisted generation https://huggingface.co/blog/assisted-generation.