license: mit | |
datasets: | |
- HuggingFaceFW/fineweb | |
library_name: Transformers | |
pipeline_tag: text-generation | |
This is a Llama 2 architecture model series trained on the FineWeb dataset. This is ~500 Million model and uses tiktoken cl100k_base model as tokenizer |