---
license: apache-2.0
base_model: TinyLlama/TinyLlama-1.1B-step-50K-105b
datasets:
- cerebras/SlimPajama-627B
- bigcode/starcoderdata
- monsoon-nlp/greenbeing-proteins
language:
- en
---
# tinyllama-proteinpretrain-quinoa
Continued pretraining of TinyLlama-1.1B (the `step-50K-105b` checkpoint) on the
"research" split (quinoa protein sequences) of the GreenBeing-Proteins dataset.
More details TBD
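
Until those details are filled in, here is a minimal sketch of how a checkpoint like this might be loaded and sampled with the `transformers` library. Two things in it are assumptions, not facts from this card: the Hub repo id (guessed from the card title and the dataset's `monsoon-nlp` namespace) and the input format (a bare amino-acid prefix with no special prompt template).

```python
# Assumption: this repo id is inferred from the card title and the dataset
# namespace; replace it with the actual id of this model on the Hub.
MODEL_ID = "monsoon-nlp/tinyllama-proteinpretrain-quinoa"


def load_model(model_id: str = MODEL_ID):
    """Load the tokenizer and causal LM weights (downloads on first call)."""
    # Imported lazily so that defining these helpers needs no heavy dependency.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)
    return tokenizer, model


def continue_sequence(prefix: str, max_new_tokens: int = 32,
                      model_id: str = MODEL_ID) -> str:
    """Sample a continuation of a text prefix from the model."""
    tokenizer, model = load_model(model_id)
    inputs = tokenizer(prefix, return_tensors="pt")
    output_ids = model.generate(
        **inputs,
        max_new_tokens=max_new_tokens,
        do_sample=True,  # sample rather than decode greedily
        top_p=0.9,
    )
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


if __name__ == "__main__":
    # Hypothetical amino-acid prefix, used only for illustration.
    print(continue_sequence("MKTLLV"))
```

Because this is a continued-pretraining checkpoint rather than an instruction-tuned model, plain text continuation is the most likely way to use it; the sampling settings above are arbitrary starting points.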