File size: 368 Bytes
c1efa8a
90d1148
 
 
 
 
 
 
 
c1efa8a
90d1148
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
---
license: apache-2.0
base_model: TinyLlama/TinyLlama-1.1B-step-50K-105b
datasets:
- cerebras/SlimPajama-627B
- bigcode/starcoderdata
- monsoon-nlp/greenbeing-proteins
language:
- en
---

# tinyllama-proteinpretrain-quinoa

Continued pretraining of TinyLLaMA-1.1B on the "research" split (quinoa 
protein sequences) of GreenBeing-Proteins dataset.

More details TBD