
CodeParrot uses the GPT-2 architecture with a BPE tokenizer trained on Python code. We released this model as an educational tool for training large language models from scratch on code, with detailed tutorials and descriptions of the training process. The training code uses Accelerate for distributed training and mixed precision. See this blog and repo for more details.
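To give a feel for what "a BPE tokenizer trained on Python code" means, here is a minimal sketch of how byte-pair encoding learns merge rules from a corpus. It is a simplified, character-level toy written for illustration: the function name `learn_bpe_merges` and the sample corpus are hypothetical, and the actual tokenizer is built with a production tokenizer library rather than code like this.

```python
from collections import Counter


def learn_bpe_merges(corpus, num_merges):
    """Learn up to `num_merges` BPE merge rules from a list of strings.

    Toy, character-level version: words are split on whitespace and
    merges never cross word boundaries.
    """
    # Represent each word as a tuple of symbols, starting from single characters.
    words = Counter(tuple(word) for line in corpus for word in line.split())
    merges = []
    for _ in range(num_merges):
        # Count how often each adjacent symbol pair occurs, weighted by word frequency.
        pairs = Counter()
        for symbols, freq in words.items():
            for pair in zip(symbols, symbols[1:]):
                pairs[pair] += freq
        if not pairs:
            break  # every word is already a single symbol
        # Pick the most frequent pair (ties go to the pair seen first).
        best = max(pairs, key=pairs.get)
        merges.append(best)
        # Rewrite every word, fusing each occurrence of the chosen pair.
        new_words = Counter()
        for symbols, freq in words.items():
            merged, i = [], 0
            while i < len(symbols):
                if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == best:
                    merged.append(symbols[i] + symbols[i + 1])
                    i += 2
                else:
                    merged.append(symbols[i])
                    i += 1
            new_words[tuple(merged)] += freq
        words = new_words
    return merges


# Hypothetical sample corpus of Python-like text.
sample = ["def add(a, b):", "return a + b", "def sub(a, b):", "return a - b"]
print(learn_bpe_merges(sample, num_merges=5))
```

Because the merges are learned from code, frequent Python fragments such as `def` or `return` tend to collapse into single tokens early, which is the motivation for training the tokenizer on code instead of reusing one trained on natural language.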

| Model | # parameters |
| - | - |
| GPT2 | 1.5B |
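As a sanity check on the 1.5B figure, the parameter count of a GPT-2 style model can be derived from four hyperparameters. The sketch below assumes the standard GPT-2 XL settings (48 layers, hidden size 1600, 1024 positions, 50,257-token vocabulary) and tied input/output embeddings. These values are an assumption for illustration: CodeParrot's exact configuration is not listed here, and a different vocabulary size would change the embedding term.

```python
def gpt2_param_count(vocab_size, n_positions, n_embd, n_layer):
    """Count trainable parameters of a GPT-2 style decoder.

    Assumes the output projection shares weights with the token embedding,
    so it is not counted separately.
    """
    d = n_embd
    # Token embeddings plus learned position embeddings.
    embeddings = vocab_size * d + n_positions * d
    # Attention: fused QKV projection and output projection (weights + biases).
    attention = (d * 3 * d + 3 * d) + (d * d + d)
    # Feed-forward: expand to 4*d, then project back to d (weights + biases).
    mlp = (d * 4 * d + 4 * d) + (4 * d * d + d)
    # Two layer norms per block, each with a scale and a bias vector.
    layer_norms = 2 * (2 * d)
    per_block = attention + mlp + layer_norms
    # Final layer norm after the last block.
    final_layer_norm = 2 * d
    return embeddings + n_layer * per_block + final_layer_norm


# Assumed GPT-2 XL hyperparameters (not taken from this document).
total = gpt2_param_count(vocab_size=50257, n_positions=1024, n_embd=1600, n_layer=48)
print(f"{total:,}")  # 1,557,611,200
```

Almost all of the parameters sit in the transformer blocks (roughly 12 * n_embd^2 per layer), so the count is dominated by depth and hidden size rather than by the vocabulary.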