---
language:
- code
license: apache-2.0
tags:
- code
- gpt2
- generation
datasets:
- "codeparrot/codeparrot-clean"
- "codeparrot/github-jupyter-text-to-code"
---

# CodeParrot 🦜 small for text-to-code generation

This model is [CodeParrot-small](https://huggingface.co/codeparrot/codeparrot-small) (from the `megatron` branch) fine-tuned on [github-jupyter-text-to-code](https://huggingface.co/datasets/codeparrot/github-jupyter-text-to-code), a dataset in which each sample is a succession of docstrings and their corresponding Python code, originally extracted from Jupyter notebooks parsed in this [dataset](https://huggingface.co/datasets/codeparrot/github-jupyter-parsed).
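
Since the fine-tuning samples pair docstrings with the code that follows them, one plausible way to prompt the model is to write the instruction as a docstring and let the model continue with code. The snippet below is a minimal sketch under that assumption, using the `transformers` text-generation pipeline. The helper names `build_prompt` and `generate_code` are hypothetical, the triple-quote prompt layout is a guess at the dataset format rather than a documented one, and `model_id` is left as a parameter because this card does not state the model's Hub id.

```python
def build_prompt(instruction: str) -> str:
    """Wrap a natural-language instruction in a docstring (assumed prompt layout)."""
    # Assumption: the model expects a triple-quoted docstring followed by code.
    return f'"""\n{instruction.strip()}\n"""\n'


def generate_code(instruction: str, model_id: str, max_new_tokens: int = 64) -> str:
    """Generate code for an instruction with a causal LM hosted on the Hub."""
    # Imported lazily so build_prompt works without transformers installed.
    from transformers import pipeline

    generator = pipeline("text-generation", model=model_id)
    prompt = build_prompt(instruction)
    outputs = generator(
        prompt,
        max_new_tokens=max_new_tokens,
        do_sample=False,          # greedy decoding for reproducible output
        return_full_text=False,   # return only the continuation, not the prompt
    )
    return outputs[0]["generated_text"]
```

To try it, call `generate_code("Load a CSV file into a pandas DataFrame", model_id=...)` with the Hub id of this model. Greedy decoding is used here only to keep the output deterministic; sampling parameters such as `temperature` or `top_p` may give more varied completions.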