Loubna ben allal committed
Commit d97087d • 1 Parent(s): 210cf5b

add architecture info
architectures/codeparrot.txt
ADDED
@@ -0,0 +1,5 @@
+[CodeParrot](https://huggingface.co/lvwerra/codeparrot) uses the GPT-2 architecture with a BPE tokenizer trained on Python code.
+
+| Model | # parameters |
+| - | - |
+| GPT-2 | 1.5B |
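
As a usage sketch (not part of the file above), the CodeParrot checkpoint can be loaded with the `transformers` library for left-to-right code generation; the prompt and generation settings below are illustrative choices, not the demo's actual configuration:

```python
# Minimal sketch: generate Python code with CodeParrot (GPT-2 architecture, BPE tokenizer).
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("lvwerra/codeparrot")
model = AutoModelForCausalLM.from_pretrained("lvwerra/codeparrot")

prompt = "def fibonacci(n):"
inputs = tokenizer(prompt, return_tensors="pt")
# Sample a completion; max_new_tokens and temperature are illustrative values.
outputs = model.generate(**inputs, max_new_tokens=64, do_sample=True, temperature=0.2)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```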
architectures/incoder.txt
ADDED
@@ -0,0 +1,5 @@
+[InCoder](https://huggingface.co/facebook/incoder-6B) uses a decoder-only Transformer trained with a [causal masking objective](https://arxiv.org/abs/2201.07520), so the left-to-right language model can also fill in masked spans of tokens.
+
+| Model | # parameters |
+| - | - |
+| Decoder | 6.7B |
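
A similar sketch for InCoder, assuming the `facebook/incoder-6B` checkpoint is loaded the same way; infilling itself relies on sentinel tokens (e.g. `<|mask:0|>`) whose exact prompt format is documented on the model card and not reproduced here:

```python
# Minimal sketch: left-to-right generation with InCoder. The model can also infill
# masked spans via sentinel tokens such as "<|mask:0|>"; see the model card for the
# exact infilling format, which this sketch does not spell out.
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

# The 6.7B checkpoint is large; loading in half precision is an illustrative choice.
tokenizer = AutoTokenizer.from_pretrained("facebook/incoder-6B")
model = AutoModelForCausalLM.from_pretrained("facebook/incoder-6B", torch_dtype=torch.float16)

prompt = "def count_lines(path):\n    "
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=48, do_sample=True, temperature=0.2)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```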
architectures/opt.txt
ADDED
@@ -0,0 +1,5 @@
+[OPT](https://huggingface.co/facebook/opt-30b) is a decoder-only model like GPT-3. It was trained on datasets containing only a small portion of code. In this demo we use the 30B-parameter model; the largest OPT model has 175B parameters.
+
+| Model | # parameters |
+| - | - |
+| Decoder | 30B |
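
For OPT, the sketch below assumes half precision and `device_map="auto"` (via the `accelerate` library) purely to fit the 30B-parameter checkpoint in memory; this is not the demo's actual serving setup:

```python
# Minimal sketch: prompting OPT-30B for code. A 30B-parameter checkpoint needs multiple
# GPUs or offloading; half precision and device_map="auto" are illustrative choices.
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("facebook/opt-30b")
model = AutoModelForCausalLM.from_pretrained(
    "facebook/opt-30b", torch_dtype=torch.float16, device_map="auto"
)

prompt = "# Python function that checks if a number is prime\ndef is_prime(n):"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64, do_sample=True, temperature=0.2)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```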