Update README.md
README.md CHANGED
@@ -11,11 +11,11 @@ datasets:
---

-# GPT-Code-Clippy-

## Model Description

-GPT-CC-

## Training data

@@ -58,7 +58,7 @@ python run_clm_apps.py \

## Intended Use and Limitations

-The model is

### How to use

@@ -104,11 +104,9 @@ The paper ["Evaluating Large Language Models Trained on Code"](https://arxiv.org

2. **Economic and labor market impacts:** Large language models trained on large code datasets such as this one that are capable of generating high-quality code have the potential to automate part of the software development process. This may negatively impact software developers. However, as discussed in the paper and shown in the Summary Report of software developers from [O*NET OnLine](https://www.onetonline.org/link/summary/15-1252.00), developers don't just write software.

-5. **Biases:** The model is trained on data containing prompt questions formatted in specific way. The performance of the model can be worse if the prompt
-
-
-GPT-CC is finetuned GPT-Neo and might have inhereted biases and limitations from it. See [GPT-Neo model card](https://huggingface.co/EleutherAI/gpt-neo-125M#limitations-and-biases) for details.

## Eval results
---

+# GPT-Code-Clippy-1.3B-APPS-all

## Model Description

+GPT-CC-1.3B-APPS-all is a GPT-Neo-1.3B model fine-tuned on the APPS dataset. This model is specialized to solve programming tasks.

## Training data

## Intended Use and Limitations

+The model is fine-tuned to solve programming problems given a text description and optional starter code.

### How to use
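The diff truncates the snippet under "How to use". As a minimal sketch of the intended workflow, the text description and optional starter code could be assembled into a single prompt and passed to the checkpoint through `transformers`. The `QUESTION:`/`ANSWER:` markers, the helper `build_apps_prompt`, and the checkpoint name below are illustrative assumptions, not taken from this card:

```python
# Sketch only: the prompt layout (QUESTION/ANSWER markers) and the
# checkpoint id are assumptions for illustration, not from this model card.

def build_apps_prompt(question: str, starter_code: str = "") -> str:
    """Combine a problem statement and optional starter code into one prompt."""
    prompt = "\nQUESTION:\n" + question + "\n"
    if starter_code:
        prompt += starter_code + "\n"
    prompt += "\nANSWER:\n"
    return prompt

prompt = build_apps_prompt(
    "Given a list of integers, print their sum.",
    starter_code="def solve(nums):",
)

# Generation step (downloads the fine-tuned checkpoint; the id is hypothetical):
# from transformers import AutoModelForCausalLM, AutoTokenizer
# tokenizer = AutoTokenizer.from_pretrained("flax-community/gpt-code-clippy-1.3B-apps")
# model = AutoModelForCausalLM.from_pretrained("flax-community/gpt-code-clippy-1.3B-apps")
# ids = tokenizer(prompt, return_tensors="pt").input_ids
# print(tokenizer.decode(model.generate(ids, max_new_tokens=128)[0]))
```

Because the model was trained on prompts in one fixed layout, keeping the same markers at inference time matters (see the Biases note below in this card).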

2. **Economic and labor market impacts:** Large language models trained on large code datasets such as this one that are capable of generating high-quality code have the potential to automate part of the software development process. This may negatively impact software developers. However, as discussed in the paper and shown in the Summary Report of software developers from [O*NET OnLine](https://www.onetonline.org/link/summary/15-1252.00), developers don't just write software.

+5. **Biases:** The model is trained on data containing prompt questions formatted in a specific way. The model's performance can be worse if the prompt formatting differs from the one used in the APPS dataset.

+This model is a fine-tuned GPT-Neo and might have inherited biases and limitations from it. See the [GPT-Neo model card](https://huggingface.co/EleutherAI/gpt-neo-125M#limitations-and-biases) for details.

## Eval results