Commit 9ba0a0b by Muennighoff (1 parent: b9c92c5)

Update README.md

Files changed (1)
  1. README.md +29 -0
README.md CHANGED
@@ -61,6 +61,35 @@ OctoCoder is an instruction tuned model with 15.5B parameters created by finetun
  - **Repository:** [bigcode/octopack](https://github.com/bigcode-project/octopack)
  - **Paper:** [TODO]()
  - **Languages:** 80+ Programming languages
+ - **OctoPack🐙🎒:**
+
+ <table>
+ <tr>
+ <th>Data</th>
+ <td><a href=https://huggingface.co/datasets/bigcode/commitpack>CommitPack</a></td>
+ <td>4TB of GitHub commits across 350 programming languages</td>
+ </tr>
+ <tr>
+ <th></th>
+ <td><a href=https://huggingface.co/datasets/bigcode/commitpackft>CommitPackFT</a></td>
+ <td>Filtered version of CommitPack for high-quality commit messages that resemble instructions</td>
+ </tr>
+ <tr>
+ <th>Model</th>
+ <td><a href=https://huggingface.co/bigcode/octocoder>OctoCoder</a></td>
+ <td>StarCoder (16B parameters) instruction tuned on CommitPackFT + OASST</td>
+ </tr>
+ <tr>
+ <th></th>
+ <td><a href=https://huggingface.co/bigcode/octogeex>OctoGeeX</a></td>
+ <td>CodeGeeX2 (6B parameters) instruction tuned on CommitPackFT + OASST</td>
+ </tr>
+ <tr>
+ <th>Evaluation</th>
+ <td><a href=https://huggingface.co/datasets/bigcode/humanevalpack>HumanEvalPack</a></td>
+ <td>Extension of OpenAI's HumanEval to cover 3 scenarios across 6 languages</td>
+ </tr>
+ </table>
 
  ## Use