Update README.md
Browse files
README.md
CHANGED
@@ -63,6 +63,8 @@ To simplify the comparison, we chosed the Pass@1 metric for the Python language,
|
|
63 |
| --- |----------------------------------------------------------------------------- |
|
64 |
| phi-2 | 48.2% |
|
65 |
| **opencsg-phi-2-v0.1** |**54.3%**|
|
|
|
|
|
66 |
|
67 |
|
68 |
|
@@ -164,7 +166,8 @@ HumanEval 是评估模型在代码生成方面性能的最常见的基准,尤
|
|
164 |
| --- |----------------------------------------------------------------------------- |
|
165 |
| phi-2 | 48.2% |
|
166 |
| **opencsg-phi-2-v0.1** |**54.3%**|
|
167 |
-
|
|
|
168 |
|
169 |
|
170 |
|
|
|
63 |
| --- |----------------------------------------------------------------------------- |
|
64 |
| phi-2 | 48.2% |
|
65 |
| **opencsg-phi-2-v0.1** |**54.3%**|
|
66 |
+
| stable-coder-3b | 29.3%|
|
67 |
+
| **opencsg-stable-coder-3b-v1**| **46.3%** |
|
68 |
|
69 |
|
70 |
|
|
|
166 |
| --- |----------------------------------------------------------------------------- |
|
167 |
| phi-2 | 48.2% |
|
168 |
| **opencsg-phi-2-v0.1** |**54.3%**|
|
169 |
+
| stable-coder-3b | 29.3%|
|
170 |
+
| **opencsg-stable-coder-3b-v1**| **46.3%** |
|
171 |
|
172 |
|
173 |
|