mayank-mishra committed on
Commit
322a980
1 Parent(s): a55dc10

add mmodel

Files changed (2)
  1. README.md +18 -0
  2. model.pt +3 -0
README.md CHANGED
@@ -1,3 +1,21 @@
  ---
  license: openrail
  ---
+
+ # GPTQ-for-StarCoder
+ Visit [GPTQ-for-SantaCoder](https://github.com/mayank31398/GPTQ-for-SantaCoder) for instructions on how to use the model weights here.
+ If you want 8-bit weights, visit [starcoderbase-GPTQ-8bit-128g](https://huggingface.co/mayank31398/starcoderbase-GPTQ-8bit-128g).
+
+ ## Results
+ | StarCoderBase                            | Bits | group-size | memory(MiB) | wikitext2 | ptb    | c4     | stack | checkpoint size(MB) |
+ | ---------------------------------------- | ---- | ---------- | ----------- | --------- | ------ | ------ | ----- | ------------------- |
+ | FP32                                     | 32   | -          |             | 10.172    | 15.756 | 12.736 | 1.692 | 59195               |
+ | BF16                                     | 16   | -          |             | 10.173    | 15.765 | 12.745 | 1.692 | 29597               |
+ | [GPTQ](https://arxiv.org/abs/2210.17323) | 8    | 128        |             | 10.174    | 15.767 | 12.739 | 1.692 | 16163               |
+ | [GPTQ](https://arxiv.org/abs/2210.17323) | 4    | 128        |             | 10.387    | 16.056 | 13.005 | 1.708 | 8877                |
+
+ # License
+ The model is licensed under the CodeML Open RAIL-M v0.1 license. You can find the full license [here](https://huggingface.co/spaces/bigcode/license).
+
+ # Acknowledgements
+ Thanks to everyone in BigCode who worked so hard to create these code models.
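The Results table in the README lends itself to a quick size-versus-quality comparison. The following is a minimal sketch, not part of the commit: it copies the checkpoint sizes and wikitext2 perplexities from the table and reports each row relative to the FP32 baseline. The row labels (`GPTQ-8bit-128g`, `GPTQ-4bit-128g`) and the helper name `compare_to_baseline` are hypothetical, chosen only for illustration.

```python
# Values copied from the README's Results table.
# "size" is the "checkpoint size" column; "wikitext2" is the perplexity column.
# The row labels are illustrative names, not identifiers from the repository.
ROWS = {
    "FP32": {"bits": 32, "size": 59195, "wikitext2": 10.172},
    "BF16": {"bits": 16, "size": 29597, "wikitext2": 10.173},
    "GPTQ-8bit-128g": {"bits": 8, "size": 16163, "wikitext2": 10.174},
    "GPTQ-4bit-128g": {"bits": 4, "size": 8877, "wikitext2": 10.387},
}


def compare_to_baseline(rows, baseline="FP32"):
    """Return the compression ratio and perplexity increase of each row vs. the baseline."""
    base = rows[baseline]
    summary = {}
    for name, row in rows.items():
        summary[name] = {
            # How many times smaller the checkpoint is than the baseline.
            "compression": base["size"] / row["size"],
            # Relative wikitext2 perplexity increase, in percent.
            "ppl_increase_pct": 100.0
            * (row["wikitext2"] - base["wikitext2"])
            / base["wikitext2"],
        }
    return summary


summary = compare_to_baseline(ROWS)
for name, stats in summary.items():
    print(
        f"{name:16s} {stats['compression']:.2f}x smaller, "
        f"wikitext2 perplexity +{stats['ppl_increase_pct']:.2f}%"
    )
```

Under these numbers, the 4-bit checkpoint is between six and seven times smaller than FP32 while its wikitext2 perplexity rises by roughly two percent; the other benchmark columns could be compared the same way.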
model.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:40512cdba1a2f65d23b2f32c11ac9723d4fd187682d0991b9d7a2e5a102d5e43
+ size 9308143245
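The three lines added for `model.pt` are a Git LFS pointer rather than the weights themselves: a spec version, a SHA-256 object ID, and the object size in bytes. As a rough sketch (the function name `parse_lfs_pointer` is hypothetical and the parser handles only the three keys shown here), the pointer can be read and its size converted to MiB like this:

```python
# Pointer text copied from the diff above.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:40512cdba1a2f65d23b2f32c11ac9723d4fd187682d0991b9d7a2e5a102d5e43
size 9308143245
"""


def parse_lfs_pointer(text):
    """Parse a Git LFS pointer made of 'key value' lines into a dict."""
    fields = {}
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form '<hash algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size_bytes": int(fields["size"]),
    }


pointer = parse_lfs_pointer(POINTER_TEXT)
size_mib = pointer["size_bytes"] / 2**20  # bytes to MiB
print(f"{pointer['hash_algo']} object, {size_mib:.1f} MiB")
```

Rounded to the nearest whole number, the size comes to 8877 MiB, which matches the checkpoint size listed for the 4-bit, group-size 128 row of the Results table. That suggests, though the commit does not state it, that this file holds the 4-bit weights and that the table's "MB" column is measured in MiB.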