dranger003 committed
Commit bb0ee84
1 Parent(s): 7141798
Update README.md
README.md
CHANGED
@@ -1,3 +1,12 @@
 ---
 license: bigcode-openrail-m
+pipeline_tag: text-generation
+library_name: gguf
 ---
+GGUF quants for https://huggingface.co/bigcode/starcoder2-15b
+
+> StarCoder2-15B model is a 15B parameter model trained on 600+ programming languages from The Stack v2, with opt-out requests excluded. The model uses Grouped Query Attention, a context window of 16,384 tokens with a sliding window attention of 4,096 tokens, and was trained using the Fill-in-the-Middle objective on 4+ trillion tokens.
+
+| Layers | Context | [Template (Text Representation)](https://github.com/ContextualAI/gritlm?tab=readme-ov-file#inference) | [Template (Text Generation)](https://github.com/ContextualAI/gritlm?tab=readme-ov-file#inference) |
+| --- | --- | --- | --- |
+| <pre>40</pre> | <pre>16384</pre> | <pre>{context}<br><br>Code Editing Instruction: {prompt}<br>{response}</pre> |
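As a rough illustration of the template in the last table row, the sketch below fills in its three placeholders with plain Python string formatting. It assumes each `<br>` in the table cell stands for a newline. The `build_prompt` helper and the sample inputs are hypothetical and not part of the README.

```python
# Template copied from the table row; each <br> is read as a newline (an assumption).
TEMPLATE = "{context}\n\nCode Editing Instruction: {prompt}\n{response}"


def build_prompt(context: str, prompt: str, response: str = "") -> str:
    """Fill the template. Leaving response empty lets the model continue from there."""
    return TEMPLATE.format(context=context, prompt=prompt, response=response)


# Hypothetical inputs: a buggy snippet as context and an editing instruction.
example = build_prompt(
    context="def add(a, b):\n    return a - b",
    prompt="Fix the bug in add.",
)
print(example)
```

The resulting string would then be passed as the prompt to whichever GGUF runtime is in use. Whether the model expects exactly this whitespace is not stated in the README.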