dranger003/starcoder2-15b-GGUF

dranger003 committed on Mar 1

Commit 2069895 (1 parent: 03220f1)

Update README.md

Files changed (1)

README.md +2 -2

@@ -9,6 +9,6 @@ GGUF quants for https://huggingface.co/bigcode/starcoder2-15b
 > StarCoder2-15B model is a 15B parameter model trained on 600+ programming languages from The Stack v2, with opt-out requests excluded. The model uses Grouped Query Attention, a context window of 16,384 tokens with a sliding window attention of 4,096 tokens, and was trained using the Fill-in-the-Middle objective on 4+ trillion tokens.
-| Layers | Context | Template |
+| Layers | Context | Template (None/Base Model) |
 | --- | --- | --- |
-| <pre>40</pre> | <pre>16384</pre> | <pre>{context}<br><br>Code Editing Instruction: {prompt}<br>{response}</pre> |
+| <pre>40</pre> | <pre>16384</pre> | <pre>{prompt}</pre> |
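
The practical effect of this change can be sketched in a few lines of Python. This is an illustration only: it assumes each `<br>` in the old table cell stands for a newline, and `render` is a hypothetical helper, not part of llama.cpp or any other library. The old template wrapped the prompt in an instruction scaffold, while the new one passes the prompt to the base model unchanged.

```python
# Templates as listed in the two versions of the README table.
# Assumption: each <br> in the old table cell represents a newline.
OLD_TEMPLATE = "{context}\n\nCode Editing Instruction: {prompt}\n{response}"
NEW_TEMPLATE = "{prompt}"  # base model: no instruction scaffold


def render(template: str, **fields: str) -> str:
    """Fill the placeholders of a template (hypothetical helper)."""
    return template.format(**fields)


# With the new template, the text sent to the model is the prompt itself,
# which suits plain code completion with a base model.
base_prompt = render(NEW_TEMPLATE, prompt="def fibonacci(n):\n")

# With the old template, the same kind of request was wrapped as an edit instruction.
old_prompt = render(
    OLD_TEMPLATE,
    context="x = 1",
    prompt="rename x to count",
    response="",
)
```

Under the new template, whatever string is supplied as `prompt` is exactly what the model is asked to continue.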