dranger003 commited on
Commit
bb0ee84
1 Parent(s): 7141798

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -0
README.md CHANGED
@@ -1,3 +1,12 @@
1
  ---
2
  license: bigcode-openrail-m
 
 
3
  ---
 
 
 
 
 
 
 
 
1
  ---
2
  license: bigcode-openrail-m
3
+ pipeline_tag: text-generation
4
+ library_name: gguf
5
  ---
6
+ GGUF quants for https://huggingface.co/bigcode/starcoder2-15b
7
+
8
+ > StarCoder2-15B model is a 15B parameter model trained on 600+ programming languages from The Stack v2, with opt-out requests excluded. The model uses Grouped Query Attention, a context window of 16,384 tokens with a sliding window attention of 4,096 tokens, and was trained using the Fill-in-the-Middle objective on 4+ trillion tokens.
9
+
10
+ | Layers | Context | [Template (Text Representation)](https://github.com/ContextualAI/gritlm?tab=readme-ov-file#inference) | [Template (Text Generation)](https://github.com/ContextualAI/gritlm?tab=readme-ov-file#inference) |
11
+ | --- | --- | --- | --- |
12
+ | <pre>40</pre> | <pre>16384</pre> | <pre>{context}<br><br>Code Editing Instruction: {prompt}<br>{response}</pre> |