Commit 2069895 by dranger003
Parent(s): 03220f1

Update README.md

README.md CHANGED
@@ -9,6 +9,6 @@ GGUF quants for https://huggingface.co/bigcode/starcoder2-15b
 
 > StarCoder2-15B model is a 15B parameter model trained on 600+ programming languages from The Stack v2, with opt-out requests excluded. The model uses Grouped Query Attention, a context window of 16,384 tokens with a sliding window attention of 4,096 tokens, and was trained using the Fill-in-the-Middle objective on 4+ trillion tokens.
 
-| Layers | Context | Template |
+| Layers | Context | Template (None/Base Model) |
 | --- | --- | --- |
-| <pre>40</pre> | <pre>16384</pre> | <pre>{
+| <pre>40</pre> | <pre>16384</pre> | <pre>{prompt}</pre> |
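
The new table row gives the template as a bare `{prompt}`, which suggests the base model expects raw text with no chat wrapper. A minimal sketch of what that implies for a caller, assuming plain string substitution; the names `TEMPLATE` and `render` are hypothetical and not part of the repository:

```python
# Hypothetical helper: TEMPLATE and render are illustrative names only.
TEMPLATE = "{prompt}"  # template string as shown in the README table


def render(prompt: str) -> str:
    """Fill the template; with a bare {prompt} the text passes through unchanged."""
    # str.replace is used instead of str.format so that braces inside
    # source-code prompts are never interpreted as format fields.
    return TEMPLATE.replace("{prompt}", prompt)


if __name__ == "__main__":
    print(render("def fib(n):"))
```

Because the template contains nothing but the placeholder, the rendered text equals the input, so any prefix or suffix tokens would have to be added by the caller.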