amezasor commited on
Commit
102261f
1 Parent(s): 257d697
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -327,7 +327,7 @@ Granite-3.0-8B-Instruct is based on a decoder-only dense transformer architectur
327
  | Initialization std | 0.1 | **0.1** | 0.1 | 0.1 |
328
  | Sequence Length | 4096 | **4096** | 4096 | 4096 |
329
  | Position Embedding | RoPE | **RoPE** | RoPE | RoPE |
330
- | # Paremeters | 2.5B | **8.1B** | 1.3B | 3.3B |
331
  | # Active Parameters | 2.5B | **8.1B** | 400M | 800M |
332
  | # Training tokens | 12T | **12T** | 10T | 10T |
333
 
 
327
  | Initialization std | 0.1 | **0.1** | 0.1 | 0.1 |
328
  | Sequence Length | 4096 | **4096** | 4096 | 4096 |
329
  | Position Embedding | RoPE | **RoPE** | RoPE | RoPE |
330
+ | # Parameters | 2.5B | **8.1B** | 1.3B | 3.3B |
331
  | # Active Parameters | 2.5B | **8.1B** | 400M | 800M |
332
  | # Training tokens | 12T | **12T** | 10T | 10T |
333