Pernekhan commited on
Commit
5f7a136
1 Parent(s): eece8c4

Create quantize_config.json

Browse files

This is to make it work with engines like vLLM

Files changed (1) hide show
  1. quantize_config.json +6 -0
quantize_config.json ADDED
@@ -0,0 +1,6 @@
 
 
 
 
 
 
 
1
+ {
2
+ "bits": 4,
3
+ "desc_act": false,
4
+ "group_size": 128,
5
+ "true_sequential": true
6
+ }