Phil
cduk
ยท
AI & ML interests
None yet
Organizations
None yet
cduk's activity
Why 72B model has different vocab size comparing with other models?
5
#1 opened 8 months ago
by
Mikasaka
intermediate_size which was incompatible with VLLM parallel inference
#1 opened 14 days ago
by
cduk
Instruct version
#1 opened 3 months ago
by
cduk
Different vocab size and speculative decoding
#4 opened 5 months ago
by
cduk
Some weights of LlamaForCausalLM were not initialized from the model checkpoint
6
#3 opened 6 months ago
by
giorgiopiatti
Yi 1.5 34B Chat Int8 GPTQ
1
#8 opened 5 months ago
by
cduk