neuralmagic 's Collections

INT4 LLMs for vLLM

Accurate INT4 quantized models by Neural Magic, ready for use with vLLM!