FP8 LLMs for vLLM Collection Accurate FP8 quantized models by Neural Magic, ready for use with vLLM! • 43 items • Updated 1 day ago • 52
Llama-3.2 Quantization Collection Llama 3.2 models quantized by Neural Magic • 9 items • Updated 3 days ago • 5