Edit model card

Model Card for Model ID

This is the quantized version of Llama3.1-8B using bitsandbytes. More quantized LLMs coming soon...

Model Description

Model Source

Downloads last month
8
Safetensors
Model size
4.65B params
Tensor type
BF16
F32
U8
Inference Examples
Inference API (serverless) is not available, repository is disabled.