akshathmangudi
/

llama3.1-8b-quantized

Text Generation

text-generation-inference

Inference Endpoints

4-bit precision

Model card Files Files and versions Community

Edit model card

Model Card for Model ID

This is the quantized version of Llama3.1-8B using bitsandbytes. More quantized LLMs coming soon...

Model Description

Developed by: Meta
Quantized by: Akshath Mangudi
My GitHub: https://github.com/akshathmangudi
My LinkedIn: https://www.linkedin.com/in/akshathmangudi/
License: llama3.1

Model Source

Repository: https://huggingface.co/meta-llama/Meta-Llama-3.1-8B

Downloads last month: 8

Safetensors

Model size

4.65B params

Tensor type

BF16

·

F32

·

U8

·

Inference Examples

Text Generation

Inference API (serverless) is not available, repository is disabled.