Llama-3-Taiwan-8B-Instruct - GPTQ
- Model creator: Yen-Ting Lin
- Original model: Llama-3-Taiwan-8B-Instruct
Description
This repo contains GPTQ model files for Llama-3-Taiwan-8B-Instruct.
Quantization parameter
- Bits : 4
- Group Size : 128
- Act Order : Yes
- Damp % : 0.1
- Seq Len : 2048
- Size : 5.34 GB
It tooks about 50 mins to quantize on H100.
- Downloads last month
- 3
This model does not have enough activity to be deployed to Inference API (serverless) yet.
Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.