Edit model card

Exllama v2 Quantizations of Tess-v2.5.2-Qwen2-72B

Using turboderp's ExLlamaV2 v0.0.21 for quantization.

Original model: https://huggingface.co/migtissera/Tess-v2.5.2-Qwen2-72B

Downloads last month
1
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.