Edit model card

FP16 model of airoboros 70b 1.4.1 (https://huggingface.co/jondurbin/airoboros-l2-70b-gpt4-1.4.1) from .bin to . safetensors, to be used to quant on exllama2.

It can also be used to load faster at FP16 using transformers.

There is a script inside bin2safetensors folder, that you can use to convert .bin files into .safetensor ones for other models.

Also, I included 2 measurements.json to be used to quant. First one (called old) was made with https://huggingface.co/datasets/EleutherAI/the_pile_deduplicated/blob/refs%2Fconvert%2Fparquet/default/train/0000.parquet and first exllamav2 version, and the second one is a cleaned pippa, with good formatting on 17/09/2023 exllamav2.

Downloads last month
13
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.