Quantization made by Richard Erkhov.

SOLAR-13B-Instruct-v1.0 - GGUF

Model creator: https://huggingface.co/vicgalle/
Original model: https://huggingface.co/vicgalle/SOLAR-13B-Instruct-v1.0/

Name	Quant method	Size
SOLAR-13B-Instruct-v1.0.Q2_K.gguf	Q2_K	4.33GB
SOLAR-13B-Instruct-v1.0.IQ3_XS.gguf	IQ3_XS	4.81GB
SOLAR-13B-Instruct-v1.0.IQ3_S.gguf	IQ3_S	5.07GB
SOLAR-13B-Instruct-v1.0.Q3_K_S.gguf	Q3_K_S	5.04GB
SOLAR-13B-Instruct-v1.0.IQ3_M.gguf	IQ3_M	5.24GB
SOLAR-13B-Instruct-v1.0.Q3_K.gguf	Q3_K	5.62GB
SOLAR-13B-Instruct-v1.0.Q3_K_M.gguf	Q3_K_M	5.62GB
SOLAR-13B-Instruct-v1.0.Q3_K_L.gguf	Q3_K_L	6.11GB
SOLAR-13B-Instruct-v1.0.IQ4_XS.gguf	IQ4_XS	6.3GB
SOLAR-13B-Instruct-v1.0.Q4_0.gguf	Q4_0	6.57GB
SOLAR-13B-Instruct-v1.0.IQ4_NL.gguf	IQ4_NL	6.64GB
SOLAR-13B-Instruct-v1.0.Q4_K_S.gguf	Q4_K_S	6.62GB
SOLAR-13B-Instruct-v1.0.Q4_K.gguf	Q4_K	6.99GB
SOLAR-13B-Instruct-v1.0.Q4_K_M.gguf	Q4_K_M	6.99GB
SOLAR-13B-Instruct-v1.0.Q4_1.gguf	Q4_1	7.29GB
SOLAR-13B-Instruct-v1.0.Q5_0.gguf	Q5_0	8.01GB
SOLAR-13B-Instruct-v1.0.Q5_K_S.gguf	Q5_K_S	8.01GB
SOLAR-13B-Instruct-v1.0.Q5_K.gguf	Q5_K	8.22GB
SOLAR-13B-Instruct-v1.0.Q5_K_M.gguf	Q5_K_M	8.22GB
SOLAR-13B-Instruct-v1.0.Q5_1.gguf	Q5_1	8.73GB
SOLAR-13B-Instruct-v1.0.Q6_K.gguf	Q6_K	9.53GB
SOLAR-13B-Instruct-v1.0.Q8_0.gguf	Q8_0	12.35GB

Original model description:

license: apache-2.0 tags: - mergekit - merge - solar base_model: - upstage/SOLAR-10.7B-Instruct-v1.0 model-index: - name: SOLAR-13B-Instruct-v1.0 results: - task: type: text-generation name: Text Generation dataset: name: AI2 Reasoning Challenge (25-Shot) type: ai2_arc config: ARC-Challenge split: test args: num_few_shot: 25 metrics: - type: acc_norm value: 57.25 name: normalized accuracy source: url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=vicgalle/SOLAR-13B-Instruct-v1.0 name: Open LLM Leaderboard - task: type: text-generation name: Text Generation dataset: name: HellaSwag (10-Shot) type: hellaswag split: validation args: num_few_shot: 10 metrics: - type: acc_norm value: 78.03 name: normalized accuracy source: url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=vicgalle/SOLAR-13B-Instruct-v1.0 name: Open LLM Leaderboard - task: type: text-generation name: Text Generation dataset: name: MMLU (5-Shot) type: cais/mmlu config: all split: test args: num_few_shot: 5 metrics: - type: acc value: 55.75 name: accuracy source: url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=vicgalle/SOLAR-13B-Instruct-v1.0 name: Open LLM Leaderboard - task: type: text-generation name: Text Generation dataset: name: TruthfulQA (0-shot) type: truthful_qa config: multiple_choice split: validation args: num_few_shot: 0 metrics: - type: mc2 value: 61.99 source: url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=vicgalle/SOLAR-13B-Instruct-v1.0 name: Open LLM Leaderboard - task: type: text-generation name: Text Generation dataset: name: Winogrande (5-shot) type: winogrande config: winogrande_xl split: validation args: num_few_shot: 5 metrics: - type: acc value: 70.24 name: accuracy source: url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=vicgalle/SOLAR-13B-Instruct-v1.0 name: Open LLM Leaderboard - task: type: text-generation name: Text Generation dataset: name: GSM8k (5-shot) type: gsm8k config: main split: test args: num_few_shot: 5 metrics: - type: acc value: 16.6 name: accuracy source: url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=vicgalle/SOLAR-13B-Instruct-v1.0 name: Open LLM Leaderboard

SOLAR-13B-Instruct-v1.0

This is SOLAR-10.7B, but upscaled to 13B, to optimize VRAM usage of typical GPU cards (a 4bit quant fits in 12GB).

Evaluations coming soon!

This is a frankenmerge model created using mergekit.

Merge Details

Merge Method

This model was merged using the passthrough merge method.

Models Merged

The following models were included in the merge:

upstage/SOLAR-10.7B-Instruct-v1.0

Configuration

The following YAML configuration was used to produce this model:

slices:
  - sources:
    - model: upstage/SOLAR-10.7B-Instruct-v1.0
      layer_range: [0, 28]
  - sources:
    - model: upstage/SOLAR-10.7B-Instruct-v1.0
      layer_range: [20, 48]
merge_method: passthrough
dtype: float16

Prompt template

The same as in SOLAR-10.7B:

<s> ### User:
{prompt}

### Assistant:
{response}</s>

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	56.65
AI2 Reasoning Challenge (25-Shot)	57.25
HellaSwag (10-Shot)	78.03
MMLU (5-Shot)	55.75
TruthfulQA (0-shot)	61.99
Winogrande (5-shot)	70.24
GSM8k (5-shot)	16.60