Files changed (1)
  1. README.md +15 -1
README.md CHANGED
@@ -1,6 +1,5 @@
 ---
 license: apache-2.0
-base_model: mistralai/Mistral-7B-v0.1
 tags:
 - alignment-handbook
 - generated_from_trainer
@@ -9,6 +8,7 @@ tags:
 - generated_from_trainer
 datasets:
 - HuggingFaceH4/ultrachat_200k
+base_model: mistralai/Mistral-7B-v0.1
 model-index:
 - name: zephyr-7b-sft-full
   results: []
@@ -66,3 +66,17 @@ The following hyperparameters were used during training:
 - Pytorch 2.1.2+cu121
 - Datasets 2.14.6
 - Tokenizers 0.15.0
+
+# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
+Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_alignment-handbook__zephyr-7b-sft-full)
+
+| Metric |Value|
+|---------------------------------|----:|
+|Avg. |57.56|
+|AI2 Reasoning Challenge (25-Shot)|57.68|
+|HellaSwag (10-Shot) |80.82|
+|MMLU (5-Shot) |60.31|
+|TruthfulQA (0-shot) |41.71|
+|Winogrande (5-shot) |76.09|
+|GSM8k (5-shot) |28.73|
+
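The frontmatter change above only relocates `base_model: mistralai/Mistral-7B-v0.1` below the `datasets` list; the parsed metadata is unchanged. A minimal sketch of reading that field back, assuming the `huggingface_hub` `ModelCard` API and the `alignment-handbook/zephyr-7b-sft-full` repo id:

```python
from huggingface_hub import ModelCard

# Load the model card (this README.md) from the Hub and parse its YAML frontmatter.
card = ModelCard.load("alignment-handbook/zephyr-7b-sft-full")
meta = card.data.to_dict()

# Key order inside the frontmatter does not affect the parsed metadata.
print(meta.get("base_model"))  # expected: mistralai/Mistral-7B-v0.1
print(meta.get("datasets"))    # expected: ['HuggingFaceH4/ultrachat_200k']
```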
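The `Avg.` row in the added table is consistent with the unweighted mean of the six benchmark scores. A quick check in plain Python, with the scores copied from the table above:

```python
# Leaderboard scores from the table added in this commit:
# ARC, HellaSwag, MMLU, TruthfulQA, Winogrande, GSM8k
scores = [57.68, 80.82, 60.31, 41.71, 76.09, 28.73]

avg = sum(scores) / len(scores)
print(round(avg, 2))  # 57.56 -- matches the Avg. row
```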