Update README.md
README.md
CHANGED
@@ -16,6 +16,16 @@ This model is a Mixture of Experts (MoE) made with [mergekit](https://github.com

## 🏆 Evaluation

+Beyonder-4x7B-v2 is competitive with Mixtral-8x7B-Instruct-v0.1 on the Open LLM Leaderboard, while only having 4 experts instead of 8.
+
+![](https://i.imgur.com/5raBff0.png)
+
+It also displays a significant improvement over the individual experts.
+
+![](https://i.imgur.com/7Idwkb0.png)
+
+It also performs very well compared to other models on the Nous benchmark suite. It's almost as good as the best Yi-34B fine-tune, despite being a much smaller model: 24.2B total parameters, with only two experts selected during inference (roughly 12B active), versus 34B.
+
| Model |AGIEval|GPT4All|TruthfulQA|Bigbench|Average|
|--------------------------------------------------------------------|------:|------:|---------:|-------:|------:|
|[**Beyonder-4x7B-v2**](https://huggingface.co/shadowml/Beyonder-4x7B-v2)| **45.29**| **75.95**| <u>**60.86**</u>| **46.4**| **57.13**|
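To make the total vs. active parameter comparison above concrete, here is a minimal back-of-the-envelope sketch for a 4x7B MoE with two experts routed per token. The per-expert size and the fraction of weights sitting in the routed MLP (expert) layers are assumed values for illustration, not exact figures for Beyonder-4x7B-v2.

```python
# Rough estimate of total vs. active parameters in a 4x7B Mistral-based MoE.
# All constants below are assumptions for illustration, not exact model sizes.

DENSE_PARAMS = 7.24e9   # assumed parameter count of one Mistral-7B expert
MLP_FRACTION = 0.70     # assumed share of those weights in the MLP (expert) layers
NUM_EXPERTS = 4         # experts merged into the 4x7B MoE
ACTIVE_EXPERTS = 2      # experts routed per token at inference

shared = DENSE_PARAMS * (1 - MLP_FRACTION)  # attention/embeddings shared across experts
total = shared + NUM_EXPERTS * DENSE_PARAMS * MLP_FRACTION
active = shared + ACTIVE_EXPERTS * DENSE_PARAMS * MLP_FRACTION

print(f"total  ~ {total / 1e9:.1f}B parameters")   # in the ballpark of the 24.2B quoted above
print(f"active ~ {active / 1e9:.1f}B parameters")  # roughly the ~12B active at inference
```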