GPT4-X-Alpasta-30b / README.md
leaderboard-pr-bot's picture
Adding Evaluation Results
17fa51c
|
raw
history blame
1.5 kB

Dont be upsetti, here, have some spaghetti! Att: A'eala <3

Information

GPT4-X-Alpasta-30b working with Oobabooga's Text Generation Webui and KoboldAI.

This is an attempt at improving Open Assistant's performance as an instruct while retaining its excellent prose. The merge consists of Chansung's GPT4-Alpaca Lora and Open Assistant's native fine-tune.

Benchmarks

FP16

Wikitext2: 4.6077961921691895

Ptb-New: 9.41549301147461

C4-New: 6.98392915725708

Benchmarks brought to you by A'eala

# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard) Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_MetaIX__GPT4-X-Alpasta-30b)
Metric Value
Avg. 57.85
ARC (25-shot) 63.05
HellaSwag (10-shot) 83.56
MMLU (5-shot) 57.71
TruthfulQA (0-shot) 51.52
Winogrande (5-shot) 78.22
GSM8K (5-shot) 30.48
DROP (3-shot) 40.38