CobraMamba committed · Commit be03b95 · Parent: 4666fbc

Update README.md

README.md
---
language:
- en
library_name: transformers
tags:
- gpt
- llm
- large language model
inference: false
thumbnail: >-
  https://h2o.ai/etc.clientlibs/h2o/clientlibs/clientlib-site/resources/images/favicon.ico
license: apache-2.0
---

# Model Card

**The Best 3B Model! Surpassing dolly-v2-12b**

The best 3B model on MMLU (5-shot) on the [Open LLM Leaderboard](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard), with performance surpassing dolly-v2-12b.

| Metric              | Value |
|---------------------|-------|
| MMLU (5-shot)       | 30.0  |
| ARC (25-shot)       | 42.6  |
| HellaSwag (10-shot) | 71.0  |
| TruthfulQA (0-shot) | 37.3  |
| Avg.                | 45.2  |
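As a quick sanity check, the reported Avg. appears to be the unweighted mean of the four benchmark scores. The card does not state the averaging rule explicitly, so treating it as a simple arithmetic mean is an assumption:

```python
# Sanity-check the "Avg." row: assume it is the unweighted arithmetic
# mean of the four benchmark scores listed in the table above.
scores = {
    "MMLU (5-shot)": 30.0,
    "ARC (25-shot)": 42.6,
    "HellaSwag (10-shot)": 71.0,
    "TruthfulQA (0-shot)": 37.3,
}
avg = sum(scores.values()) / len(scores)
print(round(avg, 1))  # → 45.2, matching the table
```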

We use the state-of-the-art [Language Model Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness) to run the benchmark tests above.

The training code and data will be open-sourced later on [GitHub](https://github.com/chi2liu/mamba-gpt-3b).