CobraMamba committed · Commit 900f740 · Parent: be03b95

Update README.md

README.md CHANGED

@@ -31,3 +31,18 @@ We use state-of-the-art [Language Model Evaluation Harness](https://github.com/E
The training code and data will be open sourced later on GitHub (https://github.com/chi2liu/mamba-gpt-3b)
## Training Dataset

`mamba-gpt-3b-v4` is trained on multiple datasets:
- [Stanford Alpaca (en)](https://github.com/tatsu-lab/stanford_alpaca)
- [Open Assistant (multilingual)](https://huggingface.co/datasets/OpenAssistant/oasst1)
- [LIMA (en)](https://huggingface.co/datasets/GAIR/lima)
- [CodeAlpaca 20k (en)](https://huggingface.co/datasets/sahil2801/CodeAlpaca-20k)
- [GPT-4 Generated Data (en&zh)](https://github.com/Instruction-Tuning-with-GPT-4/GPT-4-LLM)
- [UltraChat (en)](https://github.com/thunlp/UltraChat)
43 |
+
|
44 |
+
|
45 |
+
## Summary
We have fine-tuned the open-llama model and surpassed the original model on multiple evaluation subtasks, making it currently the best-performing 3B model, with performance comparable to llama-7b.

- Base model: [openlm-research/open_llama_3b_v2](https://huggingface.co/openlm-research/open_llama_3b_v2)
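Since the model is instruction-tuned on Alpaca-style data, prompts are typically formatted with an instruction template before generation. The sketch below shows the standard Stanford Alpaca prompt template as an illustration; whether `mamba-gpt-3b-v4` uses exactly this template is an assumption here, so check the model card for the expected format before use.

```python
# Sketch: Alpaca-style prompt formatting for an instruction-tuned model.
# ASSUMPTION: mamba-gpt-3b-v4 follows the Stanford Alpaca template used by
# its training data; verify against the published model card.

def build_prompt(instruction: str, input_text: str = "") -> str:
    """Format an instruction (and optional context) in the Alpaca template."""
    if input_text:
        return (
            "Below is an instruction that describes a task, paired with an input "
            "that provides further context. Write a response that appropriately "
            "completes the request.\n\n"
            f"### Instruction:\n{instruction}\n\n"
            f"### Input:\n{input_text}\n\n"
            "### Response:\n"
        )
    return (
        "Below is an instruction that describes a task. Write a response that "
        "appropriately completes the request.\n\n"
        f"### Instruction:\n{instruction}\n\n"
        "### Response:\n"
    )

# The resulting string is what you would pass to the tokenizer before generation.
prompt = build_prompt("Summarize the benefits of small language models.")
print(prompt)
```

The model's completion is then read from the text generated after the final `### Response:` marker.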