Magpie Alignment's profile picture

Magpie Alignment

community

AI & ML interests

Transparent LLM alignment for all.

Organization Card
About org cards

Hi, I am a magpie 🐦!

πŸ•ΈοΈ Project Website: https://magpie-align.github.io/

πŸ“„ Technical Report: https://arxiv.org/abs/2406.08464

πŸ€— HF Paper Page: https://huggingface.co/papers/2406.08464

😬 Codes: https://github.com/magpie-align/magpie

πŸ€— Magpie Demo: https://huggingface.co/spaces/davanstrien/magpie (Thanks a lot for the implementation from @davanstrien!)

🐦 Chat with Magpie: https://huggingface.co/spaces/flydust/Chat-with-Magpie

[News] πŸ”₯ Our open-aligned model, Llama-3-8B-Magpie-Align-v0.1 is out! It is πŸ† the best <30B Model in AI2 WildBench Leaderboard! Even better than the official Meta-Llama-3-8B-Instruct model!

Dataset Navigation 🧭

Meta Llama 3

Model Name Dataset Type Description
Llama 3 70B Instruct Magpie-Pro-1M SFT 1M Raw conversations built with Meta Llama 3 70B.
Llama 3 70B Instruct Magpie-Pro-300K-Filtered SFT Apply a filter and select 300K high quality conversations.
Llama 3 70B Instruct Magpie-Pro-MT-300K SFT Select 300K difficult questions and extend to multi-turn conversations.
Llama 3 8B Instruct Magpie-Air-3M SFT 3M Raw conversations built with Meta Llama 3 8B.
Llama 3 8B Instruct Magpie-Air-300K-Filtered SFT Apply a filter and select 300K high quality data.
Llama 3 8B Instruct Magpie-Air-MT-300K SFT Select 300K difficult questions and extend to multi-turn conversations.

Qwen2

Model Name Dataset Type Description
Qwen2 72B Instruct Magpie-Qwen2-Pro-1M SFT 1M Raw conversations built with Qwen2 72B Instruct.
Qwen2 72B Instruct Magpie-Qwen2-Pro-300K-Filtered SFT Apply a filter and select 300K high quality conversations.
Qwen2 72B Instruct Magpie-Qwen2-Pro-200K-Chinese SFT Apply a filter and select 200K high quality Chinese conversations.
Qwen2 72B Instruct Magpie-Qwen2-Pro-200K-English SFT Apply a filter and select 200K high quality English conversations.
Qwen2 7B Instruct Magpie-Qwen2-Air-3M SFT 3M Raw conversations built with Qwen2 7B Instruct.
Qwen2 7B Instruct Magpie-Qwen2-Air-300K-Filtered SFT Apply a filter and select 300K high quality conversations.

Phi-3

Model Name Dataset Type Description
Phi-3 Medium Instruct Magpie-Phi3-Pro-1M SFT 1M Raw conversations built with Phi-3 Medium Instruct.
Phi-3 Medium Instruct Magpie-Phi3-Pro-300K-Filtered SFT Apply a filter and select 300K high quality conversations.