Minitron Collection A family of compressed models obtained via pruning and knowledge distillation • 6 items • Updated 25 days ago • 53
Article Run the strongest open-source LLM model: Llama3 70B with just a single 4GB GPU! By lyogavin • Apr 21 • 41
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences Paper • 2404.03715 • Published Apr 4 • 59