pretrain data selectection Data Selection via Optimal Control for Language Models Paper • 2410.07064 • Published 13 days ago • 8
Data Selection via Optimal Control for Language Models Paper • 2410.07064 • Published 13 days ago • 8
llm math LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning Paper • 2410.02884 • Published 19 days ago • 11
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning Paper • 2410.02884 • Published 19 days ago • 11