Submitted by Sylvestre 39 Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion · 6 authors 1
Submitted by philschmid 33 Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models · 6 authors 1
Submitted by zuom 14 Planetarium: A Rigorous Benchmark for Translating Text to Structured Planning Languages · 5 authors 1