Submitted by akhaliq 46 Teaching Large Language Models to Reason with Reinforcement Learning · 9 authors 2
Submitted by akhaliq 40 PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation · 10 authors 1
Submitted by akhaliq 38 Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference · 11 authors 1
Submitted by akhaliq 21 LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error · 5 authors 1
Submitted by akhaliq 16 Common 7B Language Models Already Possess Strong Math Capabilities · 8 authors 1
Submitted by akhaliq 3 Radiative Gaussian Splatting for Efficient X-ray Novel View Synthesis · 8 authors 1