Submitted by akhaliq 52 Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models · 4 authors 6
Submitted by akhaliq 43 ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal Models · 9 authors 7
Submitted by akhaliq 33 Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization · 4 authors 1
Submitted by akhaliq 23 Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training · 8 authors 1
Submitted by akhaliq 21 AutoCoder: Enhancing Code Large Language Model with AIEV-Instruct · 3 authors 9
Submitted by akhaliq 14 CraftsMan: High-fidelity Mesh Generation with 3D Native Generation and Interactive Geometry Refiner · 7 authors 2
Submitted by akhaliq 12 Automatic Data Curation for Self-Supervised Learning: A Clustering-Based Approach · 13 authors
Submitted by akhaliq 11 Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition · 6 authors
Submitted by akhaliq 11 Data Mixing Made Efficient: A Bivariate Scaling Law for Language Model Pretraining · 5 authors
Submitted by akhaliq 5 HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian Splatting · 7 authors