Cool Papers - a Subuday Collection

Subuday 's Collections

Cool Papers

updated Feb 28

BlockFusion: Expandable 3D Scene Generation using Latent Tri-plane Extrapolation

Paper • 2401.17053 • Published Jan 30 • 30
Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks

Paper • 2402.04248 • Published Feb 6 • 29
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5 • 67
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue

Paper • 2402.05930 • Published Feb 8 • 39
Animated Stickers: Bringing Stickers to Life with Video Diffusion

Paper • 2402.06088 • Published Feb 8 • 9
HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting

Paper • 2402.06149 • Published Feb 9 • 17
BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data

Paper • 2402.08093 • Published Feb 12 • 54
Neural Network Diffusion

Paper • 2402.13144 • Published Feb 20 • 94
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22 • 108
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27 • 592