In-Context Pretraining: Language Modeling Beyond Document Boundaries Paper • 2310.10638 • Published Oct 16, 2023 • 28
BitNet: Scaling 1-bit Transformers for Large Language Models Paper • 2310.11453 • Published Oct 17, 2023 • 96
From Word Models to World Models: Translating from Natural Language to the Probabilistic Language of Thought Paper • 2306.12672 • Published Jun 22, 2023 • 26