Haoze Wu's picture

1 4

Haoze Wu

WaitHZ

https://waithz.github.io/

AI & ML interests

Modular DL, Complex Reasoning

Organizations

None yet

WaitHZ's activity

upvoted 2 papers 11 days ago

Benchmarking Chinese Knowledge Rectification in Large Language Models

Paper • 2409.05806 • Published 12 days ago • 14

OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs

Paper • 2409.05152 • Published 13 days ago • 28

upvoted a paper 2 months ago

GW-MoE: Resolving Uncertainty in MoE Router with Global Workspace Theory

Paper • 2406.12375 • Published Jun 18 • 1

upvoted a paper 6 months ago

Mixture-of-Depths: Dynamically allocating compute in transformer-based language models

Paper • 2404.02258 • Published Apr 2 • 103