BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks • 18 days ago
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions • Paper • 2406.15877 • Published 13 days ago