Sibyl: Simple yet Effective Agent Framework for Complex Real-world Reasoning Paper • 2407.10718 • Published 7 days ago • 11
OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement Paper • 2402.14658 • Published Feb 22 • 79
RoleEval: A Bilingual Role Evaluation Benchmark for Large Language Models Paper • 2312.16132 • Published Dec 26, 2023 • 2
ChatMusician: Understanding and Generating Music Intrinsically with LLM Paper • 2402.16153 • Published Feb 25 • 55