BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks 21 days ago • 31
Agentless: Demystifying LLM-based Software Engineering Agents Paper • 2407.01489 • Published 7 days ago • 36
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale Paper • 2406.17557 • Published 14 days ago • 73
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions Paper • 2406.15877 • Published 17 days ago • 42
Article Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models 15 days ago • 130
Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations Paper • 2405.18392 • Published May 28 • 12
Leaderboards and benchmarks ✨ Collection Cool leaderboard spaces collection for models across modalities! Text, vision, audio, ... • 64 items • Updated 28 days ago • 69
Article StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation Apr 29 • 70
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent Paper • 2402.09844 • Published Feb 15 • 20
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Paper • 2404.14219 • Published Apr 22 • 241
DoGE: Domain Reweighting with Generalization Estimation Paper • 2310.15393 • Published Oct 23, 2023 • 1
Comparing DPO with IPO and KTO Collection A collection of chat models to explore the differences between three alignment techniques: DPO, IPO, and KTO. • 56 items • Updated Jan 9 • 31
Possible Meissner effect near room temperature in copper-substituted lead apatite Paper • 2401.00999 • Published Jan 2 • 5
Improving Text Embeddings with Large Language Models Paper • 2401.00368 • Published Dec 31, 2023 • 77
Mamba: Linear-Time Sequence Modeling with Selective State Spaces Paper • 2312.00752 • Published Dec 1, 2023 • 132
⭐ StarCoder Collection All models, datasets, and demos related to StarCoder! • 11 items • Updated Feb 27 • 20
Zephyr 7B Collection Models, datasets, and demos associated with Zephyr 7B. For code to train the models, see: https://github.com/huggingface/alignment-handbook • 9 items • Updated Apr 12 • 142
Llama 2: Open Foundation and Fine-Tuned Chat Models Paper • 2307.09288 • Published Jul 18, 2023 • 237
RepoFusion: Training Code Models to Understand Your Repository Paper • 2306.10998 • Published Jun 19, 2023 • 14