-
CodeFusion: A Pre-trained Diffusion Model for Code Generation
Paper • 2310.17680 • Published • 69 -
What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning
Paper • 2312.15685 • Published • 17 -
LLaMA Beyond English: An Empirical Study on Language Capability Transfer
Paper • 2401.01055 • Published • 53 -
Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws
Paper • 2401.00448 • Published • 27
Sergei Averkiev
averoo
AI & ML interests
None yet
Organizations
Collections
1
models
None public yet
datasets
25
averoo/test_set_ru_v1
Viewer
•
Updated
•
2.48k
•
2
averoo/baby_mmlu2_good
Viewer
•
Updated
•
528
•
2
averoo/rucc_wiki_v1_mmlu_filtered_ru2_good
Viewer
•
Updated
•
1.95k
•
2
averoo/rucc_wiki_v1_mmlu_filtered_en2
Viewer
•
Updated
•
6.17k
•
2
averoo/rucc_wiki_v1_mmlu_filtered_ru2
Viewer
•
Updated
•
6.17k
•
2
averoo/rucc_wiki_v1_mmlu_filtered_en
Viewer
•
Updated
•
1.88k
•
2
averoo/rucc_wiki_v1_mmlu_filtered_ru
Viewer
•
Updated
•
1.88k
•
2
averoo/rucc_wiki_v1_mmlu
Viewer
•
Updated
•
9.95k
•
2
averoo/sumi_test2
Viewer
•
Updated
•
1.93k
•
2
averoo/sumi_test
Viewer
•
Updated
•
20
•
2