-
CodeFusion: A Pre-trained Diffusion Model for Code Generation
Paper • 2310.17680 • Published • 68 -
What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning
Paper • 2312.15685 • Published • 16 -
LLaMA Beyond English: An Empirical Study on Language Capability Transfer
Paper • 2401.01055 • Published • 50 -
Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws
Paper • 2401.00448 • Published • 25
Sergei Averkiev
averoo
AI & ML interests
None yet
Organizations
Collections
1
models
None public yet
datasets
9
averoo/baby_mmlu2_pt
Viewer
•
Updated
•
7
averoo/baby_mmlu2_kk
Viewer
•
Updated
•
7
averoo/baby_mmlu2_be
Viewer
•
Updated
•
1
averoo/baby_mmlu2
Viewer
•
Updated
•
747
•
1
averoo/fake_mmlu
Viewer
•
Updated
•
4
averoo/fake_mmlu_en
Viewer
•
Updated
•
6
averoo/fake_mmlu_ru
Viewer
•
Updated
averoo/baby_mmlu
Viewer
•
Updated
•
2
averoo/lurk
Viewer
•
Updated
•
4
•
3