arxiv:2403.08763
Benjamin Therien
btherien
AI & ML interests
Passionate about machine learning research! Currently working on efficient foundation model pre-training.
Organizations
Papers
1
models
20
btherien/Model_-7-1B_It_-132366_Tr_-slim-pajama-300B-replay5_finetune-sft-full
Text Generation
•
Updated
•
4
btherien/410M_it-132366_Tr-slim-pajama-300B-replay5_finetune_dpo-full
Text Generation
•
Updated
•
5
btherien/410M_it-132366_Tr-slim-pajama-300B-replay5_finetune_sft-full
Text Generation
•
Updated
•
4
btherien/410M_it-86245_tr-german-only_scratch-sft-full
Text Generation
•
Updated
•
4
btherien/zephyr-7b-sft-full
Text Generation
•
Updated
•
4
btherien/JOB-3289974_410M_it-132366_InvSqrt_tr-slim-pajama-300B_scratch
Text Generation
•
Updated
•
5
btherien/JOB-3291751_410M_it-132366_ConsineInf_tr-slim-pajama-300B_scratch
Text Generation
•
Updated
•
4
btherien/JOB-3292571_410M_it-44229_ConsineInf_tr-slim-pajama-100B-3_scratch
Text Generation
•
Updated
•
1
btherien/JOB-3312386_410M_it-86245_tr-german-only_scratch
Text Generation
•
Updated
btherien/JOB-3312838_410M_it-86245_tr-german-replay-25_scratch
Text Generation
•
Updated
datasets
None public yet