EPFL-CS-552-Group-Model
Collection
Models presented for the Group Model (benchmarks: General Knowledge, Multilinguality, Maths, Safety). 4 items.
GRPO training applied to the math model, using the Multilinguality and Knowledge GRPO datasets. Checkpoint 1000 of that run was then merged back with the math model.
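The note above does not say how the checkpoint was merged with the math model. As a rough illustration only, one common approach is linear interpolation of matching parameters. The sketch below assumes that approach and uses plain Python lists in place of real tensors; the function name `merge_weights` and the mixing factor `alpha` are hypothetical and do not come from the collection.

```python
from typing import Dict, List

# Toy stand-in for a model state dict: parameter name -> flat list of values.
Weights = Dict[str, List[float]]


def merge_weights(base: Weights, tuned: Weights, alpha: float = 0.5) -> Weights:
    """Linearly interpolate two checkpoints: (1 - alpha) * base + alpha * tuned.

    ASSUMPTION: linear interpolation is used here purely for illustration;
    the actual merge method for this model is not documented in the source.
    """
    if base.keys() != tuned.keys():
        raise ValueError("checkpoints must share the same parameter names")
    merged: Weights = {}
    for name, base_vals in base.items():
        tuned_vals = tuned[name]
        if len(base_vals) != len(tuned_vals):
            raise ValueError(f"size mismatch for parameter {name!r}")
        merged[name] = [
            (1 - alpha) * b + alpha * t for b, t in zip(base_vals, tuned_vals)
        ]
    return merged


# Hypothetical usage: "math" plays the role of the math model and
# "grpo_ckpt" the role of GRPO checkpoint 1000.
math = {"layer.weight": [0.0, 2.0]}
grpo_ckpt = {"layer.weight": [2.0, 4.0]}
merged = merge_weights(math, grpo_ckpt, alpha=0.5)
```

With `alpha=0` the result equals the math model, and with `alpha=1` it equals the GRPO checkpoint; values in between trade off the two.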