Automated MNLP evaluation report (2026-05-18)

#1

Posted by the course CI pipeline.

zechen-nlp changed pull request title from Automated MNLP evaluation report (2026-05-13) to Automated MNLP evaluation report (2026-05-16)
zechen-nlp changed pull request title from Automated MNLP evaluation report (2026-05-16) to Automated MNLP evaluation report (2026-05-17)
zechen-nlp changed pull request title from Automated MNLP evaluation report (2026-05-17) to Automated MNLP evaluation report (2026-05-18)
cs-552-2026-4neurons org

Accuracy: 16 base model, 31 first SFT

leonardamsler changed pull request status to merged

Sign up or log in to comment