Niels PRO

nielsp

AI & ML interests

LLM, NLP, Datasets, Fine tuning

Recent Activity

reacted to mkurman's post with 👍 about 2 months ago

I've been working on something cool: a GRPO with an LLM evaluator that can also perform SFT on the feedback data, if you want. Check it out 😊 Any 🌟 are more than welcome 🤗 https://github.com/mkurman/grpo-llm-evaluator
