Merge branch 'main' of https://huggingface.co/p3nGu1nZz/Tau bdb5628 p3nGu1nZz commited on Sep 19, 2024
added latest ppo runs A1, A2, and A3. Each used 500, 2500, and 4800 training messages respectively. no other settings or config was modified. fba91ab p3nGu1nZz commited on Sep 19, 2024