Hello!I'm reading the source code with the paper and I do not see the part that implements the C-RLFT into the loss optimization. Sorry if it's obvious, I'm a beginner in reinforcement learning!
· Sign up or log in to comment