migueldeguzmandev/falcon-1b-rw-RLLMv3-1
Text Generation
This collection presents the 10 RLLM steps (training runs) intended to make Falcon-RW-1B's outputs more coherent and polite.