deepfish-16b-0.0.1 / README.md
gilbertomarcano's picture
Trained with Unsloth
66353d8 verified
metadata
license: mit
tags:
  - unsloth
  - trl
  - grpo