Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
gilbertomarcano
/
deepfish-16b-0.0.1
like
0
PyTorch
llama
unsloth
trl
grpo
License:
mit
Model card
Files
Files and versions
xet
Community
1
main
deepfish-16b-0.0.1
/
README.md
gilbertomarcano
Trained with Unsloth
66353d8
verified
5 months ago
preview
code
|
raw
Copy download link
history
blame
contribute
delete
57 Bytes
metadata
license:
mit
tags:
-
unsloth
-
trl
-
grpo