Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
yuta0x89
/
llmjp13b-numinacot-epoch2-GRPO
like
0
Text Generation
Transformers
Safetensors
llama
Generated from Trainer
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
Train
Deploy
Use this model
main
llmjp13b-numinacot-epoch2-GRPO
/
model-00003-of-00006.safetensors
Commit History
Training in progress, step 646
0db5901
verified
yuta0x89
commited on
Feb 11
Training in progress, step 640
0490261
verified
yuta0x89
commited on
Feb 11
Training in progress, step 630
efff898
verified
yuta0x89
commited on
Feb 11
Training in progress, step 620
e551e43
verified
yuta0x89
commited on
Feb 11
Training in progress, step 610
6e9d1d5
verified
yuta0x89
commited on
Feb 11
Training in progress, step 600
ab94201
verified
yuta0x89
commited on
Feb 11
Training in progress, step 590
85edb88
verified
yuta0x89
commited on
Feb 11
Training in progress, step 580
937a32a
verified
yuta0x89
commited on
Feb 10
Training in progress, step 570
45974ee
verified
yuta0x89
commited on
Feb 10
Training in progress, step 560
fc4e9e1
verified
yuta0x89
commited on
Feb 10
Training in progress, step 550
f7c2b2d
verified
yuta0x89
commited on
Feb 10
Training in progress, step 540
eb18fde
verified
yuta0x89
commited on
Feb 10
Training in progress, step 530
3e115e5
verified
yuta0x89
commited on
Feb 10
Training in progress, step 520
f167dc4
verified
yuta0x89
commited on
Feb 10
Training in progress, step 510
87f1d88
verified
yuta0x89
commited on
Feb 10
Training in progress, step 500
7f5b5ba
verified
yuta0x89
commited on
Feb 10
Training in progress, step 490
171b70c
verified
yuta0x89
commited on
Feb 10
Training in progress, step 480
a11e6f4
verified
yuta0x89
commited on
Feb 9
Training in progress, step 470
30a7b87
verified
yuta0x89
commited on
Feb 9
Training in progress, step 460
1993a1e
verified
yuta0x89
commited on
Feb 9
Training in progress, step 450
2ee618c
verified
yuta0x89
commited on
Feb 9
Training in progress, step 440
cce3fac
verified
yuta0x89
commited on
Feb 9
Training in progress, step 430
48e05b3
verified
yuta0x89
commited on
Feb 9
Training in progress, step 420
910cf3a
verified
yuta0x89
commited on
Feb 9
Training in progress, step 410
01b6921
verified
yuta0x89
commited on
Feb 9
Training in progress, step 400
72d89f2
verified
yuta0x89
commited on
Feb 9
Training in progress, step 390
d1419f1
verified
yuta0x89
commited on
Feb 9
Training in progress, step 380
abee19c
verified
yuta0x89
commited on
Feb 8
Training in progress, step 370
7630c6c
verified
yuta0x89
commited on
Feb 8
Training in progress, step 360
af55d91
verified
yuta0x89
commited on
Feb 8
Training in progress, step 350
df40391
verified
yuta0x89
commited on
Feb 8
Training in progress, step 340
f715a8b
verified
yuta0x89
commited on
Feb 8
Training in progress, step 330
8425273
verified
yuta0x89
commited on
Feb 8
Training in progress, step 320
8360094
verified
yuta0x89
commited on
Feb 8
Training in progress, step 310
43f6284
verified
yuta0x89
commited on
Feb 8
Training in progress, step 300
d8e16e1
verified
yuta0x89
commited on
Feb 8
Training in progress, step 290
fa01427
verified
yuta0x89
commited on
Feb 8
Training in progress, step 280
a27b436
verified
yuta0x89
commited on
Feb 8
Training in progress, step 270
1b816a2
verified
yuta0x89
commited on
Feb 7
Training in progress, step 260
fa9d4f6
verified
yuta0x89
commited on
Feb 7
Training in progress, step 250
8c926d6
verified
yuta0x89
commited on
Feb 7
Training in progress, step 240
ca65443
verified
yuta0x89
commited on
Feb 7
Training in progress, step 230
2e10fa6
verified
yuta0x89
commited on
Feb 7
Training in progress, step 220
41dd4ee
verified
yuta0x89
commited on
Feb 7
Training in progress, step 210
0b2eedb
verified
yuta0x89
commited on
Feb 7
Training in progress, step 200
9356568
verified
yuta0x89
commited on
Feb 7
Training in progress, step 190
0da3ca4
verified
yuta0x89
commited on
Feb 7
Training in progress, step 180
3c7c2ec
verified
yuta0x89
commited on
Feb 7
Training in progress, step 170
decf754
verified
yuta0x89
commited on
Feb 6
Training in progress, step 160
c7f0787
verified
yuta0x89
commited on
Feb 6
Previous
1
2
Next