Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
amang1802
/
Llama3.2-1B-summary-length-exp7
like
0
Text Generation
Transformers
Safetensors
llama
conversational
text-generation-inference
Inference Endpoints
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Llama3.2-1B-summary-length-exp7
/
README.md
amang1802
Update README.md
b16491b
verified
about 2 months ago
preview
code
|
raw
Copy download link
history
blame
contribute
delete
279 Bytes
metadata
library_name:
transformers
tags:
[]
Model Card for Model ID
Summary Length PPO experiment #7
No KL divergence in loss
Model Details
Dataset size: 16384
Epochs: 1
Batch Size: 16 * 4 (w/ 4 GPUs)
Optimizer args: Torch AdamW default, except
LR = 0.00001