Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
amang1802
/
Llama3.2-1B-summary-length-exp6
like
0
Text Generation
Transformers
Safetensors
llama
conversational
text-generation-inference
Inference Endpoints
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Llama3.2-1B-summary-length-exp6
/
README.md
amang1802
Update README.md
7ee374f
verified
about 2 months ago
preview
code
|
raw
Copy download link
history
blame
contribute
delete
Safe
308 Bytes
metadata
library_name:
transformers
tags:
[]
Model Card for Model ID
Summary Length PPO experiment #5
No KL divergence in loss
Model Details
Dataset size: 1024
Epochs: 1
Batch Size: 4 * 4 (w/ 4 GPUs) * 8 (w/ Gradient Accumulation)
Optimizer args: Torch AdamW default, except
LR = 0.00001