amang1802/Llama3.2-1B-summary-length-exp4
Tags: Text Generation, Transformers, Safetensors, llama, conversational, text-generation-inference, Inference Endpoints
Model Card for Model ID
Summary Length PPO experiment #2
The loss has no KL divergence term.
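To make concrete what dropping the KL term means, here is a minimal pure-Python sketch of a clipped PPO policy loss with an optional KL penalty against a reference policy. It is not the code used for this experiment: the function name `ppo_clipped_loss`, the clip range of 0.2, and the simple per-sample KL estimate are all assumptions chosen for illustration. Setting `kl_coef=0.0` corresponds to the "no KL" setting described above.

```python
import math

def ppo_clipped_loss(new_logprobs, old_logprobs, advantages,
                     clip_eps=0.2, kl_coef=0.0, ref_logprobs=None):
    """Mean clipped PPO surrogate loss over sampled tokens.

    Hypothetical helper for illustration only. All inputs are plain
    lists of floats with the same length.
    """
    total = 0.0
    for i, (new_lp, old_lp, adv) in enumerate(
            zip(new_logprobs, old_logprobs, advantages)):
        # Probability ratio between the current and the sampling policy.
        ratio = math.exp(new_lp - old_lp)
        # Clip the ratio to [1 - eps, 1 + eps] (eps = 0.2 is an assumption).
        clipped = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
        # PPO maximizes the pessimistic surrogate, so the loss is its negative.
        loss = -min(ratio * adv, clipped * adv)
        # Optional KL penalty: a per-sample estimate log pi(a) - log pi_ref(a).
        # With kl_coef = 0.0 nothing ties the policy to the reference model.
        if kl_coef and ref_logprobs is not None:
            loss += kl_coef * (new_lp - ref_logprobs[i])
        total += loss
    return total / len(new_logprobs)

# Without a KL term, only the reward-derived advantages shape the update.
no_kl = ppo_clipped_loss([-1.0, -2.0], [-1.0, -2.0], [1.0, 3.0])
with_kl = ppo_clipped_loss([-1.0, -2.0], [-1.0, -2.0], [1.0, 3.0],
                           kl_coef=0.1, ref_logprobs=[-1.5, -2.5])
```

In a real training run these quantities would be tensors computed per token, and the KL term is sometimes folded into the reward instead of the loss; the sketch only shows where the term would sit.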
Model Details
Dataset size: 1024
Epochs: 2
Batch size: 4 * 8 = 32 (with gradient accumulation)
Optimizer args: torch.optim.AdamW defaults, except lr = 0.0001
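The figures above imply a fairly short training run, which the following sketch works out. It assumes that the 4 is the per-step micro-batch size and the 8 is the number of gradient accumulation steps (the card does not say which is which), and that each batch yields exactly one optimizer update. The variable names are illustrative, not taken from the training code.

```python
dataset_size = 1024
epochs = 2
micro_batch_size = 4   # assumption: the "4" is the per-step batch size
grad_accum_steps = 8   # assumption: the "8" is the accumulation count
learning_rate = 1e-4   # the only AdamW argument changed from its default

# Samples consumed per optimizer update.
effective_batch_size = micro_batch_size * grad_accum_steps

# Optimizer updates per epoch and over the whole run.
optimizer_steps_per_epoch = dataset_size // effective_batch_size
total_optimizer_steps = optimizer_steps_per_epoch * epochs

# Forward/backward passes per epoch under the micro-batch assumption.
micro_batches_per_epoch = dataset_size // micro_batch_size

print(effective_batch_size, optimizer_steps_per_epoch, total_optimizer_steps)
```

The totals do not depend on which factor is the micro-batch: either way the effective batch size is 32 and the run performs 64 optimizer updates. PPO implementations often take several gradient passes over each sampled batch, in which case the real update count would be higher.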
Outcomes
The model only outputs a single word: "relationship".
Safetensors
Model size: 1.24B params
Tensor type: BF16