
This text generator is based on the OpenAI GPT-2 model from Hugging Face. The base model went through two stages of training.

First - Fine-tuning of the base model

In this stage the model is fine-tuned on a dataset of single sentences drawn from the texts of F. M. Dostoevsky.

Training parameters:

  • Epoch = 10
  • Learning Rate = 1e-3
  • Optimizer = AdamW
  • Scheduler = OneCycleLR
  • Training env = PyTorch
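
A minimal PyTorch sketch of the optimizer/scheduler setup listed above. A tiny stand-in module and a dummy objective are used instead of the actual GPT-2 model and dataset (both are assumptions here, so the snippet runs self-contained); the AdamW + OneCycleLR wiring mirrors the card's parameters:

```python
import torch
from torch.optim import AdamW
from torch.optim.lr_scheduler import OneCycleLR

# Stand-in for the GPT-2 model: a tiny linear layer, so no weights are downloaded.
model = torch.nn.Linear(16, 16)

epochs, steps_per_epoch = 10, 5  # card: Epoch = 10; steps_per_epoch is an assumption
optimizer = AdamW(model.parameters(), lr=1e-3)  # card: AdamW, lr = 1e-3
scheduler = OneCycleLR(optimizer, max_lr=1e-3,
                       epochs=epochs, steps_per_epoch=steps_per_epoch)

losses = []
for epoch in range(epochs):
    for _ in range(steps_per_epoch):
        x = torch.randn(4, 16)                           # stand-in batch
        loss = torch.nn.functional.mse_loss(model(x), x)  # dummy objective
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
        scheduler.step()  # OneCycleLR is stepped once per batch
    losses.append(loss.item())
```

With a real run, the dummy module and objective would be replaced by the GPT-2 model and its language-modeling loss over the sentence dataset.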


Second - Reinforcement learning (RL)

In this stage the fine-tuned model went through a reinforcement learning pipeline built with the TRL library.

Training parameters:

  • Epoch = 30
  • Trainer = PPO
  • Query texts = first 100 texts from the dataset, trimmed to their first 3 words
  • Reward = score from a binary classifier, multiplied by 10
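
The query preparation and reward shaping above can be sketched as follows; the helper names, sample texts, and classifier stub are assumptions for illustration, not the card's actual code:

```python
def make_queries(texts, n_texts=100, n_words=3):
    """Take the first `n_texts` entries and trim each to its first `n_words` words."""
    return [" ".join(t.split()[:n_words]) for t in texts[:n_texts]]

def reward(classifier_score):
    """PPO reward as described in the card: binary-classifier score scaled by 10."""
    return classifier_score * 10.0

# Sample texts standing in for the Dostoevsky dataset.
queries = make_queries(["It was a dark and stormy night",
                        "He could not believe his eyes"])
print(queries)      # → ['It was a', 'He could not']
print(reward(0.8))  # → 8.0
```

In the PPO loop, each trimmed query is fed to the model as a prompt, and the reward for the generated continuation comes from the classifier's score.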

