Edit model card

Megatron-GPT2-Classification

Description

The megatron-gpt2-classification model is a language model trained using Megatron and Accelerate frameworks. It has been fine-tuned for classification tasks and benefits from distributed training across 4 GPUs (RTX 4070).

Key Features

  • Trained with Megatron and Accelerate.
  • Distributed training on 4 GPUs (RTX 4070).
  • Fine-tuned for classification tasks.
Downloads last month
7
Safetensors
Model size
124M params
Tensor type
F32
·