GPT-2 for Tigrinya Language

This repository contains a GPT-2 model trained from scratch on Tigrinya text using the Hugging Face Transformers library.

Model Details

  • Model Type: GPT-2
  • Language: Tigrinya
  • Vocabulary Size: 16,000
  • Maximum Sequence Length: 128 tokens
  • Parameters: 16.8M (F32, safetensors)
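
The exact architecture configuration is not listed in this card. The sketch below shows how a GPT-2 model of roughly this size could be instantiated: vocab_size and n_positions come from the details above, while n_embd, n_layer and n_head are assumptions chosen only so that the parameter count lands near the reported 16.8M.

from transformers import GPT2Config, GPT2LMHeadModel

# Assumed configuration: vocab_size and n_positions match the card above,
# but n_embd/n_layer/n_head are guesses, not the published checkpoint's values.
config = GPT2Config(
    vocab_size=16000,
    n_positions=128,
    n_embd=384,   # assumption
    n_layer=6,    # assumption
    n_head=6,     # assumption
)
model = GPT2LMHeadModel(config)
print(f"{model.num_parameters():,} parameters")  # roughly 16.8M with these assumed dimensions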

Training Details

  • Number of Epochs: 12
  • Batch Size: 1 (with gradient accumulation steps of 4, for an effective batch size of 4)
  • Learning Rate: 5e-4
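
A minimal sketch of how these hyperparameters map onto Hugging Face TrainingArguments, assuming a tokenizer has already been trained and a tokenized dataset is available. The toy dataset below is a placeholder; the actual corpus is not distributed with this repository.

from datasets import Dataset
from transformers import (AutoTokenizer, DataCollatorForLanguageModeling,
                          GPT2Config, GPT2LMHeadModel, Trainer,
                          TrainingArguments)

# Tokenizer loaded from the published repo for illustration; GPT-2 tokenizers
# usually define no pad token, so reuse the end-of-text token for padding.
tokenizer = AutoTokenizer.from_pretrained("luel/gpt2-tigrinya-small")
tokenizer.pad_token = tokenizer.eos_token

# Fresh (from-scratch) model; architecture dimensions here are library
# defaults, not the published checkpoint's configuration.
model = GPT2LMHeadModel(GPT2Config(vocab_size=16000, n_positions=128))

# Placeholder corpus: a single word repeated, taken from the usage example below.
texts = ["ትግራይ"] * 8
enc = tokenizer(texts, truncation=True, max_length=128)
train_dataset = Dataset.from_dict(dict(enc))

training_args = TrainingArguments(
    output_dir="gpt2-tigrinya-small",
    num_train_epochs=12,
    per_device_train_batch_size=1,
    gradient_accumulation_steps=4,   # effective batch size of 4
    learning_rate=5e-4,
)

trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=train_dataset,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()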

Dataset Statistics

  • Total number of words: 16,061,839
  • Total number of unique words: 458,901
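
For reference, counts like these can be produced with a simple whitespace split, as in the sketch below. The file name is hypothetical and the actual preprocessing used for the corpus is not documented here.

# Hypothetical corpus file; illustrates whitespace-based word counting only.
total_words = 0
unique_words = set()
with open("tigrinya_corpus.txt", encoding="utf-8") as f:
    for line in f:
        words = line.split()
        total_words += len(words)
        unique_words.update(words)
print(f"Total words: {total_words:,}")
print(f"Unique words: {len(unique_words):,}")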

Usage

from transformers import pipeline

# Load the model
generator = pipeline('text-generation', model='luel/gpt2-tigrinya-small')

# Generate text (the pipeline returns a list of dicts)
text = generator("ትግራይ", max_length=60)
print(text[0]["generated_text"])
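
For finer control over decoding, the tokenizer and model can also be loaded directly. The sampling parameters below are illustrative defaults, not settings recommended by the model authors.

from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("luel/gpt2-tigrinya-small")
model = AutoModelForCausalLM.from_pretrained("luel/gpt2-tigrinya-small")

# Encode the prompt and sample a continuation (parameters are assumptions).
inputs = tokenizer("ትግራይ", return_tensors="pt")
outputs = model.generate(
    **inputs,
    max_length=60,
    do_sample=True,
    top_k=50,
    top_p=0.95,
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))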