Instructions to use EhabSuliman/my_awesome_eli5_clm-model with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use EhabSuliman/my_awesome_eli5_clm-model with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="EhabSuliman/my_awesome_eli5_clm-model")

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("EhabSuliman/my_awesome_eli5_clm-model")
model = AutoModelForCausalLM.from_pretrained("EhabSuliman/my_awesome_eli5_clm-model")

Notebooks
Google Colab
Kaggle
Local Apps Settings

vLLM

How to use EhabSuliman/my_awesome_eli5_clm-model with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "EhabSuliman/my_awesome_eli5_clm-model"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "EhabSuliman/my_awesome_eli5_clm-model",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker

docker model run hf.co/EhabSuliman/my_awesome_eli5_clm-model

SGLang

How to use EhabSuliman/my_awesome_eli5_clm-model with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "EhabSuliman/my_awesome_eli5_clm-model" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "EhabSuliman/my_awesome_eli5_clm-model",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "EhabSuliman/my_awesome_eli5_clm-model" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "EhabSuliman/my_awesome_eli5_clm-model",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Docker Model Runner
How to use EhabSuliman/my_awesome_eli5_clm-model with Docker Model Runner:
```
docker model run hf.co/EhabSuliman/my_awesome_eli5_clm-model
```

my_awesome_eli5_clm-model

A Causal Language Model fine-tuned on Arabic text, based on distilbert/distilgpt2.
Fine-tuned by EhabSuliman as part of an LLM course project.

Model Description

Model type: Causal Language Model (CLM)
Base model: distilbert/distilgpt2
Language: English
Fine-tuned by: EhabSuliman
License: Apache 2.0

Intended Uses & Limitations

Intended uses:

Arabic text generation
Language modeling research
Educational/learning purposes

Limitations:

Trained on a relatively small dataset
Loss is still relatively high (3.8027), meaning the model may generate inaccurate or repetitive text
Not recommended for production use without further fine-tuning

How to Use

from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("EhabSuliman/my_awesome_eli5_clm-model")
model = AutoModelForCausalLM.from_pretrained("EhabSuliman/my_awesome_eli5_clm-model")

prompt = "Somatic hypermutation allows the immune system to"
inputs = tokenizer(prompt, return_tensors="pt").input_ids
outputs = model.generate(
    inputs,
    max_new_tokens=100,
    do_sample=True,
    top_k=50,
    top_p=0.95
)
print(tokenizer.batch_decode(outputs, skip_special_tokens=True))

Training and Evaluation Data

Fine-tuned on an Arabic text dataset using the ELI5 (Explain Like I'm 5) format.

Training Procedure

Training Hyperparameters

The following hyperparameters were used during training:

Learning rate: 2e-05
Train batch size: 8
Eval batch size: 8
Seed: 42
Optimizer: AdamW (fused) with betas=(0.9, 0.999), epsilon=1e-08
LR scheduler: Linear
Epochs: 3

Training Results

Training Loss	Epoch	Step	Validation Loss
3.8556	1.0	1327	3.8101
3.7851	2.0	2654	3.8035
3.7514	3.0	3981	3.8027

The model shows steady improvement across epochs with validation loss decreasing from 3.8101 → 3.8027.

Framework Versions

Transformers 5.9.0
PyTorch 2.11.0+cu128
Datasets 4.0.0
Tokenizers 0.22.2

Author

Ehab Suliman — Machine Learning Engineer
🔗 HuggingFace Profile

Downloads last month: 109

Safetensors

Model size

81.9M params

Tensor type

F32

Model tree for EhabSuliman/my_awesome_eli5_clm-model

Base model

distilbert/distilgpt2

Finetuned

(1489)

this model