LLamaStory-70M is a LLaMA model pre-trained on a story-generation dataset.

About Training:

This model will be used to debug 4-bit and 8-bit training and inference in JAX and Rust with EasyDel.
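To make the 4-bit versus 8-bit distinction concrete, here is a minimal sketch of symmetric absmax quantization in plain Python. It is not EasyDel's implementation; the function names (`quantize`, `dequantize`, `max_abs_error`) and the toy weights are hypothetical and chosen only for illustration. It shows why a small model is handy for debugging: the round-trip error grows noticeably when moving from 8 to 4 bits, and that is easy to inspect on small tensors.

```python
# Symmetric absmax quantization sketch (illustrative only, not EasyDel's code).

def quantize(values, bits):
    """Map floats to signed integer codes in [-qmax, qmax] with one absmax scale."""
    qmax = 2 ** (bits - 1) - 1  # 127 for 8-bit, 7 for 4-bit
    absmax = max(abs(v) for v in values)
    scale = absmax / qmax if absmax > 0 else 1.0
    codes = [max(-qmax, min(qmax, round(v / scale))) for v in values]
    return codes, scale


def dequantize(codes, scale):
    """Recover approximate floats from integer codes."""
    return [c * scale for c in codes]


def max_abs_error(values, bits):
    """Largest absolute round-trip error after quantize -> dequantize."""
    codes, scale = quantize(values, bits)
    restored = dequantize(codes, scale)
    return max(abs(a - b) for a, b in zip(values, restored))


weights = [0.42, -1.30, 0.07, 0.95, -0.61]  # hypothetical toy weights
err8 = max_abs_error(weights, 8)
err4 = max_abs_error(weights, 4)
print(f"8-bit max error: {err8:.5f}")
print(f"4-bit max error: {err4:.5f}")
```

Real 4-bit and 8-bit schemes usually quantize per block or per channel rather than with one scale for the whole tensor; the single-scale version above is the simplest case to reason about when debugging.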

Format: Safetensors
Model size: 70.5M params
Tensor type: FP16
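As a rough sanity check on what lower-precision weights would save, the arithmetic below estimates weight storage from the parameter count. It is a back-of-the-envelope sketch: it assumes exactly 70.5 million parameters, counts 1 MB as 10^6 bytes, and ignores quantization scales, zero points, and file metadata. The helper name `weight_megabytes` is hypothetical.

```python
PARAMS = 70_500_000  # "70.5M params" from the model card (rounded figure)


def weight_megabytes(num_params, bits_per_param):
    """Approximate weight storage in MB (10**6 bytes), ignoring scales and metadata."""
    return num_params * bits_per_param / 8 / 1e6


for label, bits in [("FP16", 16), ("8-bit", 8), ("4-bit", 4)]:
    print(f"{label}: {weight_megabytes(PARAMS, bits):.2f} MB")
```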
