Vintage-LLM

This is a 340M-parameter, Llama 3-based model trained on pre-1900 text.

This is a base model. It has very limited chat capabilities.
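Because this is a base model, it tends to work best when the prompt reads like the start of a document to be continued rather than a chat instruction. The sketch below shows one way to do that with the Hugging Face `transformers` library. The repository id `your-namespace/Vintage-LLM`, the helper names, and the sampling settings are placeholders and assumptions, not values taken from this card; `bfloat16` is chosen only to match the BF16 tensor type the card lists.

```python
# Hypothetical repository id: replace with the real one for this model.
MODEL_ID = "your-namespace/Vintage-LLM"


def build_completion_prompt(title: str, opening: str) -> str:
    """Frame the request as the start of a document for the model to continue."""
    return f"{title.strip()}\n\n{opening.strip()}"


def generate(prompt: str, model_id: str = MODEL_ID, max_new_tokens: int = 120) -> str:
    """Sample a continuation of `prompt` (assumed settings, not tuned)."""
    # Imported lazily so the prompt helper works without these packages.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)
    inputs = tokenizer(prompt, return_tensors="pt")
    with torch.no_grad():
        output_ids = model.generate(
            **inputs,
            max_new_tokens=max_new_tokens,
            do_sample=True,
            temperature=0.8,  # assumed value
            top_p=0.95,  # assumed value
        )
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


# A period-style opening for the model to continue.
prompt = build_completion_prompt(
    "THE MORNING POST", "LONDON, Tuesday. The weather yesterday was"
)
# Downloads the weights, so it is left commented out here:
# print(generate(prompt))
```

Phrasing the prompt as a headline plus an unfinished sentence gives the model a continuation task, which suits a model with very limited chat capabilities.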

Datasets used:

  • American Stories
  • British Library
  • HMD
  • LOC-PD
  • Lampeter
  • NewsWire

The data was de-duplicated, and only texts of decent quality were kept.
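The card does not say how de-duplication or quality filtering was carried out. Purely as an illustration of the general idea, the sketch below drops exact duplicates after light normalization and keeps texts whose share of alphabetic characters is high, a crude proxy for OCR noise. The function names and the thresholds `min_alpha` and `min_chars` are hypothetical and are not the author's pipeline.

```python
import hashlib
import re


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so trivial variants hash alike."""
    return re.sub(r"\s+", " ", text.strip().lower())


def alpha_ratio(text: str) -> float:
    """Fraction of non-space characters that are letters."""
    chars = [c for c in text if not c.isspace()]
    if not chars:
        return 0.0
    return sum(c.isalpha() for c in chars) / len(chars)


def dedup_and_filter(docs, min_alpha=0.8, min_chars=20):
    """Keep the first copy of each text that passes the quality heuristic.

    The thresholds are illustrative assumptions, not values from the card.
    """
    seen = set()
    kept = []
    for doc in docs:
        norm = normalize(doc)
        digest = hashlib.sha256(norm.encode("utf-8")).hexdigest()
        if digest in seen:
            continue  # exact duplicate after normalization
        seen.add(digest)
        if len(norm) >= min_chars and alpha_ratio(norm) >= min_alpha:
            kept.append(doc)
    return kept


docs = [
    "The steamer arrived at Liverpool on Tuesday morning.",
    "the  steamer arrived at liverpool on tuesday morning.",  # duplicate
    "Th3 st##m%r @rr1v3d ~~ 1|v3rp00| 0n +u35d@y",  # OCR-like noise
    "Short.",  # too short
]
clean = dedup_and_filter(docs)
```

Exact-match hashing only catches identical texts; near-duplicate detection (for example MinHash) would be a separate technique and is not shown here.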

Model weights:

  • Format: Safetensors
  • Model size: 0.3B params
  • Tensor type: BF16