Hamanasu 4B

🌌 Overview

This model is a finetune of Llama-3.1-Minitron-4B-Width-Base-chatml, trained on 1B tokens of stories and books.

This model is not usable for Chat.
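
Since the model is not meant for chat, the natural way to try it is plain text completion: give it the start of a passage and let it continue. Below is a minimal sketch of that using the Hugging Face transformers library. The sampling values and the helper names `generation_settings` and `complete` are illustrative assumptions, not settings recommended by the author; only the repo id and the BF16 dtype come from the model page.

```python
"""Plain text-completion sketch for a base (non-chat) model."""

# Repo id as shown on the model page; swap in a local path if you have one.
MODEL_ID = "Delta-Vector/Hamanasu-4B-PT"


def generation_settings(max_new_tokens: int = 200) -> dict:
    """Illustrative sampling settings (assumed values, not from the model card)."""
    return {
        "max_new_tokens": max_new_tokens,
        "do_sample": True,
        "temperature": 0.8,
        "top_p": 0.95,
    }


def complete(prompt: str, max_new_tokens: int = 200) -> str:
    """Continue `prompt` as raw text, without applying any chat template."""
    # Imported lazily so the helpers above work without the heavy dependencies.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID,
        torch_dtype=torch.bfloat16,  # the checkpoint is stored in BF16
        device_map="auto",           # requires the `accelerate` package
    )

    # Tokenize the raw prompt directly; no chat roles or templates are added.
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, **generation_settings(max_new_tokens))

    # Drop the prompt tokens so only the continuation is returned.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `complete("The lighthouse keeper had not spoken to anyone in three years, until")` downloads the weights on first use and returns a continuation of the sentence. The prompt is passed as raw text on purpose, so no chat formatting is involved.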

All thanks to Tav for funding the training run.

⚔️ Hardware

  • 8x H100s
  • Epochs: 1
  • Base: IntervitensInc/Llama-3.1-Minitron-4B-Width-Base-chatml

Axolotl Config ꒰(˶• ᴗ •˶)꒱

https://wandb.ai/new-eden/tavbussy/artifacts/axolotl-config/config-jpgzpr2g/v0/files/axolotl_config_3y5zkvbz.yml

⚡ Credits


Made by Delta-Vector
Model size: 4.51B params (Safetensors, tensor type BF16)