Delta-Vector
/

Hamanasu-4B-PT

Model card Files Files and versions Community

Model Visualization

Hamanasu 4B

🌌 Overview

This model is a finetune of Llama-3.1-Minitron-4B-Width-Base-chatml on 1B tokens of Stories & Books

This model is not usable for Chat.

All thanks to Tav for funding the train.

⚔️ Hardware

8x H100s
Epochs: 1
Base: IntervitensInc/Llama-3.1-Minitron-4B-Width-Base-chatml

Axolotl Config ꒰(˶• ᴗ •˶)꒱

https://wandb.ai/new-eden/tavbussy/artifacts/axolotl-config/config-jpgzpr2g/v0/files/axolotl_config_3y5zkvbz.yml

⚡ Credits

Made by

Delta-Vector

Downloads last month: 29

Safetensors

Model size

4.51B params

Tensor type

BF16

·

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Delta-Vector/Hamanasu-4B-PT

Base model

IntervitensInc/Llama-3.1-Minitron-4B-Width-Base-chatml

Finetuned

(13)

this model

Finetunes

1 model

Quantizations

Datasets used to train Delta-Vector/Hamanasu-4B-PT

Collection including Delta-Vector/Hamanasu-4B-PT

Hamanasu

A brand new series of Models from yours truly, Designed for Intelligence, Creativity and Roleplay - R/Locallama keeps DELETING MY GODDAMN COMMENTS • 31 items • Updated about 18 hours ago • 8