Edit model card
YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

[4., 0.5. 2.] + q / k values trainable + 48 effective batch size Make your batch size as large as possible in order to get better results. Avoiding all_modules trainable keeps some level of intelligence.

Downloads last month
0
Safetensors
Model size
8.03B params
Tensor type
BF16
·
Model is too large to load in Inference API (serverless). To try the model, launch it on Inference Endpoints (dedicated) instead.