---
base_model: unsloth/Llama-3.2-3B-Instruct-bnb-4bit
language:
- en
license: llama3.2
tags:
- text-generation-inference
- transformers
- unsloth
- llama
- llama-3
- trl
- sft
datasets:
- mlabonne/FineTome-100k
---
|
|
|
# Notice
|
|
|
This model was submitted to the OpenLLM Leaderboard for full evaluation.
|
|
|
# IMPORTANT
|
|
|
If you encounter the following error:

```
Exception: data did not match any variant of untagged enum ModelWrapper at line 1251003 column 3
```
|
please upgrade your **transformers** package, for example:

```
pip install --upgrade "transformers>=4.45"
```
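
After upgrading, a quick way to confirm that the installed version is recent enough (the `4.45` floor simply mirrors the command above) is:

```python
# Verify the installed transformers version before loading the model.
from packaging import version  # installed as a dependency of transformers
import transformers

assert version.parse(transformers.__version__) >= version.parse("4.45.0"), (
    f"transformers {transformers.__version__} is too old; "
    'run: pip install --upgrade "transformers>=4.45"'
)
print(f"transformers {transformers.__version__} is OK")
```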
|
|
|
# Uploaded model
|
|
|
- **Developed by:** NotASI
- **License:** apache-2.0
- **Finetuned from model:** unsloth/Llama-3.2-3B-Instruct-bnb-4bit
|
|
|
# Details
|
|
|
This model was trained on **mlabonne/FineTome-100k** for *2* epochs with **rsLoRA** + **QLoRA**, and reached a final training loss of *0.596400*.
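
The exact training script is not part of this card; the sketch below only illustrates how a run like this is commonly set up with [Unsloth](https://github.com/unslothai/unsloth) and TRL's `SFTTrainer` (4-bit base weights for QLoRA plus rank-stabilized LoRA via `use_rslora=True`). Every hyperparameter shown (rank, alpha, learning rate, batch size, sequence length) is an illustrative placeholder, not necessarily the value used for this model.

```python
# Minimal sketch of an rsLoRA + QLoRA SFT run with Unsloth + TRL (placeholder hyperparameters).
import torch
from datasets import load_dataset
from unsloth import FastLanguageModel
from unsloth.chat_templates import get_chat_template, standardize_sharegpt
from trl import SFTTrainer
from transformers import TrainingArguments

# Load the 4-bit base model (QLoRA: quantized base weights + trainable LoRA adapters).
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/Llama-3.2-3B-Instruct-bnb-4bit",
    max_seq_length=2048,
    load_in_4bit=True,
)

# Attach rank-stabilized LoRA adapters (use_rslora=True).
model = FastLanguageModel.get_peft_model(
    model,
    r=16,
    lora_alpha=16,
    lora_dropout=0,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
    use_rslora=True,
    use_gradient_checkpointing="unsloth",
)

# FineTome-100k is stored in ShareGPT format; render it with the model's chat template.
tokenizer = get_chat_template(tokenizer, chat_template="llama-3.1")
dataset = standardize_sharegpt(load_dataset("mlabonne/FineTome-100k", split="train"))
dataset = dataset.map(
    lambda batch: {"text": [
        tokenizer.apply_chat_template(convo, tokenize=False, add_generation_prompt=False)
        for convo in batch["conversations"]
    ]},
    batched=True,
)

trainer = SFTTrainer(
    model=model,
    tokenizer=tokenizer,
    train_dataset=dataset,
    dataset_text_field="text",
    max_seq_length=2048,
    args=TrainingArguments(
        per_device_train_batch_size=2,
        gradient_accumulation_steps=4,
        num_train_epochs=2,  # matches the 2 epochs reported above
        learning_rate=2e-4,
        bf16=torch.cuda.is_bf16_supported(),
        fp16=not torch.cuda.is_bf16_supported(),
        optim="adamw_8bit",
        logging_steps=10,
        output_dir="outputs",
    ),
)
trainer.train()
```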
|
|
|
This model follows the same chat template as the base model.
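
For inference, prompts should therefore be formatted with the Llama 3.2 instruct chat template, which `tokenizer.apply_chat_template` handles automatically. Below is a minimal usage sketch: `NotASI/<this-model>` is a placeholder for the actual repo id, and it assumes the uploaded weights are merged (if only LoRA adapters were uploaded, load them with `peft` on top of the base model instead).

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder repo id -- substitute the actual Hub id of this model.
model_id = "NotASI/<this-model>"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

# The chat template is the Llama 3.2 instruct template inherited from the base model.
messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Explain LoRA fine-tuning in one paragraph."},
]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(input_ids, max_new_tokens=256)
print(tokenizer.decode(outputs[0][input_ids.shape[-1]:], skip_special_tokens=True))
```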
|
|
|
This Llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Hugging Face's TRL library.
|
|
|
[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)