e-palmisano
/

Qwen2-0.5B-ITA-Instruct

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Qwen2-0.5B-ITA-Instruct / README.md

e-palmisano's picture

Update README.md

2a4565f verified 4 months ago

|

history blame contribute delete

1.39 kB

	---
	license: apache-2.0
	datasets:
	- gsarti/clean_mc4_it
	- FreedomIntelligence/alpaca-gpt4-italian
	language:
	- it
	- en
	---


	This model has been fine-tuned with the continuous pretraining mode of Unsloth on the gsarti/clean_mc4_it dataset (only 100k rows) to improve the Italian language. The second fine-tuning was performed on the instructed dataset FreedomIntelligence/alpaca-gpt4-italian.



	# Uploaded model

	- Developed by: e-palmisano
	- License: apache-2.0
	- Finetuned from model : unsloth/Qwen2-0.5B-Instruct-bnb-4bit

	## Evaluation

	For a detailed comparison of model performance, check out the [Leaderboard for Italian Language Models](https://huggingface.co/spaces/FinancialSupport/open_ita_llm_leaderboard).

	Here's a breakdown of the performance metrics:

	\| Metric \| hellaswag_it acc_norm \| arc_it acc_norm \| m_mmlu_it 5-shot acc \| Average \|
	\|:----------------------------\|:----------------------\|:----------------\|:---------------------\|:--------\|
	\| Accuracy Normalized \| 36.28 \| 27.63 \| 35.4 \| 33.1 \|



	This qwen2 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.

	[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)