cosimoiaia
commited on
Commit
•
79c3bb1
1
Parent(s):
df80270
Update README.md
Browse files
README.md
CHANGED
@@ -23,6 +23,10 @@ An exclusively Italian speaking, instruction finetuned, Large Language model.
|
|
23 |
The Loquace Italian LLM models family was created as a proof-of-concept to evaluate on how different model sizes can be fine-tuned using QLoRa on an instruct dataset
|
24 |
of a specific language.
|
25 |
|
|
|
|
|
|
|
|
|
26 |
## Model Description
|
27 |
|
28 |
Loquace-70m is the smallest model of the Loquace family. It was trained using QLoRa on a large dataset of 102k question/answer pairs
|
|
|
23 |
The Loquace Italian LLM models family was created as a proof-of-concept to evaluate on how different model sizes can be fine-tuned using QLoRa on an instruct dataset
|
24 |
of a specific language.
|
25 |
|
26 |
+
The QLoRa (https://github.com/artidoro/qlora) method of fine-tuning significantly lower the resources requirements compared to any other methods available,
|
27 |
+
this allow to easily execute the process on significanly larger dataset while still using consumers GPUs and still achieve high accuracy.
|
28 |
+
|
29 |
+
|
30 |
## Model Description
|
31 |
|
32 |
Loquace-70m is the smallest model of the Loquace family. It was trained using QLoRa on a large dataset of 102k question/answer pairs
|