cosimoiaia committed
Commit e094381 • 1 Parent(s): ff93b42

Update README.md

Files changed (1): README.md (+4 -1)
README.md CHANGED
@@ -20,6 +20,9 @@ Model Card for Loquace-70m
 
 An exclusively Italian-speaking, instruction-finetuned Large Language Model. 🇮🇹
 
+The Loquace family of Italian LLMs was created as a proof of concept to evaluate how different model sizes
+can be fine-tuned using QLoRa on an instruct dataset in a specific language.
+
 ## Model Description
 
 Loquace-70m is the smallest model of the Loquace family. It was trained using QLoRa on a large dataset of 102k question/answer pairs
@@ -57,7 +60,7 @@ model = LLaMAForCausalLM.from_pretrained(
 
 Loquace-70m was trained on a conversational dataset comprising 102k question/answer pairs in the Italian language.
 The training data was constructed by putting together translations from the original Alpaca dataset and other sources such as the OpenAssistant dataset.
-The model was trained for only 3000 iterations and took 18 hours on a single RTX 3090, kindly provided by Genesis Cloud.
+The model was trained for only 10000 iterations and took 6 hours on a single RTX 3090, kindly provided by Genesis Cloud (https://gnsiscld.co/26qhlf).
 
 ## Limitations
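
For readers curious about the recipe this commit describes, here is a minimal sketch of what a QLoRa fine-tuning setup along these lines could look like, assuming the Hugging Face `transformers`, `peft`, and `bitsandbytes` libraries. The base model id and every hyperparameter below are illustrative assumptions, not the values actually used for Loquace-70m.

```python
# A minimal QLoRa fine-tuning sketch. The base model id, target modules, and
# all hyperparameters are illustrative assumptions, not the Loquace-70m values.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

# Load the frozen base model with 4-bit NF4 quantization (the "Q" in QLoRa).
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(
    "some-base-model",                    # hypothetical base checkpoint
    quantization_config=bnb_config,
    device_map="auto",
)
model = prepare_model_for_kbit_training(model)

# Attach small trainable low-rank adapters; the quantized base stays frozen.
lora_config = LoraConfig(
    r=8,
    lora_alpha=16,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # typical LLaMA attention projections
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()        # only a small fraction of weights train
```

Quantizing the frozen base weights to 4-bit and training only the low-rank adapters is what keeps the memory footprint small enough for a run like the one described here to fit on a single RTX 3090.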