Update README.md
Browse files
README.md
CHANGED
@@ -4,29 +4,28 @@ tags:
|
|
4 |
- unsloth
|
5 |
- trl
|
6 |
- sft
|
|
|
|
|
7 |
---
|
8 |
|
9 |
# Model Card for Model ID
|
10 |
|
11 |
<!-- Provide a quick summary of what the model is/does. -->
|
12 |
|
|
|
13 |
|
14 |
|
15 |
## Model Details
|
16 |
|
17 |
### Model Description
|
18 |
|
19 |
-
|
20 |
|
21 |
-
|
22 |
-
|
23 |
-
- **
|
24 |
-
- **
|
25 |
-
- **
|
26 |
-
- **Model type:** [More Information Needed]
|
27 |
-
- **Language(s) (NLP):** [More Information Needed]
|
28 |
-
- **License:** [More Information Needed]
|
29 |
-
- **Finetuned from model [optional]:** [More Information Needed]
|
30 |
|
31 |
### Model Sources [optional]
|
32 |
|
@@ -40,11 +39,15 @@ This is the model card of a 🤗 transformers model that has been pushed on the
|
|
40 |
|
41 |
<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
|
42 |
|
|
|
|
|
|
|
|
|
43 |
### Direct Use
|
44 |
|
45 |
<!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
|
46 |
|
47 |
-
|
48 |
|
49 |
### Downstream Use [optional]
|
50 |
|
@@ -62,19 +65,28 @@ This is the model card of a 🤗 transformers model that has been pushed on the
|
|
62 |
|
63 |
<!-- This section is meant to convey both technical and sociotechnical limitations. -->
|
64 |
|
65 |
-
[More Information Needed]
|
66 |
|
67 |
### Recommendations
|
68 |
|
69 |
<!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
|
70 |
|
71 |
-
Users
|
72 |
|
73 |
## How to Get Started with the Model
|
74 |
|
75 |
Use the code below to get started with the model.
|
76 |
|
77 |
-
[
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
78 |
|
79 |
## Training Details
|
80 |
|
@@ -82,7 +94,7 @@ Use the code below to get started with the model.
|
|
82 |
|
83 |
<!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
|
84 |
|
85 |
-
|
86 |
|
87 |
### Training Procedure
|
88 |
|
@@ -169,7 +181,7 @@ Carbon emissions can be estimated using the [Machine Learning Impact calculator]
|
|
169 |
|
170 |
#### Software
|
171 |
|
172 |
-
|
173 |
|
174 |
## Citation [optional]
|
175 |
|
@@ -199,4 +211,4 @@ Carbon emissions can be estimated using the [Machine Learning Impact calculator]
|
|
199 |
|
200 |
## Model Card Contact
|
201 |
|
202 |
-
|
|
|
4 |
- unsloth
|
5 |
- trl
|
6 |
- sft
|
7 |
+
language:
|
8 |
+
- it
|
9 |
---
|
10 |
|
11 |
# Model Card for Model ID
|
12 |
|
13 |
<!-- Provide a quick summary of what the model is/does. -->
|
14 |
|
15 |
+
ItalIA is a LLM trained for the Italian language and based on Llama3-8b.
|
16 |
|
17 |
|
18 |
## Model Details
|
19 |
|
20 |
### Model Description
|
21 |
|
22 |
+
ItalIA is a state-of-the-art language model specifically trained for the Italian language, leveraging the latest advancements in the LLM frameworks llama3. This model aims to provide highly accurate and context-aware natural language understanding and generation, making it ideal for a wide range of applications from automated customer support to content creation.
|
23 |
|
24 |
+
- **Developed by:** [Davide Pizzo]
|
25 |
+
- **Model type:** [Transformer-based Large Language Model]
|
26 |
+
- **Language(s) (NLP):** [Italian]
|
27 |
+
- **License:** [Other]
|
28 |
+
- **Finetuned from model [optional]:** [llama3-8b]
|
|
|
|
|
|
|
|
|
29 |
|
30 |
### Model Sources [optional]
|
31 |
|
|
|
39 |
|
40 |
<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
|
41 |
|
42 |
+
|
43 |
+
ItalIA can be directly integrated into applications requiring natural language processing in Italian, including but not limited to text summarization, question answering, and conversational agents.
|
44 |
+
|
45 |
+
|
46 |
### Direct Use
|
47 |
|
48 |
<!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
|
49 |
|
50 |
+
This model serves as a powerful italian base for fine-tuning on specific tasks such as legal document analysis, medical record interpretation, and more specialized forms of conversational AI tailored to specific industries.
|
51 |
|
52 |
### Downstream Use [optional]
|
53 |
|
|
|
65 |
|
66 |
<!-- This section is meant to convey both technical and sociotechnical limitations. -->
|
67 |
|
|
|
68 |
|
69 |
### Recommendations
|
70 |
|
71 |
<!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
|
72 |
|
73 |
+
Users should be aware of the potential for biased outputs based on the training data, particularly in scenarios involving regional linguistic variations within Italy.
|
74 |
|
75 |
## How to Get Started with the Model
|
76 |
|
77 |
Use the code below to get started with the model.
|
78 |
|
79 |
+
[from transformers import AutoModelForCausalLM, AutoTokenizer
|
80 |
+
|
81 |
+
model_name = "your-model-name-on-huggingface"
|
82 |
+
tokenizer = AutoTokenizer.from_pretrained(model_name)
|
83 |
+
model = AutoModelForCausalLM.from_pretrained(model_name)
|
84 |
+
|
85 |
+
text = "Inserisci qui il tuo testo in italiano."
|
86 |
+
input_ids = tokenizer.encode(text, return_tensors="pt")
|
87 |
+
output = model.generate(input_ids)
|
88 |
+
|
89 |
+
print(tokenizer.decode(output[0], skip_special_tokens=True))]
|
90 |
|
91 |
## Training Details
|
92 |
|
|
|
94 |
|
95 |
<!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
|
96 |
|
97 |
+
The model was trained on a diverse corpus of Italian texts, including literature, news articles, and web content, ensuring a broad understanding of the language.
|
98 |
|
99 |
### Training Procedure
|
100 |
|
|
|
181 |
|
182 |
#### Software
|
183 |
|
184 |
+
unsloth
|
185 |
|
186 |
## Citation [optional]
|
187 |
|
|
|
211 |
|
212 |
## Model Card Contact
|
213 |
|
214 |
+
For any question, contact me [pizzodavide93@gmail.com]
|