MoxoffSrL
/

Azzurro

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

JacopoAbate commited on Apr 3, 2024

Commit

28d1054

·

verified ·

1 Parent(s): 8fb4b85

Update README.md

Files changed (1) hide show

README.md +4 -6

README.md CHANGED Viewed

@@ -13,18 +13,16 @@ tags:
 # Model Information
-xxxx is a SFT and LoRA finetuned version of Mistral-7B-v0.2
-It has been trained on a mixture of opensource datasets, like SQUAD-it (https://huggingface.co/datasets/squad_it), and some internally made datasets.
-It is not just a Q&A, it is a Q&A + Context model, with the goal being it being used for RAGs and application in need of a context.
 # Evaluation
 We evaluated the model using the same test sets as used for the Open Ita LLM Leaderboard
 | hellaswag_it acc_norm | arc_it acc_norm | m_mmlu_it 5-shot acc | Average |
-|:----------------------:| :---------------: | :--------------------: | :-------: |
 | 0.6067 | 0.4405 | 0.5112 | 0,52 |

 # Model Information
+XXXX is an updated version of Mistral-7B-v0.2, specifically fine-tuned with SFT and LoRA adjustments.
+- It's trained both on publicly available datasets like SQUAD-it and datasets we've created in-house.
+- it's designed to understand and maintain context, making it ideal for Retrieval Augmented Generation (RAG) tasks and applications requiring contextual awareness.
 # Evaluation
 We evaluated the model using the same test sets as used for the Open Ita LLM Leaderboard
 | hellaswag_it acc_norm | arc_it acc_norm | m_mmlu_it 5-shot acc | Average |
+|:----------------------| :--------------- | :-------------------- | :------- |
 | 0.6067 | 0.4405 | 0.5112 | 0,52 |