Unbabel
/

TowerInstruct-7B-v0.1

text-generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

nunonmg commited on Jan 5, 2024

Commit

923de31

·

1 Parent(s): b47c7ad

Update README.md

Files changed (1) hide show

README.md +22 -17

README.md CHANGED Viewed

@@ -25,27 +25,32 @@ This modelcard aims to be a base template for new models. It has been generated
 ### Model Description
-<!-- Provide a longer summary of what this model is. -->
 - **Developed by:** Unbabel, Instituto Superior Técnico, CentraleSupélec University of Paris-Saclay
-- **Model type:** [More Information Needed]
-- **Language(s) (NLP):** [More Information Needed]
 - **License:** CC-BY-NC-4.0
-- **Finetuned from model [optional]:** LLaMA2
-### Model Sources [optional]
-<!-- Provide the basic links for the model. -->
-- **Repository:** TBA
-- **Paper [optional]:** TBA
-- **Demo [optional]:** TBA
-## Uses
-<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
 ### Direct Use

 ### Model Description
+TowerInstruct is a language model that results from fine-tuning TowerBase on the TowerBricks supervised fine-tuning dataset. TowerInstruct v0.1 is the first model in the series.
+The model is trained to handle several translation-related tasks, such as general machine translation (e.g., sentence- and document-level translation, terminology-aware translation, context-aware translation), automatic post edition, named-entity recognition, gramatical error correction, and paraphrase generation.
+We will release more details in the upcoming technical report.
 - **Developed by:** Unbabel, Instituto Superior Técnico, CentraleSupélec University of Paris-Saclay
+- **Model type:** A 7B parameter model fine-tuned on a mix of publicly available, synthetic datasets on translation-related tasks, as well as conversational datasets and code instructions.
+- **Language(s) (NLP):** English, Portuguese, Spanish, French, German, Dutch, Italian, Korean, Chinese, Russian
 - **License:** CC-BY-NC-4.0
+- **Finetuned from model [optional]:** TowerBase
+## Intended uses & limitations
+The model was initially fine-tuned on a filtered and preprocessed supervised fine-tuning dataset (TowerBricks), which contains a diverse range of data sources:
+- Translation
+- Automatic Post Edition
+- Machine Translation Evaluation
+- Context-aware Translation
+- Terminology-aware Translation
+- Multi-reference Translation
+- Named-entity Recognition
+- Paraphrase Generation
+- Synthetic Chat data
+- Code instructions
+You can find the dataset and all data sources of TowerBricks here.
 ### Direct Use