maddes8cht committed
Commit ee5b0fe
Parent: 96fca6c
"Update README.md"
README.md
CHANGED
@@ -198,18 +198,20 @@ prompt = """\
 
 ### Training Data
 
+The training dataset will be made available soon.
+
 Claire-7B-0.1 was tuned from Falcon-7b on the following data distribution:
 
 | **Data type**                 | **Words**  | **Training Sampling Weight** | **Sources**                                          |
 |-------------------------------|------------|------------------------------|-----------------------------------------------------|
-| Parliamentary Proceedings     | 135M       | 35%                          |
-| Theatre                       | 16M        | 18%                          |
-| Interviews                    | 6.4M       | 29%                          | TCOF, CFPP, CFPB, ACSYNT, PFC, Valibel (ORFEO), ESLO
-| Free Conversations            | 2.2M       | 10%                          | CRFP, OFROM, CID, Rhapsodie, ParisStories, PFC, CLAPI, C-ORAL-ROM (ORFEO), LinTO, ESLO |
-| Meetings                      | 1.2M       | 5%                           | SUMM-RE, LinTO, Réunions de travail (ORFEO)
-| Debates                       | 402k       | <2%                          |
-| Assistance                    | 159k       | <1%                          | Fleuron (ORFEO), Accueil UBS, OTG, ESLO
-| Presentation, Formal Address
+| Parliamentary Proceedings     | 135M       | 35%                          | Assemblée Nationale                                  |
+| Theatre                       | 16M        | 18%                          | Théâtre Classique, Théâtre Gratuit                   |
+| Interviews                    | 6.4M       | 29%                          | TCOF, CFPP, CFPB, ACSYNT, PFC, Valibel (ORFEO), ESLO |
+| Free Conversations            | 2.2M       | 10%                          | CRFP (ORFEO), OFROM (ORFEO), CID, Rhapsodie, ParisStories, PFC, CLAPI, C-ORAL-ROM (ORFEO), LinTO, ESLO |
+| Meetings                      | 1.2M       | 5%                           | SUMM-RE, LinTO, Réunions de travail (ORFEO)          |
+| Debates                       | 402k       | <2%                          | FreDSum, ESLO                                        |
+| Assistance                    | 159k       | <1%                          | Fleuron (ORFEO), Accueil UBS, OTG, ESLO              |
+| Presentation, Formal Address  | 86k        | <0.5%                        | Valibel (ORFEO), LinTO, ESLO                         |
 
 Training data was augmented with the following techniques:
 * varying the format used to indicate speech turns (dashes or [XXX:])
@@ -223,6 +225,8 @@ While the model has been trained and evaluated only on French dialogues, it may
 
 ### Training Procedure
 
+The training code will be made available soon.
+
 Claire-7B-0.1 is a causal decoder-only model trained on a causal language modeling task (i.e., predict the next token).
 See [Falcon-7b](https://huggingface.co/tiiuae/falcon-7b) for more details.
 
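The augmentation described in the diff, varying the format used to indicate speech turns (dashes or [XXX:]), could be sketched as follows. This is a minimal illustration, not the project's actual preprocessing code: the helper name and the round-robin speaker assignment are assumptions.

```python
def dashes_to_speaker_tags(dialogue: str, speakers: list[str]) -> str:
    """Convert dash-prefixed turns ("- ...") to the [XXX:] convention.

    Hypothetical helper illustrating the speech-turn format augmentation;
    speaker names are assigned round-robin to successive turns.
    """
    out = []
    turn = 0
    for line in dialogue.strip().split("\n"):
        if line.startswith("- "):
            name = speakers[turn % len(speakers)]
            out.append(f"[{name}:] {line[2:]}")
            turn += 1
        else:
            # Non-turn lines (e.g. narration) are left unchanged.
            out.append(line)
    return "\n".join(out)

dialogue = "- Bonjour !\n- Bonjour, comment allez-vous ?"
print(dashes_to_speaker_tags(dialogue, ["Alice", "Bob"]))
# [Alice:] Bonjour !
# [Bob:] Bonjour, comment allez-vous ?
```

Presenting the same dialogue under both conventions teaches the model to treat them as equivalent surface forms of the same turn structure.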