librarian-bot committed on
Commit b491b0c
1 Parent(s): 6635468

Librarian Bot: Update Hugging Face dataset ID

This pull request updates the ID of the dataset used to train the model to its new Hub identifier, `tau/commonsense_qa` (migrated from `commonsense_qa`). We have been working to migrate datasets to their own repositories on the Hub, and this change is part of that effort.

Updating the dataset ID in the model card will ensure that the model card is correctly linked to the dataset repository on the Hub. This will also make it easier for people to find your model via the training data used to create it.
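
As a rough illustration of the kind of change involved, the sketch below rewrites a single entry under `datasets:` in a model card's YAML front matter. The helper name `update_dataset_id` and the sample card are hypothetical and are not taken from Librarian Bot; the sketch also assumes list items start at column 0, as they do in this README.

```python
def update_dataset_id(card_text: str, old_id: str, new_id: str) -> str:
    """Replace one dataset ID in the YAML front matter of a model card.

    Hypothetical helper for illustration only. It assumes the front matter
    is delimited by '---' lines and that list items are not indented.
    """
    lines = card_text.split("\n")
    # The front matter must open with '---' on the very first line.
    if not lines or lines[0].strip() != "---":
        return card_text
    # Locate the closing '---'; without it there is no front matter to edit.
    end = next((i for i in range(1, len(lines)) if lines[i].strip() == "---"), None)
    if end is None:
        return card_text

    in_datasets = False
    for i in range(1, end):
        stripped = lines[i].strip()
        if not lines[i].startswith((" ", "-")):
            # A top-level key: track whether we are inside the datasets list.
            in_datasets = stripped == "datasets:"
        elif in_datasets and stripped == f"- {old_id}":
            # Exact match only, so an already-migrated ID is left untouched.
            lines[i] = lines[i].replace(old_id, new_id)
    return "\n".join(lines)


# Small made-up card that mirrors the structure of the README in this PR.
card = """---
license: openrail
datasets:
- huggingface/transformers-metadata
- commonsense_qa
metrics:
- accuracy
---
Model description.
"""

updated = update_dataset_id(card, "commonsense_qa", "tau/commonsense_qa")
print(updated)
```

Matching the whole list item rather than a substring keeps the rewrite idempotent: running it a second time on the updated card changes nothing.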

This PR comes courtesy of [Librarian Bot](https://huggingface.co/librarian-bot). If you have any feedback, queries, or need assistance, please don't hesitate to reach out to [@davanstrien](https://huggingface.co/davanstrien).

Files changed (1)
  1. README.md +26 -28
README.md CHANGED
@@ -1,5 +1,28 @@
 ---
+language:
+- en
+- it
+- fr
+- pt
+- la
+- ru
+- ro
+- el
+- ja
+- zh
+- ga
+- cy
+- gd
+- de
+- da
+- sw
+- bg
+- ce
+- rm
 license: openrail
+library_name: transformers
+tags:
+- text-generation-inference
 datasets:
 - irds/codesearchnet
 - giganticode/java-cmpx-v1
@@ -12,8 +35,7 @@ datasets:
 - codeparrot/github-code
 - codeparrot/github-code-clean
 - code_x_glue_cc_code_completion_line
-- >-
-  autoevaluate/autoeval-eval-jeffdshen__inverse_superglue_mixedp1-jeffdshen__inverse-63643c-1665558893
+- autoevaluate/autoeval-eval-jeffdshen__inverse_superglue_mixedp1-jeffdshen__inverse-63643c-1665558893
 - bentrevett/multi30k
 - edbeeching/decision_transformer_gym_replay
 - psyche/common_crawl
@@ -64,8 +86,7 @@ datasets:
 - US-Artificial-Intelligence/algemap
 - rombodawg/2XUNCENSORED_alpaca_840k_Evol_USER_ASSIS
 - griffin/chain_of_density
-- >-
-  shirsh10mall/LLM_Instruct_Learning_Project_Preprocessed_Tokenized_Open_Orca_Dataset_Flan_T5
+- shirsh10mall/LLM_Instruct_Learning_Project_Preprocessed_Tokenized_Open_Orca_Dataset_Flan_T5
 - Thaweewat/chain-of-thought-74k-th
 - AlekseyKorshuk/chain-of-thoughts-chatml-deduplicated
 - dair-ai/emotion
@@ -117,7 +138,7 @@ datasets:
 - Amani27/massive_translation_dataset
 - huggingface/transformers-metadata
 - hf-internal-testing/transformers-metadata
-- commonsense_qa
+- tau/commonsense_qa
 - nlplabtdtu/test-edu-crawl
 - kernelmachine/open-license-corpus
 - BDas/EnglishNLPDataset
@@ -140,26 +161,6 @@ datasets:
 - EnigmaOfTheWorld/b-mc2-sql-create-context
 - HuggingFaceH4/testing_self_instruct_small
 - RUCAIBox/Data-to-text-Generation
-language:
-- en
-- it
-- fr
-- pt
-- la
-- ru
-- ro
-- el
-- ja
-- zh
-- ga
-- cy
-- gd
-- de
-- da
-- sw
-- bg
-- ce
-- rm
 metrics:
 - accuracy
 - bertscore
@@ -175,9 +176,6 @@ metrics:
 - perplexity
 - precision
 - hyperml/balanced_accuracy
-tags:
-- text-generation-inference
-library_name: transformers
 pipeline_tag: text-generation
 ---
 