AlekseyKorshuk commited on
Commit
330a9be
1 Parent(s): b5ba932

huggingartists

Browse files
README.md CHANGED
@@ -14,11 +14,11 @@ widget:
14
  <div class="inline-flex flex-col" style="line-height: 1.5;">
15
  <div class="flex">
16
  <div
17
- style="display:DISPLAY_1; margin-left: auto; margin-right: auto; width: 92px; height:92px; border-radius: 50%; background-size: cover; background-image: url(&#39;https://images.genius.com/22e752c5e4e7e4d2e8488babffb33bbf.1000x1000x1.jpg&#39;)">
18
  </div>
19
  </div>
20
  <div style="text-align: center; margin-top: 3px; font-size: 16px; font-weight: 800">🤖 HuggingArtists Model 🤖</div>
21
- <div style="text-align: center; font-size: 16px; font-weight: 800">MORGENSHTERN</div>
22
  <a href="https://genius.com/artists/morgenshtern">
23
  <div style="text-align: center; font-size: 14px;">@morgenshtern</div>
24
  </a>
@@ -34,7 +34,7 @@ To understand how the model was developed, check the [W&B report](https://wandb.
34
 
35
  ## Training data
36
 
37
- The model was trained on lyrics from MORGENSHTERN.
38
 
39
  Dataset is available [here](https://huggingface.co/datasets/huggingartists/morgenshtern).
40
  And can be used with:
@@ -45,15 +45,15 @@ from datasets import load_dataset
45
  dataset = load_dataset("huggingartists/morgenshtern")
46
  ```
47
 
48
- [Explore the data](https://wandb.ai/huggingartists/huggingartists/runs/3q6pnb0j/artifacts), which is tracked with [W&B artifacts](https://docs.wandb.com/artifacts) at every step of the pipeline.
49
 
50
  ## Training procedure
51
 
52
- The model is based on a pre-trained [GPT-2](https://huggingface.co/gpt2) which is fine-tuned on MORGENSHTERN's lyrics.
53
 
54
- Hyperparameters and metrics are recorded in the [W&B training run](https://wandb.ai/huggingartists/huggingartists/runs/2ury2mhk) for full transparency and reproducibility.
55
 
56
- At the end of training, [the final model](https://wandb.ai/huggingartists/huggingartists/runs/2ury2mhk/artifacts) is logged and versioned.
57
 
58
  ## How to use
59
 
@@ -90,6 +90,8 @@ In addition, the data present in the user's tweets further affects the text gene
90
 
91
  [![Follow](https://img.shields.io/twitter/follow/alekseykorshuk?style=social)](https://twitter.com/intent/follow?screen_name=alekseykorshuk)
92
 
 
 
93
  For more details, visit the project repository.
94
 
95
  [![GitHub stars](https://img.shields.io/github/stars/AlekseyKorshuk/huggingartists?style=social)](https://github.com/AlekseyKorshuk/huggingartists)
 
14
  <div class="inline-flex flex-col" style="line-height: 1.5;">
15
  <div class="flex">
16
  <div
17
+ style="display:DISPLAY_1; margin-left: auto; margin-right: auto; width: 92px; height:92px; border-radius: 50%; background-size: cover; background-image: url(&#39;https://images.genius.com/df75ede64ffcf049727bfbb01d323081.400x400x1.jpg&#39;)">
18
  </div>
19
  </div>
20
  <div style="text-align: center; margin-top: 3px; font-size: 16px; font-weight: 800">🤖 HuggingArtists Model 🤖</div>
21
+ <div style="text-align: center; font-size: 16px; font-weight: 800">The Beatles</div>
22
  <a href="https://genius.com/artists/morgenshtern">
23
  <div style="text-align: center; font-size: 14px;">@morgenshtern</div>
24
  </a>
 
34
 
35
  ## Training data
36
 
37
+ The model was trained on lyrics from The Beatles.
38
 
39
  Dataset is available [here](https://huggingface.co/datasets/huggingartists/morgenshtern).
40
  And can be used with:
 
45
  dataset = load_dataset("huggingartists/morgenshtern")
46
  ```
47
 
48
+ [Explore the data](https://wandb.ai/huggingartists/huggingartists/runs/36ru50a4/artifacts), which is tracked with [W&B artifacts](https://docs.wandb.com/artifacts) at every step of the pipeline.
49
 
50
  ## Training procedure
51
 
52
+ The model is based on a pre-trained [GPT-2](https://huggingface.co/gpt2) which is fine-tuned on The Beatles's lyrics.
53
 
54
+ Hyperparameters and metrics are recorded in the [W&B training run](https://wandb.ai/huggingartists/huggingartists/runs/1k6lslqs) for full transparency and reproducibility.
55
 
56
+ At the end of training, [the final model](https://wandb.ai/huggingartists/huggingartists/runs/1k6lslqs/artifacts) is logged and versioned.
57
 
58
  ## How to use
59
 
 
90
 
91
  [![Follow](https://img.shields.io/twitter/follow/alekseykorshuk?style=social)](https://twitter.com/intent/follow?screen_name=alekseykorshuk)
92
 
93
+ [![Follow](https://img.shields.io/badge/dynamic/json?color=blue&label=Telegram%20Channel&query=%24.result&url=https%3A%2F%2Fapi.telegram.org%2Fbot1929545866%3AAAFGhV-KKnegEcLiyYJxsc4zV6C-bdPEBtQ%2FgetChatMemberCount%3Fchat_id%3D-1001253621662&style=social&logo=telegram)](https://t.me/joinchat/_CQ04KjcJ-4yZTky)
94
+
95
  For more details, visit the project repository.
96
 
97
  [![GitHub stars](https://img.shields.io/github/stars/AlekseyKorshuk/huggingartists?style=social)](https://github.com/AlekseyKorshuk/huggingartists)
config.json CHANGED
@@ -1,5 +1,5 @@
1
  {
2
- "_name_or_path": "gpt2",
3
  "activation_function": "gelu_new",
4
  "architectures": [
5
  "GPT2LMHeadModel"
@@ -35,7 +35,7 @@
35
  }
36
  },
37
  "torch_dtype": "float32",
38
- "transformers_version": "4.9.1",
39
  "use_cache": true,
40
  "vocab_size": 50257
41
  }
 
1
  {
2
+ "_name_or_path": "huggingartists/morgenshtern",
3
  "activation_function": "gelu_new",
4
  "architectures": [
5
  "GPT2LMHeadModel"
 
35
  }
36
  },
37
  "torch_dtype": "float32",
38
+ "transformers_version": "4.9.2",
39
  "use_cache": true,
40
  "vocab_size": 50257
41
  }
flax_model.msgpack CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:31606b5df45658b03ce97110b2fb5cd44b16f8c126a0aeca7a01cf0645b9696f
3
  size 497764120
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3d11136176323aafb2ef75f5525ef66852770716f4ced58db73503e0a7484137
3
  size 497764120
optimizer.pt CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:0e1d282a1d71170062ecdb062d5c638852f43c3cd839ff61cacc44240716c313
3
  size 995603825
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:df0396891551f1f573ada519077a8bf740ef79c72b09df5bc47b336bcfae1a01
3
  size 995603825
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:7be973f06905cedc248ba31c9027bfb7bed7f5330cc8eea151a16663cb260a5f
3
  size 510403817
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:14245ab76b0bcb59d2619dfecebb00c58ba368ba92e4979db4a5e50454a3f65d
3
  size 510403817
rng_state.pth CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a1a77f16b5380cb9f8dfa41c789f6f7b0d5f10d7040476401f9a011e850af2a4
3
- size 14503
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:aec18bd090ee79f7be43632d1d02335edd519ec6f49a3a61a5f244bf515bf8da
3
+ size 14567
scheduler.pt CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:8bc8c181f3eeffc22bed5df763ebe76f1c0ce2ad567f243d6703ceaa6e371773
3
  size 623
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cae94fe29647f1ab9ebfc3069e27ada487df598ed599d7fbb4182e85d06b41b1
3
  size 623
tokenizer_config.json CHANGED
@@ -1 +1 @@
1
- {"unk_token": "<|endoftext|>", "bos_token": "<|endoftext|>", "eos_token": "<|endoftext|>", "add_prefix_space": false, "model_max_length": 1024, "special_tokens_map_file": null, "name_or_path": "gpt2", "tokenizer_class": "GPT2Tokenizer"}
 
1
+ {"unk_token": "<|endoftext|>", "bos_token": "<|endoftext|>", "eos_token": "<|endoftext|>", "add_prefix_space": false, "model_max_length": 1024, "special_tokens_map_file": null, "name_or_path": "huggingartists/morgenshtern", "tokenizer_class": "GPT2Tokenizer"}
trainer_state.json CHANGED
@@ -1,8 +1,8 @@
1
  {
2
  "best_metric": null,
3
  "best_model_checkpoint": null,
4
- "epoch": 1.0,
5
- "global_step": 102,
6
  "is_hyper_param_search": false,
7
  "is_local_process_zero": true,
8
  "is_world_process_zero": true,
@@ -166,11 +166,167 @@
166
  "eval_samples_per_second": 40.618,
167
  "eval_steps_per_second": 5.077,
168
  "step": 100
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
169
  }
170
  ],
171
- "max_steps": 102,
172
- "num_train_epochs": 1,
173
- "total_flos": 106345857024000.0,
174
  "trial_name": null,
175
  "trial_params": null
176
  }
 
1
  {
2
  "best_metric": null,
3
  "best_model_checkpoint": null,
4
+ "epoch": 2.0,
5
+ "global_step": 232,
6
  "is_hyper_param_search": false,
7
  "is_local_process_zero": true,
8
  "is_world_process_zero": true,
 
166
  "eval_samples_per_second": 40.618,
167
  "eval_steps_per_second": 5.077,
168
  "step": 100
169
+ },
170
+ {
171
+ "epoch": 0.91,
172
+ "learning_rate": 3.0216830127274476e-06,
173
+ "loss": 1.8376,
174
+ "step": 105
175
+ },
176
+ {
177
+ "epoch": 0.95,
178
+ "learning_rate": 9.037005536513067e-07,
179
+ "loss": 1.7024,
180
+ "step": 110
181
+ },
182
+ {
183
+ "epoch": 0.99,
184
+ "learning_rate": 2.515656508272057e-08,
185
+ "loss": 1.7911,
186
+ "step": 115
187
+ },
188
+ {
189
+ "epoch": 1.03,
190
+ "learning_rate": 4.0213613921093164e-07,
191
+ "loss": 1.8512,
192
+ "step": 120
193
+ },
194
+ {
195
+ "epoch": 1.08,
196
+ "learning_rate": 2.0277372298297e-06,
197
+ "loss": 1.7573,
198
+ "step": 125
199
+ },
200
+ {
201
+ "epoch": 1.12,
202
+ "learning_rate": 4.8721970205680935e-06,
203
+ "loss": 1.7902,
204
+ "step": 130
205
+ },
206
+ {
207
+ "epoch": 1.16,
208
+ "learning_rate": 8.88343684654658e-06,
209
+ "loss": 1.7602,
210
+ "step": 135
211
+ },
212
+ {
213
+ "epoch": 1.21,
214
+ "learning_rate": 1.3988015692592823e-05,
215
+ "loss": 1.8606,
216
+ "step": 140
217
+ },
218
+ {
219
+ "epoch": 1.25,
220
+ "learning_rate": 2.009247481060283e-05,
221
+ "loss": 1.6102,
222
+ "step": 145
223
+ },
224
+ {
225
+ "epoch": 1.29,
226
+ "learning_rate": 2.708504883770769e-05,
227
+ "loss": 1.8574,
228
+ "step": 150
229
+ },
230
+ {
231
+ "epoch": 1.34,
232
+ "learning_rate": 3.483771208671411e-05,
233
+ "loss": 1.6927,
234
+ "step": 155
235
+ },
236
+ {
237
+ "epoch": 1.38,
238
+ "learning_rate": 4.320852254368187e-05,
239
+ "loss": 1.7203,
240
+ "step": 160
241
+ },
242
+ {
243
+ "epoch": 1.42,
244
+ "learning_rate": 5.204422065684016e-05,
245
+ "loss": 1.8592,
246
+ "step": 165
247
+ },
248
+ {
249
+ "epoch": 1.47,
250
+ "learning_rate": 6.118303533611755e-05,
251
+ "loss": 1.7338,
252
+ "step": 170
253
+ },
254
+ {
255
+ "epoch": 1.51,
256
+ "learning_rate": 7.045764578878282e-05,
257
+ "loss": 1.7386,
258
+ "step": 175
259
+ },
260
+ {
261
+ "epoch": 1.55,
262
+ "learning_rate": 7.969824496351964e-05,
263
+ "loss": 1.6874,
264
+ "step": 180
265
+ },
266
+ {
267
+ "epoch": 1.59,
268
+ "learning_rate": 8.873564851492995e-05,
269
+ "loss": 1.8691,
270
+ "step": 185
271
+ },
272
+ {
273
+ "epoch": 1.64,
274
+ "learning_rate": 9.740439236703416e-05,
275
+ "loss": 1.7808,
276
+ "step": 190
277
+ },
278
+ {
279
+ "epoch": 1.68,
280
+ "learning_rate": 0.00010554576216307802,
281
+ "loss": 1.8296,
282
+ "step": 195
283
+ },
284
+ {
285
+ "epoch": 1.72,
286
+ "learning_rate": 0.00011301069913603334,
287
+ "loss": 1.737,
288
+ "step": 200
289
+ },
290
+ {
291
+ "epoch": 1.77,
292
+ "learning_rate": 0.0001196625291967717,
293
+ "loss": 1.8,
294
+ "step": 205
295
+ },
296
+ {
297
+ "epoch": 1.81,
298
+ "learning_rate": 0.00012537946527356269,
299
+ "loss": 1.6787,
300
+ "step": 210
301
+ },
302
+ {
303
+ "epoch": 1.85,
304
+ "learning_rate": 0.000130056837088046,
305
+ "loss": 1.664,
306
+ "step": 215
307
+ },
308
+ {
309
+ "epoch": 1.9,
310
+ "learning_rate": 0.00013360900754314024,
311
+ "loss": 1.5839,
312
+ "step": 220
313
+ },
314
+ {
315
+ "epoch": 1.94,
316
+ "learning_rate": 0.0001359709406361119,
317
+ "loss": 1.8525,
318
+ "step": 225
319
+ },
320
+ {
321
+ "epoch": 1.98,
322
+ "learning_rate": 0.0001370993921901871,
323
+ "loss": 1.7228,
324
+ "step": 230
325
  }
326
  ],
327
+ "max_steps": 232,
328
+ "num_train_epochs": 2,
329
+ "total_flos": 241695129600000.0,
330
  "trial_name": null,
331
  "trial_params": null
332
  }
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:7f915a601b161294d09f0795c26c67c9dddff75c8768bf2451d5a65f6c1dd3a2
3
  size 2671
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e3f7789495a48c9ed1372c3a20ff68e3fd471ceffc8c79810dc223ac2f95c6ed
3
  size 2671