deanna-emery committed on
Commit
32b0e02
1 Parent(s): d869440

Upload TFT5ForConditionalGeneration

Files changed (3)
  1. README.md +14 -14
  2. config.json +1 -1
  3. tf_model.h5 +1 -1
README.md CHANGED
@@ -1,5 +1,5 @@
  ---
- base_model: deanna-emery/ASL_t5_word_epoch15_1204
  tags:
  - generated_from_keras_callback
  model-index:
@@ -12,16 +12,16 @@ probably proofread and complete it, then remove this comment. -->
 
  # ASL_t5_movinet_sentence
 
- This model is a fine-tuned version of [deanna-emery/ASL_t5_word_epoch15_1204](https://huggingface.co/deanna-emery/ASL_t5_word_epoch15_1204) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Train Loss: 0.2835
- - Train Top 1: 0.9429
- - Train Top 5: 0.9663
- - Validation Loss: 0.3456
- - Validation Top 1: 0.9379
- - Validation Top 5: 0.9602
- - Train Bleu: 0.8177
- - Train Gen Len: 14.9924
  - Epoch: 2
 
  ## Model description
@@ -41,16 +41,16 @@ More information needed
  ### Training hyperparameters
 
  The following hyperparameters were used during training:
- - optimizer: {'name': 'Adafactor', 'weight_decay': None, 'clipnorm': None, 'global_clipnorm': None, 'clipvalue': None, 'use_ema': False, 'ema_momentum': 0.99, 'ema_overwrite_frequency': None, 'jit_compile': True, 'is_legacy_optimizer': False, 'learning_rate': {'module': 'keras.optimizers.schedules', 'class_name': 'CosineDecay', 'config': {'initial_learning_rate': 0.0005, 'decay_steps': 1335, 'alpha': 0.0, 'name': None, 'warmup_target': 0.0005, 'warmup_steps': 400}, 'registered_name': None}, 'beta_2_decay': -0.8, 'epsilon_1': 1e-30, 'epsilon_2': 0.001, 'clip_threshold': 1.0, 'relative_step': True}
  - training_precision: float32
 
  ### Training results
 
  | Train Loss | Train Top 1 | Train Top 5 | Validation Loss | Validation Top 1 | Validation Top 5 | Train Bleu | Train Gen Len | Epoch |
  |:----------:|:-----------:|:-----------:|:---------------:|:----------------:|:----------------:|:----------:|:-------------:|:-----:|
- | 0.3382 | 0.9366 | 0.9593 | 0.3506 | 0.9365 | 0.9592 | 0.5417 | 17.4659 | 0 |
- | 0.3085 | 0.9398 | 0.9631 | 0.3450 | 0.9373 | 0.9600 | 0.6583 | 15.1629 | 1 |
- | 0.2835 | 0.9429 | 0.9663 | 0.3456 | 0.9379 | 0.9602 | 0.8177 | 14.9924 | 2 |
 
 
  ### Framework versions
 
  ---
+ base_model: deanna-emery/ASL_t5_movinet_sentence
  tags:
  - generated_from_keras_callback
  model-index:
 
 
  # ASL_t5_movinet_sentence
 
+ This model is a fine-tuned version of [deanna-emery/ASL_t5_movinet_sentence](https://huggingface.co/deanna-emery/ASL_t5_movinet_sentence) on an unknown dataset.
  It achieves the following results on the evaluation set:
+ - Train Loss: 0.2383
+ - Train Top 1: 0.9491
+ - Train Top 5: 0.9730
+ - Validation Loss: 0.3164
+ - Validation Top 1: 0.9416
+ - Validation Top 5: 0.9642
+ - Train Bleu: 0.5761
+ - Train Gen Len: 13.5547
  - Epoch: 2
 
  ## Model description
 
  ### Training hyperparameters
 
  The following hyperparameters were used during training:
+ - optimizer: {'name': 'Adafactor', 'weight_decay': None, 'clipnorm': None, 'global_clipnorm': None, 'clipvalue': None, 'use_ema': False, 'ema_momentum': 0.99, 'ema_overwrite_frequency': None, 'jit_compile': True, 'is_legacy_optimizer': False, 'learning_rate': {'module': 'keras.optimizers.schedules', 'class_name': 'CosineDecay', 'config': {'initial_learning_rate': 0.0005, 'decay_steps': 1998, 'alpha': 0.0, 'name': None, 'warmup_target': 0.0005, 'warmup_steps': 600}, 'registered_name': None}, 'beta_2_decay': -0.8, 'epsilon_1': 1e-30, 'epsilon_2': 0.001, 'clip_threshold': 1.0, 'relative_step': True}
  - training_precision: float32
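
The only hyperparameter change in this commit is the learning-rate schedule: `decay_steps` goes from 1335 to 1998 and `warmup_steps` from 400 to 600. As a rough illustration, the sketch below evaluates a cosine decay with linear warmup in plain Python. It assumes the behaviour documented for `keras.optimizers.schedules.CosineDecay` when `warmup_target` is set (a linear ramp from `initial_learning_rate` to `warmup_target`, then cosine decay over `decay_steps` counted from the end of warmup). The helper name `cosine_decay_with_warmup` is hypothetical, this is not the Keras implementation, and it ignores any further adjustment the Adafactor optimizer itself may apply.

```python
import math


def cosine_decay_with_warmup(step, initial_lr=0.0005, warmup_target=0.0005,
                             warmup_steps=600, decay_steps=1998, alpha=0.0):
    """Learning rate at `step` (hypothetical helper, not the Keras code).

    Defaults are the values from the new optimizer config in this commit.
    """
    if step < warmup_steps:
        # Linear ramp from initial_lr to warmup_target.
        return initial_lr + (warmup_target - initial_lr) * step / warmup_steps
    # Cosine decay; decay_steps is assumed to count from the end of warmup.
    progress = min(step - warmup_steps, decay_steps) / decay_steps
    cosine = 0.5 * (1.0 + math.cos(math.pi * progress))
    return warmup_target * ((1.0 - alpha) * cosine + alpha)


if __name__ == "__main__":
    for s in (0, 600, 1599, 2598):
        print(s, cosine_decay_with_warmup(s))
```

Because `initial_learning_rate` and `warmup_target` are both 0.0005, the warmup phase in this sketch is flat rather than a ramp; the rate then decays toward `alpha * warmup_target`, which is 0 here, by step 600 + 1998.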
 
  ### Training results
 
  | Train Loss | Train Top 1 | Train Top 5 | Validation Loss | Validation Top 1 | Validation Top 5 | Train Bleu | Train Gen Len | Epoch |
  |:----------:|:-----------:|:-----------:|:---------------:|:----------------:|:----------------:|:----------:|:-------------:|:-----:|
+ | 0.2845 | 0.9432 | 0.9669 | 0.3092 | 0.9416 | 0.9644 | 0.4712 | 14.4505 | 0 |
+ | 0.2619 | 0.9459 | 0.9697 | 0.3116 | 0.9417 | 0.9643 | 0.6262 | 13.9115 | 1 |
+ | 0.2383 | 0.9491 | 0.9730 | 0.3164 | 0.9416 | 0.9642 | 0.5761 | 13.5547 | 2 |
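
The three added rows can be compared per epoch with a few lines of Python. This is only an illustrative sketch: the values are copied from the rows above, while the tuple layout and variable names are made up for the example.

```python
# (epoch, train_loss, validation_loss, train_bleu), copied from the rows
# added in this commit. The layout is an illustrative choice.
rows = [
    (0, 0.2845, 0.3092, 0.4712),
    (1, 0.2619, 0.3116, 0.6262),
    (2, 0.2383, 0.3164, 0.5761),
]

# Epoch with the lowest validation loss and epoch with the highest Train Bleu.
best_val_epoch = min(rows, key=lambda r: r[2])[0]
best_bleu_epoch = max(rows, key=lambda r: r[3])[0]

# True when train loss falls every epoch while validation loss rises.
diverging = all(
    later[1] < earlier[1] and later[2] > earlier[2]
    for earlier, later in zip(rows, rows[1:])
)

print(best_val_epoch, best_bleu_epoch, diverging)
```

On these numbers, validation loss is lowest at epoch 0 and Train Bleu is highest at epoch 1, while the summary list in the card reports the epoch 2 values.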
 
 
  ### Framework versions
config.json CHANGED
@@ -1,5 +1,5 @@
  {
- "_name_or_path": "deanna-emery/ASL_t5_word_epoch15_1204",
  "architectures": [
  "T5ForConditionalGeneration"
  ],
 
  {
+ "_name_or_path": "deanna-emery/ASL_t5_movinet_sentence",
  "architectures": [
  "T5ForConditionalGeneration"
  ],
tf_model.h5 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:d2d2bd109946a4a581d0cd62281c7c0f3b0ff1865d84844a6947381d4f378f1b
  size 1089544048
 
  version https://git-lfs.github.com/spec/v1
+ oid sha256:856df96167528cc61d42046f91f4b8b5a2e18690219278131e9904ab2844c631
  size 1089544048
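
The `tf_model.h5` entry is a Git LFS pointer file, so the diff shows only its `oid` and `size` fields rather than the weights themselves. The sketch below parses the two pointers from this commit and compares them; `parse_lfs_pointer` is a hypothetical helper written for this example, not part of any Git LFS tooling.

```python
def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict.

    Hypothetical helper for illustration; it does no validation beyond
    converting the size field to an integer.
    """
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    fields["size"] = int(fields["size"])
    return fields


old = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:d2d2bd109946a4a581d0cd62281c7c0f3b0ff1865d84844a6947381d4f378f1b
size 1089544048
""")
new = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:856df96167528cc61d42046f91f4b8b5a2e18690219278131e9904ab2844c631
size 1089544048
""")

content_changed = old["oid"] != new["oid"]
same_size = old["size"] == new["size"]
print(content_changed, same_size)
```

The hash differs while the byte size is identical, which is what one would expect if the file holds new weights for an unchanged architecture, although the diff alone does not confirm that.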