deanna-emery
/

ASL_t5_movinet_sentence

Text2Text Generation

Transformers

TensorFlow

generated_from_keras_callback

Inference Endpoints

Model card Files Files and versions Community

deanna-emery commited on Dec 7, 2023

Commit

32b0e02

•

1 Parent(s): d869440

Upload TFT5ForConditionalGeneration

Browse files

Files changed (3) hide show

README.md +14 -14
config.json +1 -1
tf_model.h5 +1 -1

README.md CHANGED Viewed

@@ -1,5 +1,5 @@
 ---
-base_model: deanna-emery/ASL_t5_word_epoch15_1204
 tags:
 - generated_from_keras_callback
 model-index:
@@ -12,16 +12,16 @@ probably proofread and complete it, then remove this comment. -->
 # ASL_t5_movinet_sentence
-This model is a fine-tuned version of [deanna-emery/ASL_t5_word_epoch15_1204](https://huggingface.co/deanna-emery/ASL_t5_word_epoch15_1204) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Train Loss: 0.2835
-- Train Top 1: 0.9429
-- Train Top 5: 0.9663
-- Validation Loss: 0.3456
-- Validation Top 1: 0.9379
-- Validation Top 5: 0.9602
-- Train Bleu: 0.8177
-- Train Gen Len: 14.9924
 - Epoch: 2
 ## Model description
@@ -41,16 +41,16 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- optimizer: {'name': 'Adafactor', 'weight_decay': None, 'clipnorm': None, 'global_clipnorm': None, 'clipvalue': None, 'use_ema': False, 'ema_momentum': 0.99, 'ema_overwrite_frequency': None, 'jit_compile': True, 'is_legacy_optimizer': False, 'learning_rate': {'module': 'keras.optimizers.schedules', 'class_name': 'CosineDecay', 'config': {'initial_learning_rate': 0.0005, 'decay_steps': 1335, 'alpha': 0.0, 'name': None, 'warmup_target': 0.0005, 'warmup_steps': 400}, 'registered_name': None}, 'beta_2_decay': -0.8, 'epsilon_1': 1e-30, 'epsilon_2': 0.001, 'clip_threshold': 1.0, 'relative_step': True}
 - training_precision: float32
 ### Training results
 | Train Loss | Train Top 1 | Train Top 5 | Validation Loss | Validation Top 1 | Validation Top 5 | Train Bleu | Train Gen Len | Epoch |
 |:----------:|:-----------:|:-----------:|:---------------:|:----------------:|:----------------:|:----------:|:-------------:|:-----:|
-| 0.3382     | 0.9366      | 0.9593      | 0.3506          | 0.9365           | 0.9592           | 0.5417     | 17.4659       | 0     |
-| 0.3085     | 0.9398      | 0.9631      | 0.3450          | 0.9373           | 0.9600           | 0.6583     | 15.1629       | 1     |
-| 0.2835     | 0.9429      | 0.9663      | 0.3456          | 0.9379           | 0.9602           | 0.8177     | 14.9924       | 2     |
 ### Framework versions

 ---
+base_model: deanna-emery/ASL_t5_movinet_sentence
 tags:
 - generated_from_keras_callback
 model-index:
 # ASL_t5_movinet_sentence
+This model is a fine-tuned version of [deanna-emery/ASL_t5_movinet_sentence](https://huggingface.co/deanna-emery/ASL_t5_movinet_sentence) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Train Loss: 0.2383
+- Train Top 1: 0.9491
+- Train Top 5: 0.9730
+- Validation Loss: 0.3164
+- Validation Top 1: 0.9416
+- Validation Top 5: 0.9642
+- Train Bleu: 0.5761
+- Train Gen Len: 13.5547
 - Epoch: 2
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- optimizer: {'name': 'Adafactor', 'weight_decay': None, 'clipnorm': None, 'global_clipnorm': None, 'clipvalue': None, 'use_ema': False, 'ema_momentum': 0.99, 'ema_overwrite_frequency': None, 'jit_compile': True, 'is_legacy_optimizer': False, 'learning_rate': {'module': 'keras.optimizers.schedules', 'class_name': 'CosineDecay', 'config': {'initial_learning_rate': 0.0005, 'decay_steps': 1998, 'alpha': 0.0, 'name': None, 'warmup_target': 0.0005, 'warmup_steps': 600}, 'registered_name': None}, 'beta_2_decay': -0.8, 'epsilon_1': 1e-30, 'epsilon_2': 0.001, 'clip_threshold': 1.0, 'relative_step': True}
 - training_precision: float32
 ### Training results
 | Train Loss | Train Top 1 | Train Top 5 | Validation Loss | Validation Top 1 | Validation Top 5 | Train Bleu | Train Gen Len | Epoch |
 |:----------:|:-----------:|:-----------:|:---------------:|:----------------:|:----------------:|:----------:|:-------------:|:-----:|
+| 0.2845     | 0.9432      | 0.9669      | 0.3092          | 0.9416           | 0.9644           | 0.4712     | 14.4505       | 0     |
+| 0.2619     | 0.9459      | 0.9697      | 0.3116          | 0.9417           | 0.9643           | 0.6262     | 13.9115       | 1     |
+| 0.2383     | 0.9491      | 0.9730      | 0.3164          | 0.9416           | 0.9642           | 0.5761     | 13.5547       | 2     |
 ### Framework versions

config.json CHANGED Viewed

@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "deanna-emery/ASL_t5_word_epoch15_1204",
   "architectures": [
     "T5ForConditionalGeneration"
   ],

 {
+  "_name_or_path": "deanna-emery/ASL_t5_movinet_sentence",
   "architectures": [
     "T5ForConditionalGeneration"
   ],

tf_model.h5 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d2d2bd109946a4a581d0cd62281c7c0f3b0ff1865d84844a6947381d4f378f1b
 size 1089544048

 version https://git-lfs.github.com/spec/v1
+oid sha256:856df96167528cc61d42046f91f4b8b5a2e18690219278131e9904ab2844c631
 size 1089544048