Upload TFDistilBertForQuestionAnswering

by chibichibi - opened Aug 17, 2023

base: refs/heads/main

←

from: refs/pr/2

Discussion Files changed

+23

-39

Files changed (3) hide show

README.md +20 -36
config.json +1 -1
tf_model.h5 +2 -2

README.md CHANGED Viewed

@@ -1,70 +1,54 @@
 ---
-datasets:
-- squad
 license: apache-2.0
-tags:
 - generated_from_keras_callback
-metrics:
-- f1
-model-index:
 - name: transformers-qa
-  results:
-  - task:
-      name: "Question Answering"
-      type: question-answering
-    dataset:
-      type: squad
-      name: SQuAD
-      args: en
-    metrics:
-      []
-widget:
-  - context: "Keras is an API designed for human beings, not machines. Keras follows best practices for reducing cognitive load: it offers consistent & simple APIs, it minimizes the number of user actions required for common use cases, and it provides clear and actionable feedback upon user error."
 ---
 <!-- This model card has been generated automatically according to the information Keras had access to. You should
 probably proofread and complete it, then remove this comment. -->
-# Question Answering with Hugging Face Transformers and Keras 🤗❤️
-This model is a fine-tuned version of [distilbert-base-cased](https://huggingface.co/distilbert-base-cased) on SQuAD dataset.
 It achieves the following results on the evaluation set:
-- Train Loss: 0.9300
-- Validation Loss: 1.1437
-- Epoch: 1
 ## Model description
-Question answering model based on distilbert-base-cased, trained with 🤗Transformers + ❤️Keras.
 ## Intended uses & limitations
-This model is trained for Question Answering tutorial for Keras.io.
 ## Training and evaluation data
-It is trained on [SQuAD](https://huggingface.co/datasets/squad) question answering dataset. ⁉️
 ## Training procedure
-Find the notebook in Keras Examples [here](https://keras.io/examples/nlp/question_answering/). ❤️
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- optimizer: {'name': 'Adam', 'learning_rate': 5e-05, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False}
 - training_precision: mixed_float16
 ### Training results
 | Train Loss | Validation Loss | Epoch |
 |:----------:|:---------------:|:-----:|
-| 1.5145     | 1.1500          | 0     |
-| 0.9300     | 1.1437          | 1     |
 ### Framework versions
-- Transformers 4.16.0.dev0
-- TensorFlow 2.6.0
-- Datasets 1.16.2.dev0
-- Tokenizers 0.10.3

 ---
 license: apache-2.0
+base_model: distilbert-base-cased
+tags:
 - generated_from_keras_callback
+model-index:
 - name: transformers-qa
+  results: []
 ---
 <!-- This model card has been generated automatically according to the information Keras had access to. You should
 probably proofread and complete it, then remove this comment. -->
+# transformers-qa
+This model is a fine-tuned version of [distilbert-base-cased](https://huggingface.co/distilbert-base-cased) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Train Loss: 1.5435
+- Validation Loss: 1.1638
+- Epoch: 0
 ## Model description
+More information needed
 ## Intended uses & limitations
+More information needed
 ## Training and evaluation data
+More information needed
 ## Training procedure
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- optimizer: {'name': 'Adam', 'weight_decay': None, 'clipnorm': None, 'global_clipnorm': None, 'clipvalue': None, 'use_ema': False, 'ema_momentum': 0.99, 'ema_overwrite_frequency': None, 'jit_compile': True, 'is_legacy_optimizer': False, 'learning_rate': 5e-05, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False}
 - training_precision: mixed_float16
 ### Training results
 | Train Loss | Validation Loss | Epoch |
 |:----------:|:---------------:|:-----:|
+| 1.5435     | 1.1638          | 0     |
 ### Framework versions
+- Transformers 4.32.0.dev0
+- TensorFlow 2.12.0
+- Datasets 2.14.4
+- Tokenizers 0.13.3

config.json CHANGED Viewed

@@ -19,6 +19,6 @@
   "seq_classif_dropout": 0.2,
   "sinusoidal_pos_embds": false,
   "tie_weights_": true,
-  "transformers_version": "4.16.0.dev0",
   "vocab_size": 28996
 }

   "seq_classif_dropout": 0.2,
   "sinusoidal_pos_embds": false,
   "tie_weights_": true,
+  "transformers_version": "4.32.0.dev0",
   "vocab_size": 28996
 }

tf_model.h5 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d9a2797dee03f701b3d03f25de05f9144946933237ec2791e176babc0d288e6a
-size 260895816

 version https://git-lfs.github.com/spec/v1
+oid sha256:5b8183fd67b3ab9cbef687a4848c3a9143c2e7b2b63797c8589db0777fed0abc
+size 260895720