File size: 2,356 Bytes
55be50e
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
0f3b1d5
 
 
 
 
55be50e
 
 
bb9e066
55be50e
 
 
58c725f
55be50e
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
6a8f6eb
0f3b1d5
55be50e
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
---
license: apache-2.0
base_model: distilbert-base-uncased
tags:
- generated_from_keras_callback
model-index:
- name: hfnlpmodels/eos_prediction_distilbert_1
  results: []
---

<!-- This model card has been generated automatically according to the information Keras had access to. You should
probably proofread and complete it, then remove this comment. -->

# hfnlpmodels/eos_prediction_distilbert_1

This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on an unknown dataset.
It achieves the following results on the evaluation set:
- Train Loss: 0.0879
- Train Accuracy: 0.9714
- Validation Loss: 0.2180
- Validation Accuracy: 0.9305
- Epoch: 2

## Model description

Predicts whether a given sentence is complete. Trained on sentences that were randomly truncate as '0' labels, hence some sentences which were grammatically correct may have been labelled as incomplete. Overall accuracy near 0.9 means that this was most likely a small factor.

## Intended uses & limitations

Found better performance on shorter sentences.

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training:
- optimizer: {'name': 'Adam', 'weight_decay': None, 'clipnorm': None, 'global_clipnorm': None, 'clipvalue': None, 'use_ema': False, 'ema_momentum': 0.99, 'ema_overwrite_frequency': None, 'jit_compile': True, 'is_legacy_optimizer': False, 'learning_rate': {'module': 'keras.optimizers.schedules', 'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 2e-05, 'decay_steps': 4165, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}, 'registered_name': None}, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-08, 'amsgrad': False}
- training_precision: float32

### Training results

| Train Loss | Train Accuracy | Validation Loss | Validation Accuracy | Epoch |
|:----------:|:--------------:|:---------------:|:-------------------:|:-----:|
| 0.2709     | 0.8942         | 0.2015          | 0.9211              | 0     |
| 0.1421     | 0.9505         | 0.2055          | 0.9318              | 1     |
| 0.0879     | 0.9714         | 0.2180          | 0.9305              | 2     |


### Framework versions

- Transformers 4.38.2
- TensorFlow 2.15.0
- Tokenizers 0.15.2