rparasa/sf_20240219_v1

generated_from_keras_callback


sf_20240219_v1

This model is a fine-tuned version of nvidia/mit-b0 on an unknown dataset. At the final epoch (epoch 19, counting from 0), it reports the following training and validation losses:

Train Loss: 0.0640
Validation Loss: 0.0852
Epoch: 19

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

optimizer: {'name': 'Adam', 'weight_decay': None, 'clipnorm': None, 'global_clipnorm': None, 'clipvalue': None, 'use_ema': False, 'ema_momentum': 0.99, 'ema_overwrite_frequency': None, 'jit_compile': True, 'is_legacy_optimizer': False, 'learning_rate': 6e-05, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False}
training_precision: float32
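
As a minimal sketch, the logged optimizer settings could be turned back into a Keras optimizer roughly as follows. The helper names `adam_kwargs` and `build_optimizer` are hypothetical, and dropping `is_legacy_optimizer` before calling the constructor is an assumption about how that serialization-only field should be handled. The values themselves are copied from the configuration above.

```python
# Optimizer settings exactly as logged in the model card.
LOGGED_OPTIMIZER = {
    "name": "Adam",
    "weight_decay": None,
    "clipnorm": None,
    "global_clipnorm": None,
    "clipvalue": None,
    "use_ema": False,
    "ema_momentum": 0.99,
    "ema_overwrite_frequency": None,
    "jit_compile": True,
    "is_legacy_optimizer": False,
    "learning_rate": 6e-05,
    "beta_1": 0.9,
    "beta_2": 0.999,
    "epsilon": 1e-07,
    "amsgrad": False,
}


def adam_kwargs(config):
    """Return constructor keyword arguments from the logged config.

    Assumption: 'is_legacy_optimizer' is a serialization flag rather than
    a constructor argument, so it is dropped here.
    """
    return {k: v for k, v in config.items() if k != "is_legacy_optimizer"}


def build_optimizer(config=LOGGED_OPTIMIZER):
    """Build a Keras Adam optimizer from the logged settings."""
    # Imported lazily so the config helpers work without TensorFlow installed.
    import tensorflow as tf

    return tf.keras.optimizers.Adam(**adam_kwargs(config))
```

Calling `build_optimizer()` requires TensorFlow (the card lists 2.15.0); the result can then be passed to `model.compile(optimizer=...)`.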

Training results

Train Loss	Validation Loss	Epoch
0.3507	0.2338	0
0.1528	0.1519	1
0.1186	0.1249	2
0.1038	0.1116	3
0.1008	0.0938	4
0.0907	0.0920	5
0.0936	0.1053	6
0.0898	0.0914	7
0.0843	0.0833	8
0.0795	0.0828	9
0.0779	0.0875	10
0.0750	0.0818	11
0.0729	0.0877	12
0.0723	0.0769	13
0.0709	0.0939	14
0.0722	0.0787	15
0.0692	0.0796	16
0.0663	0.0736	17
0.0651	0.0727	18
0.0640	0.0852	19
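
The validation loss does not fall monotonically, so the final epoch is not necessarily the best one. A small sketch for checking this from the logged numbers follows; the helper `best_epoch` is a hypothetical name, and the values are copied from the table above.

```python
# (train_loss, validation_loss) per epoch, copied from the results table.
RESULTS = [
    (0.3507, 0.2338), (0.1528, 0.1519), (0.1186, 0.1249), (0.1038, 0.1116),
    (0.1008, 0.0938), (0.0907, 0.0920), (0.0936, 0.1053), (0.0898, 0.0914),
    (0.0843, 0.0833), (0.0795, 0.0828), (0.0779, 0.0875), (0.0750, 0.0818),
    (0.0729, 0.0877), (0.0723, 0.0769), (0.0709, 0.0939), (0.0722, 0.0787),
    (0.0692, 0.0796), (0.0663, 0.0736), (0.0651, 0.0727), (0.0640, 0.0852),
]


def best_epoch(results):
    """Return (epoch, validation_loss) for the lowest validation loss."""
    epoch = min(range(len(results)), key=lambda i: results[i][1])
    return epoch, results[epoch][1]


epoch, val_loss = best_epoch(RESULTS)
print(f"lowest validation loss {val_loss:.4f} at epoch {epoch}")
```

With these values the lowest validation loss, 0.0727, falls at epoch 18, while the reported final epoch 19 ends at 0.0852. The card does not state whether the published weights correspond to epoch 19 or to an earlier checkpoint.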

Framework versions

Transformers 4.37.0
TensorFlow 2.15.0
Datasets 2.1.0
Tokenizers 0.15.1
