aisuko commited on
Commit
7e729db
1 Parent(s): e6a6c4b

ft-vit-base-patch16-224-in21k-on-food101-lora

Browse files
README.md CHANGED
@@ -20,8 +20,8 @@ should probably proofread and complete it, then remove this comment. -->
20
 
21
  This model is a fine-tuned version of [google/vit-base-patch16-224-in21k](https://huggingface.co/google/vit-base-patch16-224-in21k) on the food101 dataset.
22
  It achieves the following results on the evaluation set:
23
- - Loss: 0.2032
24
- - Accuracy: 0.938
25
 
26
  ## Model description
27
 
@@ -55,8 +55,8 @@ The following hyperparameters were used during training:
55
 
56
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
57
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
58
- | 2.2286 | 0.96 | 17 | 0.2821 | 0.914 |
59
- | 0.2266 | 1.92 | 34 | 0.2032 | 0.938 |
60
 
61
 
62
  ### Framework versions
 
20
 
21
  This model is a fine-tuned version of [google/vit-base-patch16-224-in21k](https://huggingface.co/google/vit-base-patch16-224-in21k) on the food101 dataset.
22
  It achieves the following results on the evaluation set:
23
+ - Loss: 2.4845
24
+ - Accuracy: 1.0
25
 
26
  ## Model description
27
 
 
55
 
56
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
57
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
58
+ | No log | 1.0 | 2 | 2.4845 | 1.0 |
59
+ | No log | 2.0 | 4 | 1.0485 | 1.0 |
60
 
61
 
62
  ### Framework versions
adapter_config.json CHANGED
@@ -12,7 +12,7 @@
12
  "layers_pattern": null,
13
  "layers_to_transform": null,
14
  "loftq_config": {},
15
- "lora_alpha": 16,
16
  "lora_dropout": 0.1,
17
  "megatron_config": null,
18
  "megatron_core": "megatron.core",
@@ -25,6 +25,8 @@
25
  "revision": null,
26
  "target_modules": [
27
  "query",
 
 
28
  "value"
29
  ],
30
  "task_type": null
 
12
  "layers_pattern": null,
13
  "layers_to_transform": null,
14
  "loftq_config": {},
15
+ "lora_alpha": 32,
16
  "lora_dropout": 0.1,
17
  "megatron_config": null,
18
  "megatron_core": "megatron.core",
 
25
  "revision": null,
26
  "target_modules": [
27
  "query",
28
+ "key",
29
+ "dense",
30
  "value"
31
  ],
32
  "task_type": null
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:ec29b1a584af9d87b709977625cfaeff56d132b7ac08db4bad87810146e333a9
3
- size 2677140
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7fd8401a192a4ddfa9fc56309ed5da3d251fef2cbc1de90274e43c0508a0dcb8
3
+ size 10948068
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:95907d32c4832fa8afb419253f987f866aa5c007ed34e06b200a92b458275153
3
  size 4411
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1ec7f67e55cc437105cadbc2322de5b0e6ca66d4f235fcc0c6577d215da8745a
3
  size 4411