Update README.md
Browse files
README.md
CHANGED
|
@@ -18,14 +18,14 @@ verification, and loss/theft prevention. This model is best suited for stores
|
|
| 18 |
whose inventory closely mathes the training data, as performance will degrade
|
| 19 |
on unseen brands or product types not represented in the training data.
|
| 20 |
|
| 21 |
-
|
| 22 |
## Training Data
|
| 23 |
|
| 24 |
The training dataset is a subset of the **RPC-Dataset**
|
| 25 |
([rpc-dataset.github.io](https://rpc-dataset.github.io/)),
|
| 26 |
-
a large-scale retail product checkout dataset
|
|
|
|
| 27 |
The working dataset is a subset of this,
|
| 28 |
-
consisting of 9,616 images across
|
| 29 |
Roboflow
|
| 30 |
([universe.roboflow.com/groceries-jxjfd/grocery-goods](https://universe.roboflow.com/groceries-jxjfd/grocery-goods)).
|
| 31 |
|
|
@@ -37,7 +37,6 @@ to improve generalization, reduce class imbalance, and better reflect how a self
|
|
| 37 |
For example, all milk classes were merged into a single `milk` class, reducing the total class count from 200 to 17.
|
| 38 |
Random samples were reviewed after relabeling to validate annotation quality, with no corrections needed.
|
| 39 |
|
| 40 |
-
|
| 41 |
This is the final dataset used for training, after the annotation process
|
| 42 |
([https://app.roboflow.com/bdata-497-advanced-topics-in-dv-nqagm/grocery-goods-ezyyb](https://universe.roboflow.com/bdata-497-advanced-topics-in-dv-nqagm/grocery-goods-ezyyb)).
|
| 43 |
|
|
@@ -85,20 +84,31 @@ The following augmentations were applied during training to simulate real-world
|
|
| 85 |
- Fresh and unpackaged produce (such as fruits or vegetables) are not represented in the dataset
|
| 86 |
- Limited lighting variation — real checkout environments may have inconsistent lighting not well represented in training images
|
| 87 |
|
| 88 |
-
|
| 89 |
## Training Procedure
|
| 90 |
|
| 91 |
-
- **
|
| 92 |
-
- **
|
| 93 |
-
- **
|
| 94 |
-
- **
|
| 95 |
-
- **
|
| 96 |
-
- **
|
| 97 |
-
|
| 98 |
-
|
| 99 |
-
|
| 100 |
-
|
| 101 |
-
|
| 102 |
-
|
| 103 |
-
|
| 104 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 18 |
whose inventory closely mathes the training data, as performance will degrade
|
| 19 |
on unseen brands or product types not represented in the training data.
|
| 20 |
|
|
|
|
| 21 |
## Training Data
|
| 22 |
|
| 23 |
The training dataset is a subset of the **RPC-Dataset**
|
| 24 |
([rpc-dataset.github.io](https://rpc-dataset.github.io/)),
|
| 25 |
+
a large-scale retail product checkout dataset consisting of 83699 images
|
| 26 |
+
across 200 grocery product classes.
|
| 27 |
The working dataset is a subset of this,
|
| 28 |
+
consisting of 9,616 images across the same 200 classes, sourced via
|
| 29 |
Roboflow
|
| 30 |
([universe.roboflow.com/groceries-jxjfd/grocery-goods](https://universe.roboflow.com/groceries-jxjfd/grocery-goods)).
|
| 31 |
|
|
|
|
| 37 |
For example, all milk classes were merged into a single `milk` class, reducing the total class count from 200 to 17.
|
| 38 |
Random samples were reviewed after relabeling to validate annotation quality, with no corrections needed.
|
| 39 |
|
|
|
|
| 40 |
This is the final dataset used for training, after the annotation process
|
| 41 |
([https://app.roboflow.com/bdata-497-advanced-topics-in-dv-nqagm/grocery-goods-ezyyb](https://universe.roboflow.com/bdata-497-advanced-topics-in-dv-nqagm/grocery-goods-ezyyb)).
|
| 42 |
|
|
|
|
| 84 |
- Fresh and unpackaged produce (such as fruits or vegetables) are not represented in the dataset
|
| 85 |
- Limited lighting variation — real checkout environments may have inconsistent lighting not well represented in training images
|
| 86 |
|
|
|
|
| 87 |
## Training Procedure
|
| 88 |
|
| 89 |
+
- **Framework**: Ultralytics YOLOv11n
|
| 90 |
+
- **Hardware**: A100 GPU in Google Colab
|
| 91 |
+
- **Epochs**: 50
|
| 92 |
+
- **Batch Size**: 64
|
| 93 |
+
- **Image Size**: 640x640
|
| 94 |
+
- **Patience**: 50
|
| 95 |
+
- **Training Time** ~36.5 minutes (2,189.69 seconds)
|
| 96 |
+
- **Preprocessing**: Augmentations applied at training time (see Data Augmentation section)
|
| 97 |
+
|
| 98 |
+
## Evaluation Results
|
| 99 |
+
|
| 100 |
+
### Comprehensive Metrics
|
| 101 |
+
All files from The model was evaluated on a held-out test set of 1,928 images
|
| 102 |
+
containing 9,825 instances across all 17 classes. The model demonstrates
|
| 103 |
+
strong performance across all metrics, achieving near-perfect precision
|
| 104 |
+
and recall with a mAP50 of 0.992.
|
| 105 |
+
|
| 106 |
+
| Metric | Value |
|
| 107 |
+
| --------- | ----- |
|
| 108 |
+
| Precision | 0.989 |
|
| 109 |
+
| Recall | 0.985 |
|
| 110 |
+
| mAP50 | 0.992 |
|
| 111 |
+
| mAP50-95 | 0.862 |
|
| 112 |
+
|
| 113 |
+
###
|
| 114 |
+
|