PawanKrGunjan/license_plate_recognizer

@@ -1,134 +1,80 @@
 ---
 base_model: microsoft/trocr-base-handwritten
 tags:
-- trocr
-- image-to-text
-- license-plate-number
 model-index:
 - name: license_plate_recognizer
-  results:
-  - task:
-      type: image-to-text
-      name: License Plate Recognition
-    dataset:
-      type: custom_dataset
-      name: Custom License Plate Dataset
-      config: default
-      split: test
-      revision: main
-    metrics:
-    - type: cer
-      value: 0.0231
-      name: Test CER
-      config: default
-      args:
-        max_order: 4
-    source:
-      name: Hugging Face Model Card
-      url: https://huggingface.co/PawanKrGunjan/license_plate_recognizer
-license: mit
-language:
-- en
-metrics:
-- cer
-library_name: transformers
-pipeline_tag: image-to-text
-datasets:
-- charliexu07/license_plates
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/pawankrgunjan/huggingface/runs/v5cu1qdh)
 # license_plate_recognizer
-This model is a fine-tuned version of [microsoft/trocr-base-handwritten](https://huggingface.co/microsoft/trocr-base-handwritten) specifically tailored for recognizing license plate numbers from images. The fine-tuning process has been optimized to accurately decode alphanumeric characters typically found on license plates.
-## Model Description
-The base model, `microsoft/trocr-base-handwritten`, is a Transformer-based OCR model designed for recognizing handwritten text. This fine-tuned version is adapted for license plate recognition, enhancing its ability to read and transcribe license plates from various sources, including images captured under different lighting and angles.
-## Intended Uses & Limitations
-### Intended Uses
-- **License Plate Recognition:** This model is designed to extract and transcribe alphanumeric characters from images of license plates. It can be used in various applications such as automated toll systems, parking management, and law enforcement.
-### Limitations
-- **Character Set:** The model is optimized for the specific alphanumeric characters commonly found on license plates. It may not perform well on text outside this domain.
-- **Environmental Factors:** While robust to typical variations in image quality, extreme conditions like very low light, heavy blurring, or unusual angles may reduce accuracy.
-## Training and Evaluation Data
-The model was fine-tuned on a dataset consisting of license plate images. The dataset includes a diverse set of license plates captured in various environments and lighting conditions, ensuring robustness in real-world applications. However, specific details about the dataset (e.g., size, source) are not provided here.
-## Training Procedure
-### Training Hyperparameters
 The following hyperparameters were used during training:
-- **learning_rate:** 2e-05
-- **train_batch_size:** 8
-- **eval_batch_size:** 8
-- **seed:** 42
-- **optimizer:** Adam with betas=(0.9, 0.999) and epsilon=1e-08
-- **lr_scheduler_type:** linear
-- **num_epochs:** 7
-### Training Results
 | Training Loss | Epoch | Step | Validation Loss | Cer    |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
-| 0.2605        | 1.0   | 254  | 0.0798          | 0.0253 |
-| 0.138         | 2.0   | 508  | 0.0660          | 0.0177 |
-| 0.0435        | 3.0   | 762  | 0.0645          | 0.0146 |
-| 0.0344        | 4.0   | 1016 | 0.0594          | 0.0173 |
-| 0.011         | 5.0   | 1270 | 0.0626          | 0.0160 |
-| 0.0021        | 6.0   | 1524 | 0.0567          | 0.0120 |
-| 0.0007        | 7.0   | 1778 | 0.0599          | 0.0137 |
-### Final Evaluation Metrics
-- **Loss:** 0.0653
-- **Cer:** 0.0231
-Certainly! Here’s the updated "How to Use the Model" section with the correct username:
-## How to Use the Model
-Here is how you can use this fine-tuned model in PyTorch to recognize license plate numbers:
-```python
-from transformers import TrOCRProcessor, VisionEncoderDecoderModel
-from PIL import Image
-import requests
-# Load an image of a license plate
-url = 'https://example.com/path/to/license_plate_image.jpg'
-image = Image.open(requests.get(url, stream=True).raw).convert("RGB")
-# Initialize the processor and the fine-tuned model
-processor = TrOCRProcessor.from_pretrained('PawanKrGunjan/license_plate_recognizer')
-model = VisionEncoderDecoderModel.from_pretrained('PawanKrGunjan/license_plate_recognizer')
-# Preprocess the image
-pixel_values = processor(images=image, return_tensors="pt").pixel_values
-# Generate text (license plate number)
-generated_ids = model.generate(pixel_values)
-generated_text = processor.batch_decode(generated_ids, skip_special_tokens=True)[0]
-print("Recognized License Plate Number:", generated_text)
-```
-In this example:
-1. Replace the `url` with the actual URL of an image containing a license plate.
-2. The model and processor are loaded from your fine-tuned model on the Hugging Face Hub (`PawanKrGunjan/license_plate_recognizer`).
-## Framework Versions
-- **Transformers:** 4.42.3
-- **Pytorch:** 2.1.2
-- **Datasets:** 2.20.0
-- **Tokenizers:** 0.19.1

 ---
 base_model: microsoft/trocr-base-handwritten
 tags:
+- generated_from_trainer
 model-index:
 - name: license_plate_recognizer
+  results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/pawankrgunjan/huggingface/runs/ajvl0e6b)
 # license_plate_recognizer
+This model is a fine-tuned version of [microsoft/trocr-base-handwritten](https://huggingface.co/microsoft/trocr-base-handwritten) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.0097
+- Cer: 0.0036
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 2e-05
+- train_batch_size: 8
+- eval_batch_size: 8
+- seed: 42
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- num_epochs: 23
+### Training results
 | Training Loss | Epoch | Step | Validation Loss | Cer    |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
+| 0.1485        | 1.0   | 397  | 0.0528          | 0.0182 |
+| 0.0843        | 2.0   | 794  | 0.0371          | 0.0089 |
+| 0.0552        | 3.0   | 1191 | 0.0417          | 0.0129 |
+| 0.0812        | 4.0   | 1588 | 0.0386          | 0.0115 |
+| 0.0315        | 5.0   | 1985 | 0.0198          | 0.0053 |
+| 0.0178        | 6.0   | 2382 | 0.0263          | 0.0084 |
+| 0.0341        | 7.0   | 2779 | 0.0179          | 0.0067 |
+| 0.0143        | 8.0   | 3176 | 0.0149          | 0.0080 |
+| 0.0047        | 9.0   | 3573 | 0.0055          | 0.0027 |
+| 0.0163        | 10.0  | 3970 | 0.0062          | 0.0022 |
+| 0.0045        | 11.0  | 4367 | 0.0049          | 0.0027 |
+| 0.0115        | 12.0  | 4764 | 0.0077          | 0.0053 |
+| 0.0014        | 13.0  | 5161 | 0.0031          | 0.0022 |
+| 0.0081        | 14.0  | 5558 | 0.0052          | 0.0031 |
+| 0.0001        | 15.0  | 5955 | 0.0056          | 0.0035 |
+| 0.0005        | 16.0  | 6352 | 0.0057          | 0.0027 |
+| 0.0009        | 17.0  | 6749 | 0.0053          | 0.0022 |
+| 0.0003        | 18.0  | 7146 | 0.0067          | 0.0027 |
+| 0.0001        | 19.0  | 7543 | 0.0044          | 0.0018 |
+| 0.0001        | 20.0  | 7940 | 0.0052          | 0.0018 |
+| 0.0           | 21.0  | 8337 | 0.0050          | 0.0018 |
+| 0.0           | 22.0  | 8734 | 0.0051          | 0.0018 |
+| 0.0           | 23.0  | 9131 | 0.0051          | 0.0018 |
+### Framework versions
+- Transformers 4.42.3
+- Pytorch 2.1.2
+- Datasets 2.20.0
+- Tokenizers 0.19.1
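
The `Cer` column and the evaluation-set figure in the new card are character error rates. The card does not say how the metric was computed. A common definition is the Levenshtein edit distance between prediction and reference, divided by the reference length. The following is a minimal pure-Python sketch under that assumption; the function names `edit_distance` and `cer` are illustrative and do not come from the repository.

```python
def edit_distance(pred: str, ref: str) -> int:
    """Levenshtein distance: minimum insertions, deletions and substitutions."""
    # prev[j] is the distance between the pred prefix seen so far and ref[:j].
    prev = list(range(len(ref) + 1))
    for i, p_ch in enumerate(pred, start=1):
        curr = [i]
        for j, r_ch in enumerate(ref, start=1):
            cost = 0 if p_ch == r_ch else 1
            curr.append(min(
                prev[j] + 1,         # drop p_ch from the prediction
                curr[j - 1] + 1,     # add r_ch to the prediction
                prev[j - 1] + cost,  # substitute, or match at zero cost
            ))
        prev = curr
    return prev[-1]


def cer(predictions, references):
    """Corpus-level CER: total edits divided by total reference characters."""
    total_edits = sum(edit_distance(p, r) for p, r in zip(predictions, references))
    total_chars = sum(len(r) for r in references)
    if total_chars == 0:
        raise ValueError("references must contain at least one character")
    return total_edits / total_chars
```

Under this definition, a CER of 0.0036 corresponds to between three and four character errors per 1,000 reference characters.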

generation_config.json CHANGED

@@ -3,7 +3,6 @@
   "decoder_start_token_id": 0,
   "early_stopping": true,
   "eos_token_id": 2,
-  "max_length": 128,
   "num_beams": 3,
   "pad_token_id": 1,
   "transformers_version": "4.42.3",

   "decoder_start_token_id": 0,
   "early_stopping": true,
   "eos_token_id": 2,
   "num_beams": 3,
   "pad_token_id": 1,
   "transformers_version": "4.42.3",
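
This hunk drops `max_length` from the saved generation settings, so the file no longer pins an output length limit. The effective limit presumably comes from elsewhere, such as the model configuration or the library default. Callers who need a predictable bound can pass one explicitly at call time. The following is a minimal sketch, assuming the `transformers`, `torch` and `Pillow` packages are installed and the Hub repository is reachable. The helper name `read_plate` and the value `max_new_tokens=16` are illustrative choices, not values taken from this commit.

```python
REPO_ID = "PawanKrGunjan/license_plate_recognizer"


def read_plate(image_path: str, max_new_tokens: int = 16) -> str:
    """Transcribe one plate image with an explicit cap on generated tokens."""
    # Imported lazily so this module loads without the heavy dependencies.
    from PIL import Image
    from transformers import TrOCRProcessor, VisionEncoderDecoderModel

    processor = TrOCRProcessor.from_pretrained(REPO_ID)
    model = VisionEncoderDecoderModel.from_pretrained(REPO_ID)

    image = Image.open(image_path).convert("RGB")
    pixel_values = processor(images=image, return_tensors="pt").pixel_values

    # max_new_tokens bounds the decoded length independently of the saved
    # generation config; 16 is an assumed upper bound for a plate string.
    generated_ids = model.generate(pixel_values, max_new_tokens=max_new_tokens)
    return processor.batch_decode(generated_ids, skip_special_tokens=True)[0]
```

The remaining settings shown in the hunk (`num_beams`, `early_stopping`, the token ids) are still read from `generation_config.json` when the model is loaded, so only the length bound is supplied by the caller here.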

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b70831db27e76688d7618656a4d9f252cefeef4853b96ca3587219a2a1804f2d
 size 1335747032

 version https://git-lfs.github.com/spec/v1
+oid sha256:5161a5497ff8a87587c30bf6b64cdb82087efb6b49475f319939794bebd34f00
 size 1335747032
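
A Git LFS pointer records the SHA-256 digest (`oid`) and the byte size of the real file, so a downloaded `model.safetensors` can be checked against the values above. The following is a minimal sketch using only the standard library; the function names and the local path in the usage note are illustrative.

```python
import hashlib
import os


def sha256_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Hash the file in chunks so a checkpoint over 1 GB never sits in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_lfs_pointer(path: str, expected_oid: str, expected_size: int) -> bool:
    """Return True when both the size and the SHA-256 agree with the pointer."""
    if os.path.getsize(path) != expected_size:
        return False  # cheap check first; skips hashing on an obvious mismatch
    return sha256_of_file(path) == expected_oid.lower()
```

For the new revision this would be called as `matches_lfs_pointer("model.safetensors", "5161a5497ff8a87587c30bf6b64cdb82087efb6b49475f319939794bebd34f00", 1335747032)`, with the path adjusted to wherever the file was downloaded.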