---
title: YoloV3 on PASCAL VOC Dataset From Scratch (Slide for GradCam output)
emoji: πŸš€
colorFrom: gray
colorTo: blue
sdk: gradio
sdk_version: 3.39.0
app_file: app.py
pinned: false
license: mit
---
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
# [GitHub Repo](https://github.com/deepanshudashora/ERAV1/tree/master/session13)
# Training Procedure
1. The model is trained on a Tesla T4 (15 GB GPU memory)
2. Training is completed in two phases
3. The first phase runs for 20 epochs and the second phase for another 20 epochs
4. In the first phase the loss drops steadily; in the second phase it drops more slowly
5. The two training loops are run separately, with no evaluation during training other than the validation loss
6. The model is evaluated afterwards to obtain the final numbers
7. Lightning saves checkpoints in `.ckpt` format, so we convert the model to a plain PyTorch checkpoint by saving its state dict in `.pt`/`.pth` format
8. The conversion uses the following code
```python
import torch

# Load the Lightning checkpoint and keep only the model weights
best_model = torch.load(weights_path)
torch.save(best_model['state_dict'], 'best_model.pth')

# Rebuild the plain PyTorch model and load the extracted weights on CPU
litemodel = YOLOv3(num_classes=num_classes)
litemodel.load_state_dict(torch.load('best_model.pth', map_location='cpu'))
torch.save(litemodel.state_dict(), PATH)
```
9. The model starts overfitting on the dataset after 30 epochs
10. Future improvements
    1. Train the model in one shot instead of two separate phases
    2. Use a larger batch size (basically, earn more money and buy a better GPU)
    3. Data transformation also plays a vital role here
    4. Tune the OneCycle LR range appropriately for a better learning rate
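
The last improvement above can be sketched with PyTorch's built-in `OneCycleLR`; the values here (`max_lr`, `epochs`, `steps_per_epoch`) are illustrative assumptions, not the repo's actual settings:

```python
import torch
from torch.optim.lr_scheduler import OneCycleLR

# Tiny stand-in model; the real project uses YOLOv3
model = torch.nn.Conv2d(3, 16, 3)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)

# Assumed values for illustration only
scheduler = OneCycleLR(
    optimizer,
    max_lr=1e-3,           # peak LR; tuning this range is the suggested improvement
    epochs=40,
    steps_per_epoch=1000,  # normally len(train_loader)
    pct_start=0.3,         # fraction of the cycle spent ramping the LR up
)

# Call scheduler.step() once after every optimizer.step()
```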
# Data Transformation
Along with the transforms mentioned in the [config file](https://github.com/deepanshudashora/ERAV1/blob/master/session13/lightning_version/config.py), we also apply a **mosaic transform** to 75% of the images
[Reference](https://www.kaggle.com/code/nvnnghia/awesome-augmentation/notebook)
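
A minimal sketch of the mosaic idea, assuming plain NumPy image arrays at least as large as the output canvas; the repo's version also remaps bounding boxes, which is omitted here for brevity:

```python
import numpy as np

def mosaic(images, out_size=416, seed=None):
    """Combine four images into one mosaic around a random center point.

    images: list of four HxWxC uint8 arrays (an illustrative helper,
    not the repo's exact implementation).
    """
    rng = np.random.default_rng(seed)
    s = out_size
    # Random mosaic center, kept away from the borders
    xc = int(rng.uniform(s * 0.25, s * 0.75))
    yc = int(rng.uniform(s * 0.25, s * 0.75))
    canvas = np.full((s, s, images[0].shape[2]), 114, dtype=np.uint8)
    # Target regions: top-left, top-right, bottom-left, bottom-right
    regions = [(0, 0, xc, yc), (xc, 0, s, yc), (0, yc, xc, s), (xc, yc, s, s)]
    for img, (x1, y1, x2, y2) in zip(images, regions):
        h, w = y2 - y1, x2 - x1
        # Crop a patch of the required size from the source image
        canvas[y1:y2, x1:x2] = img[:h, :w]
    return canvas
```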
# Accuracy Report
```
Class accuracy is: 82.999725%
No obj accuracy is: 96.828300%
Obj accuracy is: 76.898473%
MAP: 0.29939851760864258
```
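
These three numbers are cell-level YOLOv3 accuracies. A rough sketch of how they can be computed, assuming a simplified `(N, S, S, 6)` layout of `[objectness, x, y, w, h, class]`; the repo's actual tensor layout and thresholds may differ:

```python
import torch

def check_accuracy(preds, target, threshold=0.5):
    """Sketch of the class / no-obj / obj accuracy numbers reported above."""
    obj = target[..., 0] == 1    # cells that contain an object
    noobj = target[..., 0] == 0  # cells that do not

    # Class accuracy: correct class on object cells only
    class_acc = (preds[..., 5][obj].round() == target[..., 5][obj]).float().mean()

    # Objectness accuracy: does the predicted objectness cross the threshold?
    obj_preds = torch.sigmoid(preds[..., 0]) > threshold
    obj_acc = (obj_preds[obj] == target[..., 0][obj].bool()).float().mean()
    noobj_acc = (obj_preds[noobj] == target[..., 0][noobj].bool()).float().mean()
    return class_acc, obj_acc, noobj_acc
```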
# [Training Logs](https://github.com/deepanshudashora/ERAV1/blob/master/session13/lightning_version/merged_logs.csv)
#### For faster execution, validation runs only once at the end of the first 20-epoch phase, and then every 5 epochs until epoch 40
```
Unnamed: 0 lr-Adam step train_loss epoch val_loss
6576 6576 NaN 164299 4.186745 39.0 NaN
6577 6577 0.000132 164349 NaN NaN NaN
6578 6578 NaN 164349 2.936086 39.0 NaN
6579 6579 0.000132 164399 NaN NaN NaN
6580 6580 NaN 164399 4.777130 39.0 NaN
6581 6581 0.000132 164449 NaN NaN NaN
6582 6582 NaN 164449 3.139145 39.0 NaN
6583 6583 0.000132 164499 NaN NaN NaN
6584 6584 NaN 164499 4.596097 39.0 NaN
6585 6585 0.000132 164549 NaN NaN NaN
6586 6586 NaN 164549 5.587294 39.0 NaN
6587 6587 0.000132 164599 NaN NaN NaN
6588 6588 NaN 164599 4.592830 39.0 NaN
6589 6589 0.000132 164649 NaN NaN NaN
6590 6590 NaN 164649 3.914468 39.0 NaN
6591 6591 0.000132 164699 NaN NaN NaN
6592 6592 NaN 164699 3.180615 39.0 NaN
6593 6593 0.000132 164749 NaN NaN NaN
6594 6594 NaN 164749 5.772174 39.0 NaN
6595 6595 0.000132 164799 NaN NaN NaN
6596 6596 NaN 164799 2.894014 39.0 NaN
6597 6597 0.000132 164849 NaN NaN NaN
6598 6598 NaN 164849 4.473828 39.0 NaN
6599 6599 0.000132 164899 NaN NaN NaN
6600 6600 NaN 164899 6.397766 39.0 NaN
6601 6601 0.000132 164949 NaN NaN NaN
6602 6602 NaN 164949 3.789242 39.0 NaN
6603 6603 0.000132 164999 NaN NaN NaN
6604 6604 NaN 164999 5.182691 39.0 NaN
6605 6605 0.000132 165049 NaN NaN NaN
6606 6606 NaN 165049 4.845749 39.0 NaN
6607 6607 0.000132 165099 NaN NaN NaN
6608 6608 NaN 165099 3.672542 39.0 NaN
6609 6609 0.000132 165149 NaN NaN NaN
6610 6610 NaN 165149 4.230726 39.0 NaN
6611 6611 0.000132 165199 NaN NaN NaN
6612 6612 NaN 165199 4.625024 39.0 NaN
6613 6613 0.000132 165249 NaN NaN NaN
6614 6614 NaN 165249 4.549682 39.0 NaN
6615 6615 0.000132 165299 NaN NaN NaN
6616 6616 NaN 165299 4.040627 39.0 NaN
6617 6617 0.000132 165349 NaN NaN NaN
6618 6618 NaN 165349 4.857126 39.0 NaN
6619 6619 0.000132 165399 NaN NaN NaN
6620 6620 NaN 165399 3.081895 39.0 NaN
6621 6621 0.000132 165449 NaN NaN NaN
6622 6622 NaN 165449 3.945353 39.0 NaN
6623 6623 0.000132 165499 NaN NaN NaN
6624 6624 NaN 165499 3.203420 39.0 NaN
6625 6625 NaN 165519 NaN 39.0 3.081895
```
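
The validation cadence described above can be set per phase through Lightning's `Trainer`. This is a configuration sketch; the exact `Trainer` arguments used in the repo may differ:

```python
import pytorch_lightning as pl

# Phase 1: validate only once, at the end of the 20-epoch run
trainer_phase1 = pl.Trainer(max_epochs=20, check_val_every_n_epoch=20)

# Phase 2: resume training and validate every 5 epochs up to epoch 40
trainer_phase2 = pl.Trainer(max_epochs=40, check_val_every_n_epoch=5)
```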
# Results
## For epochs 0 to 19
![train_logs_1.png](images/train_logs_1.png)
## For epochs 20 to 39
![train_logs_2.png](images/train_logs_2.png)
## Full training logs for loss
![full_training.png](images/full_training.png)