CVRP: A Rice Image Dataset with High-Quality Annotations for Image Segmentation and Plant Phenomics Research
Multi-cultivar and multi-view rice plant image dataset (CVRP) consists of 2,303 field images with their annotated masks and 123 indoor images of individual panicles.
Annotation Workflow
To optimize the process of annotation, we combine deep learning methods with manual curation. The workflow comprises two stages: manual annotation and model-based prediction followed by manual curation.
Getting Started
Recommend python 3.7, CUDA v11.3 and Pytorch 1.10.0.
Clone
pip install huggingface_hub
huggingface-cli download CVRPDataset/Model --local-dir /your/path/to/save/Model
cd Model
git clone https://github.com/open-mmlab/mmsegmentation.git -b v1.1.2 mmsegmentation
pip install -U openmim
mim install mmengine
mim install mmcv==2.0.0
cd mmsegmentation
pip install -v -e .
UI
We create a web user interface for annotation based on gradio.
pip install gradio
pip install "mmdet>=3.0.0rc4"
python app.py
The UI :
- Users can upload an image or use a sample image at β .Then, they can select one of four models at β‘. We recommend Mask2Former. After that, click Run.
- We provide two forms of segmentation results for download at β’.
Train and Test
- Creating a Dataset
Here is an example if you want to make own dataset.
β Directory structure of the dataset:
π CVRPDataset/ ββπ images/ ββπ labelme_jsons/
β‘ Convert labelme files to mask:
python run/labelme2mask.py
now, the structure looks like:
π CVRPDataset/ ββπ img_dir/ ββπ ann_dir/
β’ Split the training set and test set.
python run/split_dataset.py
now, the structure looks like:
π CVRPDataset/ ββπ img_dir/ β ββπ train/ β ββπ val/ ββπ ann_dir/ ββπ train/ ββπ val/
You can download our training set and validation set here.
- Dataset Configs
pip install -r run/requirements.txt
cd mmsegmentation/mmseg/datasets
rm -rf __init__.py # delete original file
wget https://huggingface.co/CVRPDataset/Model/resolve/main/dataset_configs/CVRP.py
wget https://huggingface.co/CVRPDataset/Model/resolve/main/dataset_configs/__init__.py
cd ../../configs/_base_/datasets
wget https://huggingface.co/CVRPDataset/Model/resolve/main/dataset_configs/CVRP_pipeline.py
β If you want to register your own dataset, import and register datasets in mmseg/datasets/__init__.py
from .CVRP import CVRPDataset
Add,
# other datasets
__all__ = ['CVRPDataset']
β‘ Register dataset class in `mmseg/datasets/CVRP.py'
class CVRPDataset(BaseSegDataset):
METAINFO = {
'classes':['background','panicle'],
'palette':[[127,127,127],[200,0,0]]
}
β’ Modify pipeline of data process in config/_base_/CVRP_pipeline.py
dataset_type = 'CVRPDataset'
data_root = 'CVRPDataset/'
you'll need to specify the paths for the train and evalution data directories.
# train_dataloader:
data_prefix=dict(img_path='img_dir/train', seg_map_path='ann_dir/train')
# val_dataloader:
data_prefix=dict(img_path='img_dir/val', seg_map_path='ann_dir/val')
- Model Configs
You can generate model config files using run_configs.py
cd mmsegmentation
mkdir 'work_dirs' 'CVRP_configs' 'outputs' 'CVRPDataset'
python ../run/run_configs.py --model_name deeplabv3plus -m configs/deeplabv3plus/deeplabv3plus_r101-d8_4xb4-160k_ade20k-512x512.py -d configs/_base_/datasets/CVRP_pipeline.py -c 2 -w work_dirs -s CVRP_configs
python ../run/run_configs.py --model_name knet -m configs/knet/knet-s3_swin-l_upernet_8xb2-adamw-80k_ade20k-512x512.py -d configs/_base_/datasets/CVRP_pipeline.py -c 2 -w work_dirs -s CVRP_configs
python ../run/run_configs.py --model_name mask2former -m configs/mask2former/mask2former_swin-l-in22k-384x384-pre_8xb2-160k_ade20k-640x640.py -d configs/_base_/datasets/CVRP_pipeline.py -c 2 -w work_dirs -s CVRP_configs
python ../run/run_configs.py --model_name segformer -m configs/segformer/segformer_mit-b5_8xb2-160k_ade20k-512x512.py -d configs/_base_/datasets/CVRP_pipeline.py -c 2 -w work_dirs -s CVRP_configs
Also, you can download model config files here.
cd CVRP_configs
wget https://huggingface.co/CVRPDataset/Model/resolve/main/model_configs/CVRP_mask2former.py
- Train
cd mmsegmentation
python tools/train.py CVRP_configs/CVRP_mask2former.py
Also, you can download checkpoint here.
cd work_dirs
mkdir CVRP_mask2former
cd CVRP_mask2former
wget https://huggingface.co/CVRPDataset/Model/resolve/main/checkpoint/Mask2Former.pth
- Test
python ../run/test.py -d CVRPDataset/val -m CVRP_configs/CVRP_mask2former.py -pth work_dirs/CVRP_mask2former/Mask2Former.pth -o outputs/CVRP_mask2former
LabelMe
If you need to manually adjust the annotation, you can use LabelMe.
python run/mask2json.py
pip install labelme==3.16.7
labelme
python run/json2png.py
Data and Code Availability
The CVRP dataset and accompanying codes are publicly available from Hugging Face at https://huggingface.co/datasets/CVRPDataset/CVRP & https://huggingface.co/CVRPDataset/Model, and Bioinformatics service in Nanjing Agricultural University at http://bic.njau.edu.cn/CVRP.html.
Acknowledgements
We thank Mr.Zhitao Zhu, Dr. Weijie Tang, and Dr. Yunhui Zhang for their technical support.