Map It Anywhere (MIA): Empowering Bird’s Eye View Mapping using Large-scale Public Data

Cherie Ho* · Jiaye (Tony) Zou* · Omar Alama*
Sai Mitheran Jagadesh Kumar · Benjamin Chiang · Taneesh Gupta · Chen Wang
Nikhil Keetha · Katia Sycara · Sebastian Scherer

Using the MIA Data Engine
Downloading the MIA dataset
Training
Evaluation
Acknowledgement

Using the MIA data engine

0. Setting up the environment

Install docker by following the instructions on their website
Build the docker image mia/Dockerfile by running:
```
 docker build -t mia:release mia
```

Launch the container while mounting this repository to the container file system.

 docker run -v <PATH_TO_THIS_REPO>:/home/MapItAnywhere --network=bridge -it mia:release

1. Getting FPVs

The first stage of the MIA data engine is to get the first person images. First, if you want to pull your own locations, copy the example configuration from mia/conf/example.yaml and edit the cities list to specify the cities you want. Feel free to explore the other well-documented FPV options in the configuration file.

Once configuration is done simply run the following from inside your docker container with working dir set to this repo:

python3.9 -m mia.fpv.get_fpv --cfg mia/conf/<YOUR_CONFIG>.yaml

That's it ! The engine will now automatically fetch, filter, and process your FPV images. You may get a few errors specifying that some images were unable to be fetched due to permission limitations. That is normal and the engine will continue.

Once all your locations have been downloaded, you will see that parquet files, images, and raw_images, have been populated in your dataset_dir for each location. You can now move on to getting BEVs.

2. Getting BEVs

Once you have the FPV parquet dataframes downloaded, you are now ready to fetch and generate the BEV smenatic maps.

Edit the documented bev options in your configuration file to suit your use case. The defaults are tuned to what we used to produce the MIA datasets and you can use them as is.

Once configuration is done simply run the following from inside your docker container with working dir set to this repo:

python3.9 -m mia.bev.get_bev

The data engine will now fetch, process, and save the semantic masks.

You now have FPV-BEV pairs with associated metadata and camera parameters !

Note to get satellite imagery for comparison you must first download it by toggling the store_sat option in the configuration

3. (Optional) Visualize your data

You can visualize a few samples using the tool mia/misc_tools/vis_samples.py.

From inside the container with working dir set to this repo, run:

python3.9 -m mia/misc_tools/vis_samples --dataset_dir /home/mia_dataset_release --locations <LOCATION_OF_INTEREST>

If successful, the script will generate a PDF called compare.pdf in the pittsburgh directory. Upon openning you should see the metadata, FPVs, and BEVs of a few samples of the dataset.

Downloading the MIA dataset

Refer to mia/dataset.md for instructions.

Training

Pre-train with MIA Dataset

To pretrain using our paper configuration simply run:

python -m mapper.mapper data.split=<PATH TO SPLIT FILE> data.data_dir=<PATH TO MIA DATASET>

Finetune with NuScenes Dataset

To finetune using NuScenes Dataset with our paper configuration, run:

python -m mapper.mapper -cn mapper_nuscenes training.checkpoint=<PATH TO PRETRAINED MODEL> data.data_dir=<PATH TO NUSCENES DATA> data.map_dir=<PATH TO GENERATED NUSCENES MAP>

Reproduction

Dataset Setup

MIA: Follow download instructions in Downloading the MIA Dataset

NuScenes: Follow the data generation instructions in Mono-Semantic-Maps. To match the newest available information, we use v1.3 of the NuScenes' map expansion pack.

KITTI360-BEV: Follow the KITTI360-BEV dataset instructions in SkyEye

Inference

To generate MIA dataset prediction results(on test split), use:

python -m mapper.mapper data.split=<PATH TO SPLIT FILE> data.data_dir=<PATH TO MIA DATASET> training.checkpoint=<TRAINED WEIGHTS> training.eval=true

To specify location, add data.scenes in the argument. For example, for held-out cities data.scenes="[pittsburgh, houston]"

To Generate NuScenes dataset prediction results(on validation split), use:

python -m mapper.mapper -cn mapper_nuscenes training.checkpoint=<PATH TO PRETRAINED MODEL> data.data_dir=<PATH TO NUSCENES DATA> data.map_dir=<PATH TO GENERATED NUSCENES MAP> training.eval=true

To Generate KITTI360-BEV dataset prediction results (on validation split), use:

python -m mapper.mapper -cn mapper_kitti training.checkpoint=<PATH TO PRETRAINED MODEL> data.seam_root_dir=<PATH TO SEAM ROOT> data.dataset_root_dir=<PATH TO KITTI DATASET> training.eval=true

License

The FPVs were curated and processed from Mapillary and have the same CC by SA license. These include all images files, parquet dataframes, and dump.json. The BEVs were curated and processed from OpenStreetMap and has the same Open Data Commons Open Database (ODbL) License. These include all semantic masks and flood masks. The rest of the data is licensed under CC by SA license.

Code is licensed under CC by SA license.

Acknowledgement

We thank the authors of the following repositories for their open-source code:

Spaces:
Duplicated from mapitanywhere/Mapper

John6666
/

Mapper

Building