--- license: mit --- # Model: PhilEO Bench A novel evaluation framework for EO Foundation Models. ## Model Details ### Model Description The PhilEO Bench evaluation framework comprises of a testbed that can be used to test any EO Foundation Model. The three downstream tasks are building density estimation, road segmentation, and land cover classification. - **Developed by:** ESA, Phi-lab - **Model type:** Evaluation Framework - **License:** MIT The aim of Foundation Models is to improve the performance on several diverse downstream tasks. However, these models are often evaluated on a range of datasets with different characteristics (size, resolution, locations, satellite sources, and capture dates). There is also a focus on evaluating classification downstream tasks, while omitting image-to-image downstream tasks (such as segmentation). Therefore, it is challenging to fairly compare the performance of these burgeoning EO FMs and draw meaningful conclusions. To evaluate FMs, we propose the PhilEO Bench, an evaluation framework with the aim of providing a flexible, consistent, and fair benchmark for EO Sentinel-2 FMs. ## Uses The PhilEO Bench is used to evaluate EO Foundation Models. - We introduce a new flexible evaluation framework focused on generating comparable, fair, and reproducible results. ### Model Sources The basic links for the model are: - **Repository:** http://huggingface.co/ESA-philab/PhilEO-Bench - **Paper:** https://arxiv.org/pdf/2401.04464.pdf - **arXiv:** https://arxiv.org/abs/2401.04464 ## Citation Casper Fibaek, Luke Camilleri, Andreas Luyts, Nikolaos Dionelis, and Bertrand Le Saux, “PhilEO Bench: Evaluating Geo-Spatial Foundation Models,” arXiv:2401.04464, 2024.