Towards Large Scale Geostatistical Methane Monitoring
with Part-based Object Detection

ICCV SEA Workshop
Adhémar
de Senneville
Xavi
Bou
Thibaud
Ehret
Rafael
Grompone
Nicolas
Dumelie
Jean-Louis
Bonne
Thomas
Lauvaux
Gabriele
Facciolo
ENS Paris-Saclay, Centre Borelli and Université de Reims
[Paper]
[GitHub]
[Interactive Map]
[Dataset]

Overview of our results in the French Grand Est region with the number of detected bio-digester sites in each department in 2023. We use our model to detect unknown bio-digester sites in large areas. On the right, we show (a) some examples of annotated bio-digester sites (from the validation set) with their sub-elements. (b) Shows predictions from our model; even with detection errors and a small training set, the part-based detector reliably identifies bio-digester sites at scale.

Abstract

Object detection is one of the main applications of computer vision in remote sensing imagery. Despite its increasing availability, the sheer volume of remote sensing data poses a challenge when detecting rare objects across large geographic areas. Paradoxically, this common challenge is crucial to many applications, such as estimating environmental impact of certain human activities at scale. In this paper, we propose to address the problem by investigating the methane production and emissions of bio-digesters in France. We first introduce a novel dataset containing bio-digesters, with small training and validation sets, and a large test set with a high imbalance towards observations without objects since such sites are rare. We develop a part-based method that considers essential bio-digester sub-elements to boost initial detections. To this end, we apply our method to new, unseen regions to build an inventory of bio-digesters. We then compute geostatistical estimates of the quantity of methane produced that can be attributed to these infrastructures in a given area at a given time.


Dataset

Interactive dataset image

To the best of our knowledge, we release the first large-scale satellite dataset of bio-digesters, including facility and part annotations. Our dataset comprises high-resolution aerial and satellite imagery of bio-digester sites within France’s Grand Est region across different sources (Aerial and SPOT). Ground truth labels are provided as geolocated segmentation annotations. For each bio-digester, we annotated three different classes: the whole bio-digester installation, the (anaerobic) digestion tanks, and biomass piles (feedstock storage areas). The latter two are smaller structures within bio-digester sites and are crucial for confirming site presence. For large-scale evaluation, a test split covers images of the entire French department of Marne, containing 21 known sites.

[Download dataset]


Proposed framework

Proposed framework. The initially annotated dataset is used to train a conventional object detector network. Then, we apply a part-based statistical method to boost detection performance, making large-scale detection bearable. Subsequently, an iterative process takes place, where the trained detector is applied over larger, diverse regions and the top K detections are manually verified and introduced to the dataset. The same process is then conducted with the resulting larger dataset.

1. Part-Based Boosting

AP illustration

Precision-Recall curves for different detection methods over all the Grand Est region. The probabilistic part-based method (orange) significantly boosts performance.

Humans naturally identify fine-grained objects like biodigesters by first spotting potential candidates, then confirming them by checking for characteristic parts—such as tanks and piles. Following this intuition, we define the detection confidence based on the presence of these parts within the detected bio-digester.

2. Iterative Refinement

With only a limited annotation starting point and vast unlabeled territory, bio-digester detection models suffer from high false positive rates, since digesters show visual similarity to industrial zones, oil storage areas, or farms. Hard negative mining tackles this by identifying high-confidence false positive detections via human inspection and adding these hard examples back into the training set.

Iteration mAP₅₀ Site Pile Tank #BG α
0 0.26 0.52 0.11 0.14 163 50 %
1 0.29 0.66 0.06 0.16 263 38 %
2 0.59 0.76 0.14 0.86 363 31 %

Hard Negative Mining validation performance across iterations. Conducting three iterations (including the initial training) using this “teach what not to detect” strategy yields a detector that is markedly more robust to difficult observations and significantly reduces false alarms.

Your browser does not support SVGss

Hard negative examples across three training iterations. At iteration 0, samples are randomly drawn from the Grand-Est region.


Results

Detection Results Across Regions

Region In-Database Overall
TP GT Recall Correct Total Precision
Grand Est 188 224 83.9% 251 311 80.7%
Marne 18 21 85.7% 28 36 77.8%
Bretagne 96 179 53.6% 131 185 70.8%

Detection results in Marne, Grand Est, and Bretagne sub-regions. Results are split between in-database detections (existing ground truth) and overall detections, including new discoveries. Precision and recall are computed using confirmed matches; overall counts include detections outside the reference database.

Our system was able to detect new sites outside of the reference training set and generalize to new regions, such as Bretagne, showing transferability and discovery capability.

Power Production Estimation in Bretagne

Power production prediction vs tank area

Power Production (kW) against Tank Area (m²) in Bretagne region (r² = 0.332).

Estimating power production over a large area provides valuable insight into potential methane leaks and atmospheric impact. Leveraging the generated bio-digester inventory of Bretagne, we estimate its aggregated production in kilowatts (kW), for which ground truth is available. We build a linear regression model that predicts power production based on the overall detected tank area at each site. While the model exhibits a large predictive error, it provides reasonable aggregated estimates over many observations.


[Try our code]


New Detections at scale in Bretagne


Paper

A. de Senneville, X. Bou, T. Ehret, R. Grompone, J-L. Bonne, N. Dumelie, T. Lauvaux, and G. Facciolo
Towards Large Scale Geostatistical Methane Monitoring with Part-based Object Detection
(hosted on ArXiv)

[Bibtex]


Acknowledgements

This template was originally made by Phillip Isola and Richard Zhang for a colorful ECCV project; the code can be found here.