ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Publications Copernicus
Download
Citation
Volume IV-4/W8
ISPRS Ann. Photogramm. Remote Sens. Spatial Inf. Sci., IV-4/W8, 3–10, 2019
https://doi.org/10.5194/isprs-annals-IV-4-W8-3-2019
© Author(s) 2019. This work is distributed under
the Creative Commons Attribution 4.0 License.
ISPRS Ann. Photogramm. Remote Sens. Spatial Inf. Sci., IV-4/W8, 3–10, 2019
https://doi.org/10.5194/isprs-annals-IV-4-W8-3-2019
© Author(s) 2019. This work is distributed under
the Creative Commons Attribution 4.0 License.

  23 Sep 2019

23 Sep 2019

GENERATING 3D CITY MODELS BASED ON THE SEMANTIC SEGMENTATION OF LIDAR DATA USING CONVOLUTIONAL NEURAL NETWORKS

A. Agoub, V. Schmidt, and M. Kada A. Agoub et al.
  • Institute of Geodesy and Geoinformation Science (IGG), Technische Universität Berlin, Straße des 17. Juni 135, 10623 Berlin, Germany

Keywords: Deep Learning, Semantic Segmentation, Building Footprint Extraction, 3D Building Reconstruction, CNN

Abstract. Virtual city models are important for many applications such as urban planning, virtual and augmented reality, disaster management, and gaming. Urban features such as buildings, roads, and trees are essential components of these models and are subject to frequent change and alteration. It is laborious to manually build and update virtual city models, due to a large number of instances and temporal changes on such features. The increase of publicly available spatial data provides an important source for pipelines that automate virtual city model generation. The large quantity of data also opens an opportunity to use Deep Learning (DL) as a technique that minimizes the need for expert domain knowledge. In addition, many Deep Learning models calculations can be parallelized on modern hardware such as graphical processing units, which reduces the computation time substantially.

We explore the opportunity of using publicly available data in computing multiple thematic data layers from Digital Surface Models (DSMs) using an automatic pipeline that is powered by a semantic segmentation network. To evaluate this design, we implement our pipeline using multiple Convolutional Neural Networks (CNN) with an encoder-decoder architecture. We produce a variety of two and three-dimensional thematic data. We focus our evaluation on the pipeline’s ability to produce accurate building footprints. In our experiments we vary the depths, the number of input channels and data resolutions of the evaluated networks. Our experiments process public data that is provided by New York City.