Deep learning-based WWR and floor count extraction from façade images to improve UBEM
Journal of Physics: Conference Series, CISBAT 2025
Abstract. Urban Building Energy Modeling (UBEM) enables the energy performance evaluation of entire neighborhoods and cities, playing an important role in achieving net-zero carbon objectives. However, bottom-up UBEM frameworks rely on large amounts of input data, such as floor counts, window-to-wall (WWR) ratio, and material constructions, which are often difficult to obtain or missing from open data sources like OpenStreetMap. This study proposes a deep-learning (DL)-based computer vision workflow to automate the extraction of two features, namely (1) floor count and (2) window-to-wall ratio, from street-level façade imagery. The extracted features are integrated into the UBEM platform City Energy Analyst to quantify the differences in simulated annual energy demand between the standard model, the proposed workflow, and the ground truth. The method is applied to a district in Zurich, Switzerland, which includes 24 buildings. The UBEM model with the DL-predicted floor counts and WWR underestimates the total energy demand by 9% relative to the model using true values, whereas the model with default values underestimates it by 31%. Although issues such as partial occlusions and image perspective reduce accuracy, the proposed method demonstrates a scalable approach to reduce manual data collection and improve urban-scale energy assessments.
Keywords: UBEM, deep learning, semantic segmentation, FCN, Segformer, Grounded-SAM-2