Skip to content

OpenBuildingMapGlobal building footprint dataset

A worldwide dataset of buildings with height and occupancy types, combining OpenStreetMap, Google Open Buildings, and Microsoft ML Building Footprints.

Dataset

OpenBuildingMap (OBM) is a global assessment of building footprints organised in a tiled grid. For each building the dataset provides geometry, occupancy type, height, and floorspace. It covers all buildings from:

  • OpenStreetMap (OSM planet dump, 1 July 2024)
  • Google Open Buildings (Sirko et al., 2021)
  • Microsoft Global ML Building Footprints (Microsoft, 2023)

Occupancy type and number of stories are identified using OSM land use and points of interest. Building height is estimated using the Global Human Settlement Built-up Characteristics 2023A Layer (GHSL). The dataset is described in full in the data description (PDF).

Citation

When using the data, please cite:

Oostwegel, L. J. N.; Schorlemmer, D.; Lingner, L.; Evaz Zadeh, T. (2025): OpenBuildingMap. GFZ Data Services. https://doi.org/10.5880/GFZ.LKUT.2025.002

The data are supplementary material to:

Oostwegel, L. J. N.; Schorlemmer, D.; Guéguen, P. (2025): From Footprints to Functions: A Comprehensive Global and Semantic Building Footprint Dataset. Scientific Data. https://doi.org/10.1038/s41597-025-06132-z

License: Open Data Commons Open Database License (ODbL) v1.0

File format

The dataset is distributed per country as GeoPackage (.gpkg.bz2), organised per level-6 Quadkey tile (1 271 files total). Each file contains two tables:

Building table

ColumnTypeDescription
idIntegerBuilding ID. OSM ID for OSM buildings; generated for Google/Microsoft.
floorspaceRealEstimated floor area.
occupancyStringOccupancy type per GEM Building Taxonomy v2.0.
heightStringHeight / number of stories per GEM taxonomy.
quadkeyStringLevel-18 Quadkey ID (Web Mercator, EPSG:3857).
source_idInteger0 = OpenStreetMap · 1 = Google · 2 = Microsoft
relation_idIntegerOSM relation ID if the building is part of a relation.
last_updateDatetimeTime of processing.

Metadata table

ColumnTypeDescription
licenseStringAlways ODbL v1.0.
number_of_buildingsIntegerTotal buildings in the file.
percentage_known_occupancyRealShare of buildings with a non-UNK occupancy type.
percentage_known_heightRealShare of buildings with a height estimate.
percentage_known_floorspaceRealShare of buildings with a floorspace estimate.
percentage_source_openstreetmapRealShare sourced from OpenStreetMap.
percentage_source_googleRealShare sourced from Google Open Buildings.
percentage_source_microsoftRealShare sourced from Microsoft ML Building Footprints.

Data sources

Funding

This work was funded by the European Union through the following projects:


We like to be fully transparent about the AI tools we use. This website was built with the assistance of Claude (Anthropic).