Population Data

Residential buildings

Script: your-GOAT-directory/app/data_preparation/SQL/buildings_residential.sql

This script identifies all residential buildings at a specified location based on land use data and buildings data. Attributes of the buildings are additionally used. However, as always the accuracy depends largely on the data quality of your custom and OSM data.

The scripts allows you to customize the filters that identify residential buildings. The default values are customized to the City of Munich, therefore it is important to double-check them if you are in another context. The following filters can be adjusted in the goat-config.yaml:

  • custom_landuse_no_residents: if you have a custom landuse-layer you can define landuse categories that are not residential
  • osm_landuse_no_residents: osm-tag landuse that is not for residents, this filter is combined in case you have custom landuse data
  • building_types_potentially_residential: osm-tag building that should not be excluded – Note: many buildings are just called building=yes
  • building_types_residential: osm-tag building that are for sure residential e.g. building=residential (to know more about the building types visit https://wiki.openstreetmap.org/wiki/Key:building)
  • tourism_no_residents: osm-tag tourism that has no residents (to know more about the key tourism visit https://wiki.openstreetmap.org/wiki/Tourism )
  • amenity_no_residents: osm-tag amenity that has no residents (to know more about the amenities visit https://wiki.openstreetmap.org/wiki/Key:amenity)
  • minimum_building_size_residential: all buildings that have a smaller size than this are excluded (very often garages etc.)

the script furthermore extracts building levels for the residential buildings, in case no building levels are in the osm-attributes (unfortunately this is usually the case) a default value for the building levels can be adjusted in the goat-config.yaml.

Note: these building levels are used for the census extrapolation or the population disaggregation.

The output of this function is a table called buildings_residential. It should contain only residential buildings. But as always the accuracy very much depends on the data quality.

Residential buildings

The script buildings_residential.sql is the basis for the census extrapolation.

Census extrapolation

Custom data


landuse.shp (optional)

fixed_population.shp (optional)

Script: your-GOAT-directory/app/data_preparation/SQL/census.sql

In the case you have census data in your study area but you know the data is outdated. GOAT has an script your-GOAT-directory/app/data_preparation/SQL/census.sql that allows you to update the census grids based on current population numbers in your whole study area, which you provided in the layer-table study_area. A prerequisite is that your population numbers for your study_area are more recent, than your census data. Currently the script does only work if the population numbers have increased in the timespan that is lying between the reference year of the census data and the reference year of the administrative boundaries of your study_area.

The script checks for areas where new development took place and estimates based on average gross living area how many residents live in the affected grids. The same can be customized in the goat-config.yaml. This procedure also makes use of the extracted residential buildings as described in your-GOAT-directory\app\data_preparation\SQL\buildings_residential.sql

This script is designed in a way that allows to iteratively improve the results. It is recommended to simply run the setup of GOAT and check for the quality of the population data. There will be following tables if the script was running successfully:

buildings_to_map: all buildings of buildings_residential that are located within a grid cell in which no population from the census is documented.

This means that the scripts assumes that these buildings were built after the census and should be reviewed if they are mapped correctly. The building type, building levels and land use of the area are of particular importance. It is recommended to map this information directly in the OSM database (See making changes in OSM). When updating GOAT you can get your most recent changes in OpenStreetMap.

census_split_new_development: all grid cells where new residential development took place.

The script is following simplistic rules and assigns new population to places based on the building volume (area x number of floors) it happens that there are assign very high population numbers to big buildings. Often this is correct, however in case of e.g. Villas in the forest this could be wrong. Therefore, you have the possibility to upload a layer where you are able to fix the population numbers for certain buildings. You can simply create a shapefile with the structure of your-GOAT-directory/app/database/data/fixed_populatipon.shp mentioned in Data Preparation. The result of the this process is an updated census grid that is used for further processing.

population: the population numbers from the updated census grids is distributed to the associated buildings based on there build-up area, if a building has several addresses the population is equally split between the addresses.

Currently in this step that involves all buildings the number of building levels is not included as this information is very often not available for all buildings, however in case of small population grid (e.g. Germany 100mx100m) it can be assumed that the buildings in one grid cell have a very similiar number of levels. For grid cells for which no population was recorded in the census, the additional population moved between the census and now into the study_area (calculated from the difference between the census data and the up to date population data stored in the table study_area), is distributed proportionally among all new residential building units. Negative population data means there is no population ascertained in this building.

This script delivers the following output:

Output of the script

Data Preparation