Harmonization efforts

Harmonization is a fundamental task in CEDAR to improve census data quality and comparability. A first release (r1) of harmonized census data is available at the SPARQL endpoint

http://lod.cedar-project.nl:8080/sparql/cedar

under the graph group <http://lod.cedar-project.nl/resource/r1/cedar-dataset>.

The harmonization rules that have been used are available in human and machine readable format here.

Preliminar studies of harmonized queries on the dataset are also available. This IPhython notebook shows the distribution of dimensions in all census datasets. This other notebook shows time series charts of simple demographic queries that will be the starting point to debug source data errors, conversion mistakes and harmonization misconceptions.