CEDAR minisymposium, March 31st – April 1st 2014


Time: Monday March 31st – Tuesday April 1st, 2014

Location: Meertens Instituut. Joan Muyskenweg 25, 1096 CJ Amsterdam https://goo.gl/maps/obg8L

Registration: All attendants please register here before March 24th (limited to 50 places). Attendance is free, but if you register and fail to show up without a good reason and prior notice you will be charged EUR 20.

CEDAR is a joint collaboration between the VU University Amsterdam, the Erasmus University Rotterdam, the International Institute of Social History (IISH)Data Archiving and Networked Services (DANS), and the eHumanities Programme. In CEDAR we work on publishing and analysing the Dutch historical censuses in the Semantic Web as RDF Data Cube. The original dataset consists of 17 census snapshots, performed between 1795 and 1971 in the Netherlands more or less once every 10 years. It is organised as 2,288 tables. We have successfully converted these files into RDF Data Cube and published an alpha release in a SPARQL endpoint.

Once a year we celebrate a project mini symposium, where we come together, report our advances, and open the floor to researchers interested in the variety of topics the project involves: statistical data, social history, the Semantic Web, data harmonisation, and many others. This year we will have a two-day program. The first day will consist, first, of talks by project members, giving an update on advances, milestones and further work; and, second, of invited talks by international researchers, with the purpose of making research links, identifying new challenges and integrating the knowledge of experts coming from multiple fields. The second day will consist of a hackathon bringing together social history and census data publishing on the Web. We expect an audience coming from various countries, heterogeneous fields and different experience (from PhD students to professors).

Preliminary programme

Monday, March 31st

  • 10.00 – 10.05. Welcome (Sally Wyatt)
  • 10.05 – 10.15. Opening (Andrea Scharnhorst)
  • 10.15 – 11.15. Keynote (TBA)
  • 11.15 – 11.30. Cofee break
  • 11.30 – 12.00. Sociohistorical advances in CEDAR (Ashkan Ashkpour)
  • 12.00 – 12.30. Semweb advances in CEDAR (Albert Meroño-Peñuela)
  • 12.30 – 13.30. Lunch
  • 13.30 – 14.00. CLIO INFRA. Charting and explaining the history of global inequality (Jan Kok)
  • 14.00 – 14.30. Standardizing, classifying and disseminating Dutch 19th and 20th century occupational titles (Kees Mandemakers)
  • 14.30 – 15.20. Linked Statistical Dataspaces and Analysis (Sarven Capadisli, invited speaker)
  • 15.20 – 15.35. Coffe break
  • 15.35 – 16.25. Linked Data Pilot @ Swiss Federal Statistical Office: Lessons learned (Adrian Gschwend, invited skpeaker)
  • 16.25 – 17.15. XKOS: Extending SKOS for Describing Statistical Classifications (Guillaume Duffes, invited speaker)
  • 17.15 – 17.30. Closing (Andrea Scharnhorst)
  • 17.30. Drinks

Tuesday, April 1st

  • 10.00 – 10.15. Opening
  • 10.15 – 10.30. Splitting in groups
  • 10.30 – 11.00. Setting goals, work groups, expected outcomes
  • 11.00 – 16.00. Hackathon / brainstorming
  • 16.00 – 17.00. Presentations
  • 17.00 – 17.30. Prizes, closing remarks
  • 17.30. Drinks

Linked Statistical Dataspaces and Analysis

Sarven’s background is in user-experience design and open Web technologies, and ultimately interested in human-computer interaction, and how it changes us. In the past several years, his focus has been in making statistical data on the Web more accessible and usable for Semantic Web developers and in parallel aiming to create citizen-centric interfaces in order to further knowledge about human societies, and to foster better decision-making environments. See also: http://270a.info/

Sarven Capadisli‘s short bio is available here.


Linked Data Pilot @ Swiss Federal Statistical Office: Lessons learned

In 2013 the Swiss Federal Statistical Office developed a pilot platform for publishing linked statistical data. The presentation will explain what was done, how we used data cubes and where we might be heading with the platform and visualization.

Adrian Gschwend is the founder of netlabs.org, the Enterprise Linked Data specialist in Switzerland. He provides profound experience in linked data, knowledge management, open source software and project management, a domain he supports for over 15 years.


XKOS: Extending SKOS for Describing Statistical Classifications

A statistical classification scheme is used in statistics to place units in one and only one category from that scheme. These categories are used as units of analysis; dimensions in databases, tables, and time series; criteria for subdividing or aggregating populations; etc. SKOS provides a simple model for rendering a statistical classification scheme in Linked Open Data (LOD) format. However, it is too simple for some purposes, so XKOS (eXtended Knowledge Organization System) was designed to extend it in order to allow richer representations of statistical classifications. This presentation will describe the statistical needs to address and the results of the XKOS design work, including some examples on well-known statistical classifications. A live demo will showcase the potential use of the XKOS vocabulary.

Guillaume Duffes has been working for INSEE (National Institute of Statistics and Economic Studies, France) for more than 8 years. He started working in the Consumer Price Index unit (3 years), then spent another three years in the department of international relationships, and then moved to his current position in the “RMéS” project. The RMéS project aims at building a statistical metadata repository at INSEE and foresees to make use of Semantic Web standards and RDF vocabularies such as XKOS. In regard to hobbies, he enjoys playing football more than watching it on TV, and travelling.