- The MIDI Linked Data Cloud, a Linked Data representation of digital music in MIDI format containing 350K songs and 10B triples
- grlc, a service that transforms GitHub repositories with SPARQL queries into Linked Data APIs automatically
- TabLinker, a framework for integrating messy spreadsheet collections into fully fledged RDF Data Cubes (example result here)
- LSD Dimensions, an aggregator of statistical dimensions and Data Structure Definitions in use in public SPARQL endpoints
- Linked Edit Rules, the hub for publishing edit rules (data quality checkers) as Linked Data
- SCRY, a lightweight SPARQL endpoint with customizable, standard-compliant and implementation-independent functions
I am currently involved in CLARIAH, a Dutch Science Foundation (NWO) project that aims to build a distributed infrastructure for the humanities and social sciences. I completed my PhD in 2016 with the thesis Refining Statistical Data on the Web, under the supervision of Frank van Harmelen, Stefan Schlobach and Andrea Scharnhorst (browse it, or order a printed copy). I had the unconditional support of Data Archiving and Networked Services and the eHumanities Group of the Royal Netherlands Academy of Arts and Sciences (KNAW). Until 2011, I developed semantic-based tools to support legal decision making at the Institute of Law and Technology.
Over the years, I have participated in various (mainly publicly) funded research projects:
- CLARIAH. Common Lab Research Infrastructure for the Arts and Humanities. Funder: NWO, 2015-2018.
- CEDAR. Census Data Open Linked: Dutch census data in a web of global cultural and historic information. Funder: KNAW, Computational Humanities Programme, 2011-2015.
- OGRC. New OGRC solutions for policy and digital security compliance. Funder: MICINN, FEDER (IPT 430000-2010-43), 2010-2012 (INNPACTO).
- NEURONA. Development of an intelligent system to control data protection compliance. Funder: Ministerio de Industria, Turismo y Comercio, Spain (TSI-200100-2008-134), 2008-2010 (AVANZA).