Projects

I am the proud inventor and main developer of grlc, an automated Linked Data API construction tool that transforms any SPARQL endpoint into an OpenAPI specification by leveraging shared queries on the Web. grlc has an active pool of 3,200 users, among them Elsevier, TNO and the eScience Center.

I am also the author of midi2rdf, an algorithm that transforms any MIDI file or MIDI event stream input into an RDF Knowledge Graph of interlinked music information. midi2rdf has been used to build the MIDI Linked Data Cloud, a Knowledge Graph connecting more than 10 billion RDF musical facts from MIDI files collected across the Web.

I have also written compliant implementations of W3C Web standards, including CSV on the Web converters and Linked Data Notifications receivers and consumers.

Past projects

  • TabLinker, a framework for integrating messy spreadsheet collections into fully fledged RDF Data Cubes (example result here)
  • LSD Dimensions, an aggregator of statistical dimensions and Data Structure Definitions in use in public SPARQL endpoints
  • Linked Edit Rules, the hub for publishing edit rules (data quality checkers) as Linked Data
  • SCRY, a lightweight SPARQL endpoint with customizable, standards-compliant and implementation-independent functions

I am currently involved in CLARIAH, a Dutch Science Foundation (NWO) project that aims to build a distributed infrastructure for the humanities and social sciences. I completed my PhD in 2016 with the thesis Refining Statistical Data on the Web, under the supervision of Frank van Harmelen, Stefan Schlobach and Andrea Scharnhorst (browse it, or order a printed copy). I had the unconditional support of Data Archiving and Networked Services and the eHumanities Group of the Royal Netherlands Academy of Arts and Sciences (KNAW). Until 2011, I developed semantic-based tools to support legal decision making at the Institute of Law and Technology.

Over the years, I have participated in various (mainly publicly) funded research projects:

  • CLARIAH. Common Lab Research Infrastructure for the Arts and Humanities. Funder: NWO, 2015-2018.
  • CEDAR. Census Data Open Linked: Dutch census data in a web of global cultural and historic information. Funder: KNAW, Computational Humanities Programme, 2011-2015.
  • OGRC. New OGRC solutions for policy and digital security compliance. Funder: MICINN, FEDER (IPT 430000-2010-43), 2010-2012 (INNPACTO).
  • NEURONA. Development of an intelligent system to control data protection compliance. Funder: Ministerio de Industria, Turismo y Comercio, Spain (TSI-200100-2008-134), 2008-2010 (AVANZA).