MS2016-12: Data Quality Improvement in Data Space Environments

posted Oct 26, 2016, 8:17 AM by Marco Spruit   [ updated Mar 20, 2018, 4:51 AM ]
The Research and Documentation Centre (WODC) of the Dutch Ministry of Security and Justice uses a lot of different, heterogeneous, data sets in their research. The need for integration depends strongly on the research being done, which is why the data sets are managed using a data space approach. In this approach, data integration and other data quality improvement are initiated by the need for it in specific research projects (pay-as-you-go). However, decentralizing data quality improvement is not always the most efficient way; when several projects encounter the same data quality issues collaboration on the improvement of these issues is desirable and the issues are also likely to become more urgent to solve. The WODC wants more insight in the impact of known data quality issues, and looks for solutions to determine which issues are better to solve in a more centralized way.

NB: Due to the Dutch data and documentation, understanding of written Dutch is preferable.