News from Jun 06, 2011
We are happy to announce Version 2.4 of the Silk - Link Discovery Framework for the Web of Data.
The central idea of the Web of Data is to interlink data items using RDF links. However, in practice most data sources are not sufficiently interlinked with related data sources. The Silk Link Discovery Framework addresses this problem by providing tools to generate links between data items based on user-provided link specifications. It can be used by data publishers to generate links between datasets as well as by Linked Data consumers to augment Web data with additional RDF links.
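To make the idea of generating RDF links from a link specification concrete, here is a minimal Python sketch. It is not Silk's Link Specification Language or its API. The function names, the example.org URIs, the label-based comparison and the 0.9 threshold are all assumptions made for illustration. It compares the labels of two small datasets and emits an owl:sameAs link in N-Triples syntax for each pair that is similar enough.

```python
from difflib import SequenceMatcher

OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"


def similarity(a, b):
    """Case-insensitive string similarity in the range [0, 1]."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def generate_links(source, target, threshold=0.9):
    """Return N-Triples owl:sameAs statements for sufficiently similar labels.

    `source` and `target` map entity URIs to labels (hypothetical data model).
    Every source entity is compared with every target entity.
    """
    links = []
    for s_uri, s_label in source.items():
        for t_uri, t_label in target.items():
            if similarity(s_label, t_label) >= threshold:
                links.append(f"<{s_uri}> <{OWL_SAME_AS}> <{t_uri}> .")
    return links


# Hypothetical example data
source = {
    "http://example.org/a/Berlin": "Berlin",
    "http://example.org/a/Paris": "Paris",
}
target = {
    "http://example.org/b/berlin": "berlin",
    "http://example.org/b/Rome": "Rome",
}

links = generate_links(source, target)
for link in links:
    print(link)
```

The sketch compares every source entity with every target entity, which is the quadratic cost that blocking methods aim to avoid on large datasets.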
Version 2.4 adds a new interactive workbench to Silk. The Silk Workbench is a web application which guides the user through the process of interlinking different data sources. The Silk Workbench offers the following features:
More information about the Silk Workbench can be found in the Silk Wiki.
As the Web of Data grows, there is an increasing need for link discovery tools that scale to very large datasets. In record linkage, many partitioning methods have been proposed that substantially reduce the number of entity comparisons required to discover links. Unfortunately, most of these methods either lead to a decrease in recall or only work on metric spaces. Silk Version 2.3 implements a novel blocking method called MultiBlock, which uses a multidimensional index in which similar objects are located near each other. In each dimension, the entities are indexed by a different property, which increases the efficiency of the index significantly. In addition, MultiBlock guarantees that no false dismissals can occur. The approach works on complex link specifications that aggregate several different similarity measures. Details about MultiBlock can be found in the following paper:
Robert Isele, Anja Jentzsch, Christian Bizer: Efficient Multidimensional Blocking for Link Discovery without losing Recall. 14th International Workshop on the Web and Databases (WebDB 2011), Athens, June 2011.
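The general idea of a multidimensional index that does not lose recall can be illustrated with a small Python sketch. This is not the MultiBlock algorithm from the paper, only a simplified uniform grid under stated assumptions: every property is numeric, two entities match when their values differ by at most a per-property threshold, and all names and data are hypothetical. Each dimension of the grid indexes one property with a cell width equal to that property's threshold, so any matching pair falls into the same or an adjacent cell.

```python
from collections import defaultdict
from itertools import product
from math import floor


def cell_of(values, thresholds):
    """Map an entity's property values to grid cell coordinates."""
    return tuple(floor(v / t) for v, t in zip(values, thresholds))


def build_index(entities, thresholds):
    """Index entities by grid cell; one dimension per property."""
    index = defaultdict(list)
    for entity_id, values in entities.items():
        index[cell_of(values, thresholds)].append(entity_id)
    return index


def candidate_pairs(source, target, thresholds):
    """Pair each source entity with target entities in its own or adjacent cells."""
    index = build_index(target, thresholds)
    offsets = list(product((-1, 0, 1), repeat=len(thresholds)))
    pairs = set()
    for source_id, values in source.items():
        cell = cell_of(values, thresholds)
        for offset in offsets:
            neighbour = tuple(c + o for c, o in zip(cell, offset))
            for target_id in index.get(neighbour, ()):
                pairs.add((source_id, target_id))
    return pairs


def matches(a, b, thresholds):
    """Full comparison: every property must be within its threshold."""
    return all(abs(x - y) <= t for x, y, t in zip(a, b, thresholds))


# Hypothetical entities described by (latitude, longitude)
source = {"s1": (52.52, 13.40), "s2": (48.85, 2.35)}
target = {"t1": (52.50, 13.45), "t2": (41.90, 12.50), "t3": (48.90, 2.30)}
thresholds = (0.5, 0.5)

candidates = candidate_pairs(source, target, thresholds)
links = {(s, t) for s, t in candidates
         if matches(source[s], target[t], thresholds)}
print(len(candidates), "candidate pairs instead of", len(source) * len(target))
```

Because the cell width equals the threshold in each dimension, two values that differ by at most the threshold can be at most one cell apart. Checking the neighbouring cells therefore cannot dismiss a true match, while distant entities are never compared.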
More information about the Silk framework and the Silk Link Specification Language, as well as several examples that demonstrate how Silk is used to set links between different data sources in the LOD cloud, can be found at:
The Silk framework is provided under the terms of the Apache License, Version 2.0, and can be downloaded from
The development of Silk was supported by Vulcan Inc. as part of its Project Halo (www.projecthalo.com) and by the EU FP7 project LOD2 - Creating Knowledge out of Interlinked Data (http://lod2.eu/, Ref. No. 257943).
Thanks to Christian Becker, Michal Murawicki and Andrea Matteini for contributing to the Silk Workbench.
Robert Isele, Anja Jentzsch and Christian Bizer