Annotating web tables through ontology matching
Vasilis Efthymiou, Oktie Hassanzadeh, et al.
OM 2016
Entity Resolution (ER) is the task of finding records that refer to the same real-world entity. A common scenario, which we refer to as Clean-Clean ER, is resolving entities across two clean sources. In this paper, we perform an extensive empirical evaluation of 8 bipartite graph matching algorithms that take a bipartite similarity graph as input and produce a set of matched entities as output. We consider a wide range of matching algorithms, including algorithms that have not previously been applied to ER or have been evaluated only in other ER settings. We assess the relative performance of the algorithms with respect to accuracy and time efficiency over 10 established real-world datasets, from which we extract more than 700 different similarity graphs. Our results provide insights into the relative performance of these algorithms and guidelines for choosing the best one, depending on the data at hand.
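To make the input and output of such an algorithm concrete, the following is a minimal sketch of one simple strategy: a greedy one-to-one matching over a weighted bipartite similarity graph. It is an illustration only and is not claimed to be one of the 8 algorithms evaluated in the paper. The function name `greedy_bipartite_matching`, the edge-list representation, the entity ids, and the `threshold` parameter are all assumptions made for this example.

```python
from typing import Dict, List, Tuple

# Hypothetical edge representation: (left entity id, right entity id, similarity).
Edge = Tuple[str, str, float]


def greedy_bipartite_matching(edges: List[Edge], threshold: float = 0.5) -> Dict[str, str]:
    """Greedily pick the most similar remaining pair until no edge qualifies.

    Assumes a Clean-Clean setting, so each entity is matched at most once.
    """
    matches: Dict[str, str] = {}
    used_right = set()
    # Visit edges from highest to lowest similarity.
    for left, right, sim in sorted(edges, key=lambda e: e[2], reverse=True):
        if sim < threshold:
            break  # all remaining edges are below the threshold
        if left in matches or right in used_right:
            continue  # one endpoint is already matched
        matches[left] = right
        used_right.add(right)
    return matches


# Toy similarity graph between two sources (made-up ids and weights).
edges = [
    ("a1", "b1", 0.90),
    ("a1", "b2", 0.80),
    ("a2", "b1", 0.85),
    ("a2", "b2", 0.60),
    ("a3", "b3", 0.30),
]
result = greedy_bipartite_matching(edges, threshold=0.5)
print(result)  # {'a1': 'b1', 'a2': 'b2'}
```

A greedy pass like this is fast but does not necessarily maximize the total similarity of the matching, which is one reason a comparison of accuracy against time efficiency across different algorithms is informative.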