Reconciling Schema Matching Networks Through Crowdsourcing

Journal Title: EAI Endorsed Transactions on Collaborative Computing - Year 2015, Vol 1, Issue 2

Abstract

for data integration purposes. Although several automatic schema matching tools have been developed, their results are often incomplete or erroneous. To obtain a correct set of correspondences, usually human effort is required to validate the generated correspondences. This validation process is often costly, as it is performed by highly skilled experts. Our paper analyzes how to leverage crowdsourcing techniques to validate the generated correspondences by a large group of non-experts. In our work we assume that one needs to establish attribute correspondences not only between two schemas but in a network. We also assume that the matching is realized in a pairwise fashion, in the presence of consistency expectations about the network of attribute correspondences. We demonstrate that formulating these expectations in the form of integrity constraints can improve the process of reconciliation. As in the case of crowdsourcing the user’s input is unreliable, we need specific aggregation techniques to obtain good quality. We demonstrate that consistency constraints can not only improve the quality of aggregated answers, but they also enable us to more reliably estimate the quality answers of individual workers and detect spammers. Moreover, these constraints also enable to minimize the necessary human effort needed, for the same expected quality of results.

Authors and Affiliations

Nguyen Quoc Viet Hung, Nguyen Thanh Tam, Zoltán Miklós, Karl Aberer

Keywords

Related Articles

A Scheme for Collaboratively Processing Nearest Neighbor Queries in Oblivious Storage

Security concerns are a substantial impediment to the wider deployment of cloud storage. There are two main concerns on the confidentiality of outsourced data: i) protecting the data, and ii) protecting the access patter...

A Highly Concurrent Replicated Data Structure EAI Endorsed Transactions

Well defined concurrent replicated data structure is very important to design collaborative editing system, particularly, certain properties like out-of-order execution of concurrent operations and data convergence. In t...

Welcome Message from the Editors-in-Chief

On behalf of the Editorial Board and the Advisory Board, we are pleased to welcome all to the inaugural issue of the EAI Endorsed Transactions on Collaborative Computing. This journal reflects the increasing maturity...

Impact on procurement and training by research on the interaction design of medical devices

We present a case study of how research can influence practice in the procurement of healthcare technology based on the CHI+MED project. CHI+MED is concerned with interaction design and the safety of medical devices. It...

PVSio-web: mathematically based tool support for the design of interactive and interoperable medical systems

Use errors, where medical devices work to specification but lead to the clinicians making mistakes resulting in patient harm, is a critical problem. Manufacturers need tools to help them find such design flaws at an earl...

Download PDF file
  • EP ID EP45684
  • DOI http://dx.doi.org/10.4108/cc.1.2.e2
  • Views 476
  • Downloads 0

How To Cite

Nguyen Quoc Viet Hung, Nguyen Thanh Tam, Zoltán Miklós, Karl Aberer (2015). Reconciling Schema Matching Networks Through Crowdsourcing. EAI Endorsed Transactions on Collaborative Computing, 1(2), -. https://www.europub.co.uk/articles/-A-45684