Please use this identifier to cite or link to this item: http://hdl.handle.net/123456789/6147
Full metadata record
DC FieldValueLanguage
dc.contributor.authorHaruna, Charles R.-
dc.contributor.authorHou, MengShu-
dc.contributor.authorEghan, Moses J.-
dc.contributor.authorKpiebaareh, Michael Y.-
dc.contributor.authorTandoh, Lawrence-
dc.contributor.authorEghan-Yartel, Barbie-
dc.contributor.authorAsante-Mensah, Maame G.-
dc.date.accessioned2021-10-07T11:17:53Z-
dc.date.available2021-10-07T11:17:53Z-
dc.date.issued2018-
dc.identifier.issn23105496-
dc.identifier.urihttp://hdl.handle.net/123456789/6147-
dc.description6p:, ill.en_US
dc.description.abstractIn real world, databases often have several records representing the same entity and these duplicates have no common key, thus making deduplication difficult. Machine-based and crowdsourcing techniques were dis jointly used in improving quality in data deduplication. Crowdsourcing were used for solving tasks that the machine-based algorithms were not good at. Though, the crowds, compared with machines, provided relatively more accurate results, both platforms were slow in execution and hence expensive to implement. In this paper, a hybrid human machine system was proposed where machines were firstly used on the data set before the humans were further used to identify potential duplicates. We performed experiments using three benchmark datasets; paper, restaurant and product datasets. Our algorithm was compared with some existing techniques and our approach outperformed some methods by achieving a high accuracy of deduplication and good deduplication efficiency while incurring low crowdsourcing costsen_US
dc.language.isoenen_US
dc.publisherUniversity of Cape Coasten_US
dc.subjectQualitative Error Detectionen_US
dc.subjectHybrid Data Deduplicationen_US
dc.subjectClusteringen_US
dc.subjectPivot Graphsen_US
dc.subjectEntity Resolutionen_US
dc.subjectCrowd sourcingen_US
dc.titleCost-based and effective human-machine based data deduplication model in entity reconciliationen_US
dc.typeArticleen_US
Appears in Collections:Department of Physics

Files in This Item:
File Description SizeFormat 
Cost-Based and Effective Human-Machine Based.pdfArticle248.52 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.