An Efficient Approach towards Duplicate Detection System

Abstract

Information on the web is very huge in size and the tasks of search engines have become more and more complex as a single entity on the web have two or more representations in databases. The duplicate detection is the process of identifying the entities who has multiple representation of the same real world entity, as the duplicate detection methods has to process large datasets, the identification of duplicate document in a large database is a issue significantly with wide-spread applications. In this paper a review on various approaches of duplicate detection will be presented. Proposed system will compare two Duplication detection methods, the first is based on two novel progressive duplicate detection algorithms that significantly increase the efficiency of finding duplicates if the execution time is limited. The second is based on Secure Hashing Algorithm which will detect and delete duplicate data, the secure hash algorithm will perform data de-duplication task in order to overcome the issues of time and to reduce hash collision.

Authors and Affiliations

Miss. Ruchira Dhananjay Deshpande, Sonali Bodkhe

Keywords

Related Articles

slugImplementation of Smart Gateway For Automation of Home Devices and Appliances

In the “Internet of Things” concept, the physical world can be integrated with computer networks and applications. The Embedded computers as well as visual markers on everyday or objects allow the information about t...

Increasing Network Lifetime by Using Secure Clustering With Reliable Node Disjoint Multi-path Routing in Wireless Sensor Networks

In order to increase the network latency and resolve the security bottlenecks induced by the camouflaged malicious nodes in Wireless Sensor Networks, the residual energy and trust values are used to form a secured clust...

Dual Resonant Frequency Antenna for Mobile Communications

The dual resonant frequency antenna module is proposed for the purpose of mobile communications. The microstrip patch antenna incorporated with a slot design that makes the antenna to perform both higher and lower freq...

The effect of turning parameters on surface smoothness of D3 cold work steel

This paper studies the effect of turning parameters such as cutting speed, feed rate, load depth, cutting time and tool radius on surface roughness in the process of turning the D3 cold-work steel. To perform this resea...

Nano materials filled Polymers for reducing the thermal Peak temperature in a vehicle

There is an increasing demand for fuel nowadays and it is soon expected that there will be an acute shortage in the fuel that we are using at present. Hence there is a need to optimize the fuel usage. Almost 10% of f...

Download PDF file
  • EP ID EP23038
  • DOI -
  • Views 286
  • Downloads 4

How To Cite

Miss. Ruchira Dhananjay Deshpande, Sonali Bodkhe (2017). An Efficient Approach towards Duplicate Detection System. International Journal for Research in Applied Science and Engineering Technology (IJRASET), 5(1), -. https://www.europub.co.uk/articles/-A-23038