Preserving the Privacy and Sharing the Data using Classification on Perturbed Data
Journal Title: International Journal on Computer Science and Engineering - Year 2010, Vol 2, Issue 3
Abstract
Data mining is a powerful tool that supports the automatic extraction of previously unknown patterns from large amounts of data. The knowledge extracted by the data mining process supports a variety of domains such as marketing, weather forecasting, and medical diagnosis. The process requires large amounts of data to be collected from diverse sites. With the rapid growth of the Internet, networking, and hardware and software technology, there has been tremendous growth in data collection and data sharing. Huge volumes of detailed data are regularly collected from organizations, and such datasets also contain personal and sensitive data about individuals. Although data mining extracts useful knowledge to support a variety of domains, access to personal data poses a threat to individual privacy. There is increasing concern about how sensitive and private information can be protected while data mining is performed. Privacy-preserving data mining (PPDM) algorithms offer a solution to this privacy problem: PPDM gives valid data mining results while also guaranteeing privacy protection for sensitive data stored in the data warehouse. In this paper we analyze the threats to privacy that can arise from the data mining process. We propose a framework that systematically transforms the original data using a randomized data perturbation technique; the modified data is then returned in response to queries from the parties, using a decision tree approach. This approach gives valid results for analysis purposes, while the actual data is not revealed and privacy is preserved.
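The idea of classifying on perturbed data can be illustrated with a minimal sketch. It is not the authors' framework. The abstract does not specify the perturbation scheme or the tree algorithm, so this sketch assumes additive Gaussian noise on a single numeric attribute, unperturbed class labels, and a one-level decision tree (a stump) chosen by Gini impurity. The function names, the synthetic data, and the noise level `sigma` are all hypothetical.

```python
import random

def perturb(values, sigma, rng):
    """Additive randomization (assumed scheme): release v + noise, never v."""
    return [v + rng.gauss(0.0, sigma) for v in values]

def gini(labels):
    """Gini impurity of a list of 0/1 labels."""
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    return 2.0 * p * (1.0 - p)

def fit_stump(xs, ys):
    """One-level decision tree: the threshold with the lowest weighted Gini."""
    pairs = sorted(zip(xs, ys))
    labels = [y for _, y in pairs]
    n = len(pairs)
    best_score, best_thr, best_i = float("inf"), None, None
    for i in range(1, n):
        left, right = labels[:i], labels[i:]
        score = (len(left) * gini(left) + len(right) * gini(right)) / n
        if score < best_score:
            best_score = score
            best_thr = (pairs[i - 1][0] + pairs[i][0]) / 2.0
            best_i = i
    # Each leaf predicts the majority label of the records that fall into it.
    left_label = int(2 * sum(labels[:best_i]) > best_i)
    right_label = int(2 * sum(labels[best_i:]) > n - best_i)
    return best_thr, left_label, right_label

def predict(stump, x):
    thr, left_label, right_label = stump
    return left_label if x <= thr else right_label

def make_data(n_per_class, rng):
    """Synthetic sensitive attribute: two classes with different means."""
    xs = [rng.gauss(30.0, 5.0) for _ in range(n_per_class)]
    xs += [rng.gauss(70.0, 5.0) for _ in range(n_per_class)]
    ys = [0] * n_per_class + [1] * n_per_class
    return xs, ys

def run_demo(seed=0, sigma=10.0):
    rng = random.Random(seed)
    xs, ys = make_data(200, rng)
    released = perturb(xs, sigma, rng)   # only this version is shared
    stump = fit_stump(released, ys)      # the miner never sees xs
    # Accuracy is measured against the true values only to judge the model.
    correct = sum(predict(stump, x) == y for x, y in zip(xs, ys))
    return correct / len(xs)

if __name__ == "__main__":
    print("accuracy of the stump trained on perturbed data:", run_demo())
```

In this sketch the miner receives only the `released` values, yet the learned threshold still separates the two classes because the noise has zero mean and is small relative to the gap between class means. A larger `sigma` strengthens privacy but reduces how much the perturbed data reveals about the class boundary.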
Authors and Affiliations
P. Kamakshi, Dr. A. Vinaya Babu