Open Access Open Access  Restricted Access Subscription or Fee Access

A Novel Association Rule Mining Algorithm to Enhance Confidentiality in Data Mining

A. Kutralam, Dr. Antony Selvadoss Thanamani

Abstract


Data mining is the process of extracting hidden patterns from data. As more data is gathered, with the amount of data increasing every year, data mining is becoming an increasingly important tool to transform this data into information. We focus on APRIORI algorithm, a popular data mining technique and analyze the performance of linked list based implementation as a basis for mining frequent item sequences in a transactional database. This algorithm has given us new capabilities to identify associations in large data sets. But an important issue, still not sufficiently scanned, is the need to balance the confidentiality of the disclosed data with the legitimate needs of the data users. We work with some association rule hiding algorithms and examine their performances in order to analyze their time complexity and the impact that they have in the original database. We work a side effect – the number of new rules generated during the hiding process.

Keywords


Association Rule Mining, Apriori Algorithm, Privacy Issues, Hiding Strategies

Full Text:

PDF

References


Charu C. Aggarwal and Philip S. Yu,‖Privacy-Preserving Data Mining: A Survey‖, IBM, T.J. Watson Research Center.

Verykios V. S., Elmagarmid A., Bertino E., Saygin Y., Dasseni E., ―Association Rule Hiding‖, IEEE Transactions on Knowledge and Data Engineering, Vol. 16, No. 4, April 2004.

D. Agrawal and C. C. Aggarwal,‖On the Design and Quantification of Privacy Preserving

Data Mining Algorithms‖, Proc. of ACM PODS Conference, 2001.

R. Agrawal, T. Imielinski, A. Swami,‖Mining Association Rules between Sets of Items in Large Databases", Proc. SIGMOD Conference, 1993.

Rakesh Agrawal and Ramakrishnan Srikant, ―Fast algorithms for mining association rules in large databases‖, In Jorge B. Bocca, Matthias Jarke, and Carlo Zaniolo, editors, Proc. of the 20th International Conference on Very Large Data Bases, VLDB, pages 487-499, Santiago, Chile, September 1994.

Arun K. Pujari, ― Data Mining Techniques‖, 14th impression, 2008.

Aaron M. Tenenbaum et.al, ―Data Structures Using C and C++‖, 2nd edition.

Daniel E. O‘Leary.‖ Knowledge Discovery as a Threat to Database Security‖, Proc. Of the 1st International Conference on Knowledge Discovery and Databases‖ pages 107– 516, 1991.


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.