Logo image
Positive and negative association rule mining in Hadoop’s MapReduce environment
Journal article   Open access   Peer reviewed

Positive and negative association rule mining in Hadoop’s MapReduce environment

Sikha Bagui and Probal Chandra Dhar
Journal of big data, Vol.6(1), pp.1-16
08/22/2019
Web of Science ID: WOS:000599139200001

Metrics

Abstract

In this paper, we present a Hadoop implementation of the Apriori algorithm. Using Hadoop’s distributed and parallel MapReduce environment, we present an architecture to mine positive as well as negative association rules in big data using frequent itemset mining and the Apriori algorithm. We also analyze and present the results of a few optimization parameters in Hadoop’s MapReduce environment as it relates to this algorithm. The results are presented based on the number of rules generated as well as the run-time efficiency. We find that, a higher amount of parallelization, which means larger block sizes, will increase the run-time efficiency of the Hadoop implementation of the Apriori algorithm.
url
Positive and negative association rule mining in Hadoop’s MapReduce environmentView
Published (Version of record)link to articleCC BY V4.0 Open

Related links

Details

Logo image