Mining High Average-Efficiency Itemsets

Loading...
Publication Logo

Date

2024

Authors

Yildirim, I.

Journal Title

Journal ISSN

Volume Title

Publisher

Institute of Electrical and Electronics Engineers Inc.

Open Access Color

OpenAIRE Downloads

OpenAIRE Views

Research Projects

Journal Issue

Abstract

Mining high-utility itemsets is a popular area within itemset mining, aiming to discover itemsets that provide high utility (profit) in a given dataset. However, it does not consider the investment costs required to obtain these profits. To address this, the problem of high-efficiency itemset mining (HEIM) was recently introduced to discover high-efficiency itemsets based on a given minimum efficiency threshold. In HEIM, the efficiency of an itemset is defined as its utility derived from transactions containing it, divided by its total investment. However, the efficiency of an itemset may increase with its length since the length of itemsets is not considered in the efficiency calculation. Therefore, it may be unfair to compare the efficiency of itemsets of different lengths using the same threshold value. In this study, the problem of high average-efficiency itemset mining (HAEIM) is introduced as an extension of HEIM problem. To evaluate the importance of itemsets with a fairer measurement, HAEIM takes into account not only the utility and investment information of itemsets but also their lengths. An algorithm named HAEIMiner is developed for finding high average-efficiency itemsets. Additionally, an upper-bound model is introduced to improve the mining process by pruning the search space. The results of the experiments conducted to demonstrate the effectiveness of HAEIMiner show that the number of high average-efficiency itemsets is fewer than the number of high-efficiency itemsets under the same threshold. This indicates that the HAEIM problem can be addressed with lower efficiency threshold values compared to the HEIM problem. © 2024 IEEE.

Description

Keywords

Average-Efficiency, Data Mining, Efficiency, Investment, Itemset Mining

Fields of Science

Citation

WoS Q

N/A

Scopus Q

N/A

Source

-- 8th International Artificial Intelligence and Data Processing Symposium, IDAP 2024 -- 2024-09-21 through 2024-09-22 -- Malatya -- 203423

Volume

Issue

Start Page

End Page

Google Scholar Logo
Google Scholar™

Sustainable Development Goals

SDG data could not be loaded because of an error. Please refresh the page or try again later.