TY - GEN
T1 - Online mining of temporal maximal utility itemsets from data streams
AU - Shie, Bai En
AU - Tseng, S.
AU - Yu, Philip S.
PY - 2010/7/23
Y1 - 2010/7/23
N2 - Data stream mining has become an emerging research topic in the data mining field, and finding frequent itemsets is an important task in data stream mining with wide applications. Recently, utility mining is receiving extensive attentions with two issues reconsidered: First, the utility (e.g., profit) of each item may be different in real applications; second, the frequent itemsets might not produce the highest utility. In this paper, we propose a novel algorithm named GUIDE (Generation of temporal maximal Utility Itemsets from Data strEams) which can find temporal maximal utility itemsets from data streams. A novel data structure, namely, TMUI-tree (Temporal Maximal Utility Itemset tree), is also proposed for efficiently capturing the utility of each itemset with one-time scanning. The main contributions of this paper are as follows: 1) GUIDE is the first one-pass utility-based algorithm for mining temporal maximal utility itemsets from data streams, and 2) TMUI-tree is efficient and easy to maintain. The experimental results show that our approach outperforms other existing utility mining algorithms like Two-Phase algorithm under the data stream environments.
AB - Data stream mining has become an emerging research topic in the data mining field, and finding frequent itemsets is an important task in data stream mining with wide applications. Recently, utility mining is receiving extensive attentions with two issues reconsidered: First, the utility (e.g., profit) of each item may be different in real applications; second, the frequent itemsets might not produce the highest utility. In this paper, we propose a novel algorithm named GUIDE (Generation of temporal maximal Utility Itemsets from Data strEams) which can find temporal maximal utility itemsets from data streams. A novel data structure, namely, TMUI-tree (Temporal Maximal Utility Itemset tree), is also proposed for efficiently capturing the utility of each itemset with one-time scanning. The main contributions of this paper are as follows: 1) GUIDE is the first one-pass utility-based algorithm for mining temporal maximal utility itemsets from data streams, and 2) TMUI-tree is efficient and easy to maintain. The experimental results show that our approach outperforms other existing utility mining algorithms like Two-Phase algorithm under the data stream environments.
KW - data stream mining
KW - maximal itemsets
KW - temporal high utility itemsets
KW - utility mining
UR - http://www.scopus.com/inward/record.url?scp=77954704076&partnerID=8YFLogxK
U2 - 10.1145/1774088.1774436
DO - 10.1145/1774088.1774436
M3 - Conference contribution
AN - SCOPUS:77954704076
SN - 9781605586380
T3 - Proceedings of the ACM Symposium on Applied Computing
SP - 1622
EP - 1626
BT - APPLIED COMPUTING 2010 - The 25th Annual ACM Symposium on Applied Computing
Y2 - 22 March 2010 through 26 March 2010
ER -