Mining High Utility Itemsets from Multiple Databases

Lin, Jerry Chun-wei; Li, Yuanfa; Fournier-Viger, Philippe; Tang, Linlin

doi:10.1007/978-3-030-04585-2_17

Jerry Chun-wei Lin^8,9,
Yuanfa Li⁸,
Philippe Fournier-Viger¹⁰ &
…
Linlin Tang⁸

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 128))

Included in the following conference series:

International Conference on Smart Vehicular Technology, Transportation, Communication and Applications

550 Accesses

Abstract

In the past, many algorithms have been developed to efficiently mine the high-utility itemsets from a single data source, which is not a realistic scenario since the data may be distributed into varied branches, and the discovered information should be integrated together for making the effective decision. In this paper, we focus on developing an efficient algorithm for synthesizing the mined high-utility itemsets from different sources. A baseline algorithm is first designed and two criteria are then developed to verify whether the designed algorithm is efficient to generate the same number of the high-utility itemsets as the batch-processed algorithm. Experiments are then shown that the designed algorithm has good performance for rule synthesization.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Agrawal, R. Srikant, R.: Fast algorithms for mining association rules in large databases. In: The International Conference on Very Large Data Bases, pp. 487–499 (1994)
Google Scholar
Agrawal, R., Shafer, J.C.: Parallel mining of association rules. IEEE Trans. Knowl. Data Eng. 8(6), 962–969 (1996)
Article Google Scholar
Adhikari, A.: Synthesizing global exceptional patterns in different data sources. J. Intell. Syst. 21(3), 293–323 (2012)
Google Scholar
Fournier-Viger, P., Lin, J.C.W., Gomariz, A., Gueniche, T., Soltani, A., Deng, Z., Lam, H.T.: The SPMF open-source data mining library version 2. In: The Joint European Conference on Machine Learning and Knowledge Discovery in Databases, pp. 36–40 (2016)
Google Scholar
Fournier-Viger, P., Lin, J.C.W., Duong, Q.H., Dam, T.L.: FHM +: faster high-utility itemset mining using length upper-bound reduction. The International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems, pp. 115–127 (2016)
Google Scholar
Gan, W., Lin, J.C.W., Chao, H.C., Zhan, J.: Data mining in distributed environment: a survey. Wiley Interdiscip. Rev.: Data Min. Knowl. Discov. 7(6), 1–19 (2017)
Google Scholar
Gan, W., Lin, J.C.W., Fournier-Viger, P., Chao, H.C., Tseng, V.S., Yu, P.: A survey of utility-oriented pattern mining, https://arxiv.org/pdf/1805.10511.pdf (2018)
Gan, W., Lin, J.C.W., Fournier-Viger, P., Chao, H.C., Hong, T.P., Fujita, H.: A survey of incremental high-utility itemset mining. Wiley Interdiscip. Rev.: Data Min. Knowl. Discov. 8(2), 1–23 (2018)
Google Scholar
Gan, W., Lin, J.C.W., Fournier-Viger, P., Chao, H.C., Fujita, H.: Extracting non-redundant correlated purchase behaviors by utility measure. Knowl.-Based Syst. 143, 30–41 (2018)
Article Google Scholar
Han, J., Pei, J., Yin, Y., Mao, R.: Mining frequent patterns without candidate generation: a frequent-pattern tree approach. Data Min. Knowl. Discov. 8(1), 53–87 (2004)
Article MathSciNet Google Scholar
Liu, Y., Liao, W.K., Choundhary A.: A fast high utility itemsets mining algorithm. In: The International Workshop on Utility-based Data Mining, pp. 90–99 (2005)
Google Scholar
Lin, J.C.W., Gan, W., Hong, T.P., Zhang, B.: An incremental high-utility mining algorithm with transaction insertion. Sci. World J. 161564, 15 pages (2015)
Google Scholar
Lin, J.C.W., Ren, S., Fournier-Viger, P.: MEMU: more efficient algorithm to mine high average-utility patterns with multiple minimum average-utility thresholds. IEEE Access 14(8), 7593–7609 (2018)
Article Google Scholar
Otey, M.E., Parthasarathy, S., Wang, C., Veloso, A., Meira, W.: Parallel and disctributed methods for incremental frequent itemset mining. IEEE Trans. Syst. Man Cybern. Part B: Cybern. 36(6), 2439–2450 (2004)
Article Google Scholar
Ozkural, E., Ucar, B., Aykanat, C.: Parallel frequent item set mining with selective item replication. IEEE Trans. Parallel Distrib. Syst. 22(10), 1632–1640 (2011)
Article Google Scholar
Sohrabi, M.K., Barforoush, A.A.: Parallel frequent itemset mining using systolic arrays. Knowl.-Based Syst. 37, 462–471 (2013)
Article Google Scholar
Wu, X., Zhang, S.: Synthesizing high-frequency rules from different data sources. IEEE Trans. Knowl. Data Eng. 15(2), 353–367 (2003)
Article Google Scholar
Yao, H., Hamilton, H.J.: Mining itemset utilities from transaction databases. Data Knowl. Eng. 59(3), 603–626 (2006)
Article Google Scholar
Zhong, N., Yao, Y.Y.Y., Ohshima, M.: Peculiarity oriented multidatabase mining. IEEE Trans. Knowl. Data Eng. 15(4), 952–960 (2003)
Article Google Scholar
Zhang, S., Zhang, C., Wu, X.: Knowledge Discovery in Multiple Databases. Springer, London (2004)
Book Google Scholar
Zhang, B., Lin, J.C.W., Shao, Y., Fournier-Viger, P., Djenouri, Y.: Maintenance of discovered high average-utility itemsets in dynamic databases. Appl. Sci. 8(5), 1–19 (2018)
Google Scholar

Download references

Acknowledgements

This research was partially supported by the Shenzhen Technical Project under JCYJ20170307151733005 and KQJSCX20170726103424709 and by the National Natural Science Foundation of China (NSFC) under grant No. 61503092.

Author information

Authors and Affiliations

School of Computer Science and Technology, Harbin Institute of Technology (Shenzhen), Shenzhen, China
Jerry Chun-wei Lin, Yuanfa Li & Linlin Tang
Department of Computing, Mathematics, and Physics, Western Norway University of Applied Sciences, Bergen, Norway
Jerry Chun-wei Lin
School of Humanities and Social Sciences, Harbin Institute of Technology (Shenzhen), Shenzhen, China
Philippe Fournier-Viger

Authors

Jerry Chun-wei Lin
View author publications
You can also search for this author in PubMed Google Scholar
Yuanfa Li
View author publications
You can also search for this author in PubMed Google Scholar
Philippe Fournier-Viger
View author publications
You can also search for this author in PubMed Google Scholar
Linlin Tang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jerry Chun-wei Lin .

Editor information

Editors and Affiliations

College of Physics and Energy, Fujian Normal University, Fuzhou, Fujian, China
Yong Zhao
College of Information Science and Engineering, Fujian University of Technology, Fuzhou, Fujian, China
Tsu-Yang Wu
School of Transportation, Fujian University of Technology, Fuzhou, Fujian, China
Tang-Hsien Chang
College of Information Science and Engineering, Fujian University of Technology, Fuzhou, Fujian, China
Jeng-Shyang Pan
University of Technology Sydney, Sydney, NSW, Australia
Lakhmi C. Jain

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lin, J.Cw., Li, Y., Fournier-Viger, P., Tang, L. (2019). Mining High Utility Itemsets from Multiple Databases. In: Zhao, Y., Wu, TY., Chang, TH., Pan, JS., Jain, L. (eds) Advances in Smart Vehicular Technology, Transportation, Communication and Applications. VTCA 2018. Smart Innovation, Systems and Technologies, vol 128. Springer, Cham. https://doi.org/10.1007/978-3-030-04585-2_17

Download citation

DOI: https://doi.org/10.1007/978-3-030-04585-2_17
Published: 01 December 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-04584-5
Online ISBN: 978-3-030-04585-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics