Skip to main content

Mining High Utility Itemsets from Multiple Databases

  • Conference paper
  • First Online:
Advances in Smart Vehicular Technology, Transportation, Communication and Applications (VTCA 2018)

Abstract

In the past, many algorithms have been developed to efficiently mine the high-utility itemsets from a single data source, which is not a realistic scenario since the data may be distributed into varied branches, and the discovered information should be integrated together for making the effective decision. In this paper, we focus on developing an efficient algorithm for synthesizing the mined high-utility itemsets from different sources. A baseline algorithm is first designed and two criteria are then developed to verify whether the designed algorithm is efficient to generate the same number of the high-utility itemsets as the batch-processed algorithm. Experiments are then shown that the designed algorithm has good performance for rule synthesization.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Agrawal, R. Srikant, R.: Fast algorithms for mining association rules in large databases. In: The International Conference on Very Large Data Bases, pp. 487–499 (1994)

    Google Scholar 

  2. Agrawal, R., Shafer, J.C.: Parallel mining of association rules. IEEE Trans. Knowl. Data Eng. 8(6), 962–969 (1996)

    Article  Google Scholar 

  3. Adhikari, A.: Synthesizing global exceptional patterns in different data sources. J. Intell. Syst. 21(3), 293–323 (2012)

    Google Scholar 

  4. Fournier-Viger, P., Lin, J.C.W., Gomariz, A., Gueniche, T., Soltani, A., Deng, Z., Lam, H.T.: The SPMF open-source data mining library version 2. In: The Joint European Conference on Machine Learning and Knowledge Discovery in Databases, pp. 36–40 (2016)

    Google Scholar 

  5. Fournier-Viger, P., Lin, J.C.W., Duong, Q.H., Dam, T.L.: FHM +: faster high-utility itemset mining using length upper-bound reduction. The International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems, pp. 115–127 (2016)

    Google Scholar 

  6. Gan, W., Lin, J.C.W., Chao, H.C., Zhan, J.: Data mining in distributed environment: a survey. Wiley Interdiscip. Rev.: Data Min. Knowl. Discov. 7(6), 1–19 (2017)

    Google Scholar 

  7. Gan, W., Lin, J.C.W., Fournier-Viger, P., Chao, H.C., Tseng, V.S., Yu, P.: A survey of utility-oriented pattern mining, https://arxiv.org/pdf/1805.10511.pdf (2018)

  8. Gan, W., Lin, J.C.W., Fournier-Viger, P., Chao, H.C., Hong, T.P., Fujita, H.: A survey of incremental high-utility itemset mining. Wiley Interdiscip. Rev.: Data Min. Knowl. Discov. 8(2), 1–23 (2018)

    Google Scholar 

  9. Gan, W., Lin, J.C.W., Fournier-Viger, P., Chao, H.C., Fujita, H.: Extracting non-redundant correlated purchase behaviors by utility measure. Knowl.-Based Syst. 143, 30–41 (2018)

    Article  Google Scholar 

  10. Han, J., Pei, J., Yin, Y., Mao, R.: Mining frequent patterns without candidate generation: a frequent-pattern tree approach. Data Min. Knowl. Discov. 8(1), 53–87 (2004)

    Article  MathSciNet  Google Scholar 

  11. Liu, Y., Liao, W.K., Choundhary A.: A fast high utility itemsets mining algorithm. In: The International Workshop on Utility-based Data Mining, pp. 90–99 (2005)

    Google Scholar 

  12. Lin, J.C.W., Gan, W., Hong, T.P., Zhang, B.: An incremental high-utility mining algorithm with transaction insertion. Sci. World J. 161564, 15 pages (2015)

    Google Scholar 

  13. Lin, J.C.W., Ren, S., Fournier-Viger, P.: MEMU: more efficient algorithm to mine high average-utility patterns with multiple minimum average-utility thresholds. IEEE Access 14(8), 7593–7609 (2018)

    Article  Google Scholar 

  14. Otey, M.E., Parthasarathy, S., Wang, C., Veloso, A., Meira, W.: Parallel and disctributed methods for incremental frequent itemset mining. IEEE Trans. Syst. Man Cybern. Part B: Cybern. 36(6), 2439–2450 (2004)

    Article  Google Scholar 

  15. Ozkural, E., Ucar, B., Aykanat, C.: Parallel frequent item set mining with selective item replication. IEEE Trans. Parallel Distrib. Syst. 22(10), 1632–1640 (2011)

    Article  Google Scholar 

  16. Sohrabi, M.K., Barforoush, A.A.: Parallel frequent itemset mining using systolic arrays. Knowl.-Based Syst. 37, 462–471 (2013)

    Article  Google Scholar 

  17. Wu, X., Zhang, S.: Synthesizing high-frequency rules from different data sources. IEEE Trans. Knowl. Data Eng. 15(2), 353–367 (2003)

    Article  Google Scholar 

  18. Yao, H., Hamilton, H.J.: Mining itemset utilities from transaction databases. Data Knowl. Eng. 59(3), 603–626 (2006)

    Article  Google Scholar 

  19. Zhong, N., Yao, Y.Y.Y., Ohshima, M.: Peculiarity oriented multidatabase mining. IEEE Trans. Knowl. Data Eng. 15(4), 952–960 (2003)

    Article  Google Scholar 

  20. Zhang, S., Zhang, C., Wu, X.: Knowledge Discovery in Multiple Databases. Springer, London (2004)

    Book  Google Scholar 

  21. Zhang, B., Lin, J.C.W., Shao, Y., Fournier-Viger, P., Djenouri, Y.: Maintenance of discovered high average-utility itemsets in dynamic databases. Appl. Sci. 8(5), 1–19 (2018)

    Google Scholar 

Download references

Acknowledgements

This research was partially supported by the Shenzhen Technical Project under JCYJ20170307151733005 and KQJSCX20170726103424709 and by the National Natural Science Foundation of China (NSFC) under grant No. 61503092.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jerry Chun-wei Lin .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Lin, J.Cw., Li, Y., Fournier-Viger, P., Tang, L. (2019). Mining High Utility Itemsets from Multiple Databases. In: Zhao, Y., Wu, TY., Chang, TH., Pan, JS., Jain, L. (eds) Advances in Smart Vehicular Technology, Transportation, Communication and Applications. VTCA 2018. Smart Innovation, Systems and Technologies, vol 128. Springer, Cham. https://doi.org/10.1007/978-3-030-04585-2_17

Download citation

Publish with us

Policies and ethics