Skip to main content

Analysis of Breast Cancer Detection Techniques Using RapidMiner

  • Conference paper
  • First Online:
Proceedings of International Conference on Artificial Intelligence and Applications

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1164))

Abstract

One of the most widely spread diseases among women is the breast cancer. In past few years, the incidences of breast cancer kept on rising. At this point, diagnosis of the cancer is crucial. However, breast cancer is treatable if it is identified during the earlier stages. Classification of tumor in case of breast cancer is done using machine learning algorithms, viz. decision trees, regression, SVM, k-NN, and Naïve Bayes. Then, accuracy of these algorithms is compared to predict the class (benign, malignant) of tumor, and the most appropriate algorithm is suggested based on the results. Wisconsin Breast Cancer Diagnostic Dataset is used. The data has been preprocessed, split, and applied to respective models. Tenfold cross-validation is applied to determine the accuracies.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. E. Alpaydin, Introduction to Machine Learning, 2nd edn. (MIT Press, Cambridge, 2010)

    Google Scholar 

  2. D. Mittal, D. Gaurav, S.S. Roy, An effective hybridized classifier for breast cancer diagnosis, in Proceeding of IEEE International Conference on Advanced Intelligent Mechatronics (2015), pp. 1026–1031

    Google Scholar 

  3. M. Vasantha, Bharathi, V. Subbiah, R. Dhamodharan, Medical image feature, extraction, selection and classification. Int. J. Eng. Sci. Technol. 2(6), 2071–2076 (2010)

    Google Scholar 

  4. P. Saranya, B. Satheeskumar, A survey on feature selection of cancer disease using data mining techniques. Int. J. Comput. Sci. Mob. Comput. 5(5), 713–719 (2016)

    Google Scholar 

  5. K. Kourou, T.P. Exarchos, K.P. Exarchos, M.V. Karamouzis, D.I. Fotiadis, Mini review on machine learning applications in cancer prognosis and prediction. Comput. Struct. Biotechnol. J. 13, 8–17 (2015)

    Article  Google Scholar 

  6. J.A. Cruz, D.S. Wishart, Applications of machine learning in cancer prediction and prognosis. Cancer Inform. 2, 59–77 (2007)

    Google Scholar 

  7. S. Syed Shajahaan, S. Shanthi, V. Mano Chitra, Application of data mining techniques to model breast cancer data. Int. J. Emerg. Technol. Adv. Eng. 3(11), 362–369 (2013)

    Google Scholar 

  8. O.L. Mangasarian, W. Nick Street, W.H. Wolberg, Breast cancer diagnosis and prognosis via linear programming. Oper. Res. 43(4), 570–577 (1995)

    Google Scholar 

  9. C. Nalini, D. Meera, Breast cancer prediction using data mining method. Int. J. Pure Appl. Math. 119(12), 10901–10912 (2018)

    Google Scholar 

  10. H. Asri, H. Mousannif, H. Al Moatassime, N. Thomas, Using machine learning algorithms for breast cancer risk prediction and diagnosis. Procedia Comput. Sci. 83, 1064–1069 (2016)

    Google Scholar 

  11. N.S. Altman, An introduction to kernel and nearest-neighbour nonparametric regression. Am. Stat. 46(3), 175–185 (1992)

    Google Scholar 

  12. M. van Gerven, S. Bohte, Artificial neural networks as models of neural information processing. Front Comput. Neurosci. 114(11), 1–2 (2017)

    Google Scholar 

  13. N. Tariq, Breast cancer detection using artificial neural networks. J. Mol. Biomark. Diagn. 9(1), 1–6 (2017)

    MathSciNet  Google Scholar 

  14. D.A. Freedman, Statistical Models: Theory and Practice, 2nd edn. (Cambridge University Press, Berkeley, 2007), p. 498

    Google Scholar 

  15. A.C. Rencher, F. William Christensen, Multivariate Regression, 3rd edn. (Wiley, Hoboken, 2012)

    Google Scholar 

  16. H.L. Seal, The historical development of the Gauss linear model. Biometrika 54(12), 1–24 (1967)

    MathSciNet  MATH  Google Scholar 

  17. X. Yan, X.G. Su, Linear regression analysis: theory and computing. Int. Stat. Rev. 78, 134–159 (2010)

    Google Scholar 

  18. C. Cortes, V.N. Vladimir, Support-vector networks. Mach. Learn. 20(3), 273–297 (1995)

    MATH  Google Scholar 

  19. A. Ben-Hur, D. Horn, H.T. Siegelmann, V. Vapnik, Support vector clustering. J. Mach. Learn. Res. 2, 125–137 (2011)

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Aman Jatain .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Nanda, A., Jatain, A. (2021). Analysis of Breast Cancer Detection Techniques Using RapidMiner. In: Bansal, P., Tushir, M., Balas, V., Srivastava, R. (eds) Proceedings of International Conference on Artificial Intelligence and Applications. Advances in Intelligent Systems and Computing, vol 1164. Springer, Singapore. https://doi.org/10.1007/978-981-15-4992-2_1

Download citation

Publish with us

Policies and ethics