Skip to main content

Image Segment Classification Using CNN

  • Conference paper
  • First Online:

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1081))

Abstract

At present, the rapid development of convolutional neural networks (CNNs or ConvNets) as a tool for image detection and classification encourages us to implement them more widely in CBIR systems. This paper compares the transfer learning architecture for selected pre-trained CNNs: AlexNet, GoogleNet, VGG16 and VGG19 with the simple ConvNet training from scratch. We have analysed three solvers (optimisation methods): the stochastic gradient descent with the momentum (SGDM) optimizer, the RMSProp optimizer and the ADAM optimization method for both models with and without transfer learning. We concentrated on (3 × 3), (7 × 7) and (9 × 9) sizes of convolutional filters, respectively for each of the above mentioned ConvNets.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. LeCun, Y., Bengio, Y., Hinton, G.: Deep learning. Nature 521, 436–444 (2015)

    Article  Google Scholar 

  2. Jaworska, T.: Spatial representation of object location for image matching in CBIR. In: Zgrzywa, A., Choroś, K., Siemiński, A. (eds.) New Research in Multimedia and Internet Systems, vol. 314, pp. 25–34. Springer, Wrocław (2014)

    Google Scholar 

  3. Rish, I.: An empirical study of the Naïve Bayes classifier. In: Proceedings of the IJCAI 2001 Workshop on Empirical Methods in AI, Brussels (2001)

    Google Scholar 

  4. Ishibuchi, H., Nojima, Y.: Toward quantitative definition of explanation ability of fuzzy rule-based classifiers. In: IEEE International Conference on Fuzzy Systems, Taipei, Taiwan, 27–39 June 2011

    Google Scholar 

  5. Jaworska, T.: Application of fuzzy rule-based classifier to CBIR in comparison with other classifiers. In: 11th International Conference on Fuzzy Systems and Knowledge Discovery, Xiamen, China, 19–21 August 2014

    Google Scholar 

  6. Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: Proceedings of the 25th International Conference on Neural Information Processing Systems – NIPS 2012, Lake Tahoe, Nevada, USA, 3–6 December 2012

    Google Scholar 

  7. Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., Rabinovich, A.: Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2015)

    Google Scholar 

  8. Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: Proceedings of International Conference on Learning Representations, San Diego, CA, 7–9 May 2015

    Google Scholar 

  9. Pan, S.J., Yang, Q.: A survey on transfer learning. IEEE Trans. Knowl. Data Eng. 22(10), 1345–1359 (2010)

    Article  Google Scholar 

  10. Huang, Z., Pan, Z., Lei, B.: Transfer learning with deep convolutional neural network for SAR target classification with limited labeled data. Remote Sens. 9(9), 907 (2017)

    Article  Google Scholar 

  11. Chen, S., Zhang, C., Dong, M.: Coupled end-to-end transfer learning with generalized fisher information. In: The Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, June 2018

    Google Scholar 

  12. LeCun, Y., Bottou, L., Orr, G.B., Müller, K.R.: Efficient backprop. In: Orr, G.B., Müller, K.R. (eds.) Neural Networks: Tricks of the Trade, pp. 9–50. Springer, Heidelberg (1998)

    Chapter  Google Scholar 

  13. Lin, M., Chen, Q., Yan, S.: Network in network, March 2014

    Google Scholar 

  14. Hinton, G., Srivastava, N., Swersky, K.: Neural Networks for Machine Learning, The momentum method, Toronto (2012)

    Google Scholar 

  15. Loos, S., Irving, G., Szegedy, C., Kaliszyk, C.: Deep network guided proof search. In: 21st International Conference on Logic for Programming, Artificial Intelligence and Reasoning, Maun, Botswana, 7–12 May 2017

    Google Scholar 

  16. Dauphin, Y., de Vries, H., Chung, J., Bengio, Y.: RMSProp and equilibrated adaptive learning rates for non-convex optimization. Neural Information Processing Systems, 15 February 2015

    Google Scholar 

  17. Kingma, D.P., Ba, J.L.: ADAM: a method for stochastic optimization. In: Proceedings of the 3rd International Conference on Learning Representations, San Diego, USA, May 2015

    Google Scholar 

  18. Masters, D., Luschi, C.: Revisiting small batch training for deep neural networks, Bristol, UK. arXiv e-prints (2018)

    Google Scholar 

Download references

Acknowledgement

We would like to thank the Project Office MTM Style for the kind permission to use their visualisation of projects ‘Amadeusz’ and ‘Aksamit’, on which we tested our results.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tatiana Jaworska .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Jaworska, T. (2021). Image Segment Classification Using CNN. In: Atanassov, K., et al. Uncertainty and Imprecision in Decision Making and Decision Support: New Challenges, Solutions and Perspectives. IWIFSGN 2018. Advances in Intelligent Systems and Computing, vol 1081. Springer, Cham. https://doi.org/10.1007/978-3-030-47024-1_38

Download citation

Publish with us

Policies and ethics