Skip to main content

An Analysis of Machine Learning Approach for Detecting Automated Spammer in Twitter

  • Conference paper
  • First Online:
Book cover Advances in Distributed Computing and Machine Learning

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 127))

  • 592 Accesses

Abstract

With the evolution of social media, information sharing in online is turning into ubiquitous each and every day. Various facts are propagating thru online social networks along with each the wonderful and negative. In this paper, our essential focal point is to resolve terrible or incorrect data such as rumors. Rumor is a serious problem in online social media. The wrong rumors can create many problems in society; hence, it is important to block these rumors. First in this paper, we have compared Naive Bayes algorithm and logistic regression for deciding which algorithm is better for spam classification. Then, we have proposed a model for detecting spammers in twitters and automatically blocking those rumors by hindering an assured subset of the nodes. In order to decrease the influence of the rumors, nodes are allotted an acceptance time threshold. If a user posts the rumor more than a particular threshold, the user gets automatically blocked. We have created a Java-based Web application using tomcat SQL server, JavaScript html and CSS, like Facebook where a user can search for friends, accept friend requests, chat with them, change profile pictures and post messages. In this paper, we used a supervised learning technique known as Naive Bayes Algorithm for blocking of the users. The analysis of users blocked per month can be also done which will help in study of users and rumor details.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Fazil M, Abulaish M (2018) A hybrid approach for detecting automated spammers in Twitter. IEEE Trans Inform Forensics Secur 13(11)

    Google Scholar 

  2. Amleshwaram AA, Reddy N, Yadav S, Gu G, Yang C (2013) CATS: characterizing automation of Twitter spammers. IEEE, Department of Electrical and Computer Engineering Texas A&M University

    Google Scholar 

  3. Talha A, Kara R (2017) A survey of spam detection methods on Twitter. Int J Adv Comput Sci Appl (IJACSA) 8(3)

    Google Scholar 

  4. Rajadesingan A, Zafarani R, Liu H (2015) Sarcasm detection on Twitter: a behavioral modeling approach. In: WSDM’15, Shanghai, China, 2–6 Feb 2015

    Google Scholar 

  5. Concone F, Lo Re G, Morana M, Ruocco C (2019) Twitter spam account detection by effective labelling. In: CEUR-WS.org/Vol-2315

    Google Scholar 

  6. Chatzakouy D, Kourtellisz N, Blackburnz J, De Cristofaro E, Stringhini G, Vakali A (2017) Mean birds: detecting aggression and bullying on Twitter, 12 May 2017

    Google Scholar 

  7. Martinez-Romo J, Araujo L (2012) Detecting malicious tweets in trending topics using a statistical analysis of language. Elsevier, Amsterdam

    Google Scholar 

  8. Shuy K, Slivaz A, Wangy S, Tang J, Liuy H (2017) Fake news detection on social media: a data mining perspective, 3 Sept 2017

    Google Scholar 

  9. Benhardus J, Kalita J (2013) Streaming trend detection in Twitter. Int J Web Based Commun 9(1)

    Google Scholar 

  10. Vishwarupe V, Bedekar M, Pande M, Hiwale A (2018) Intelligent Twitter spam detection: a hybrid approach, Jan 2018

    Google Scholar 

  11. Fernandes MA, Patel P, Marwala T (2015) Automated detection of human users in Twitter. Procedia Comput Sci 53:224–231

    Article  Google Scholar 

  12. Washhaa M, Qaroushb A, Mezghania M, Sedesa F (2017) A topic-based hidden Markov model for real-time spam tweets filtering

    Google Scholar 

  13. DeBarr D, Wechsler H (2012) Spam detection using random boost, 24 Mar 2012

    Google Scholar 

  14. Delany SJ, Buckley M, Greene D (2012) SMS spam filtering: methods and data. Expert Syst Appl 39

    Google Scholar 

  15. Bajaj S, Garg N, Singh SK (2017) A novel user-based spam review detection. Procedia Comput Sci 122

    Google Scholar 

  16. Naem AA, Ghali NI, Saleh AA (2018) Antlion optimization and boosting classifier for spam email detection. Future Comput Inform J 3

    Google Scholar 

  17. Biggio B, Fumera G, Pillai I, Roli F (2011) A survey and experimental evaluation of image spam filtering techniques. Pattern Recogn Lett 32

    Google Scholar 

  18. Farisa H, Al-Zoubia AM, Heidarib AA, Aljaraha I, Mafarja M, Hassonaha MA, Fujitad H (2019) An intelligent system for spam detection and identification of the most relevant features based on evolutionary Random Weight Networks. Inform Fusion 48

    Google Scholar 

  19. Inuwa-Dutse I, Liptrott M, Korkontzelos I (2018) Detection of spam-posting accounts on Twitter. Neurocomputing 315

    Google Scholar 

  20. Wang B, Chen G, Fu L, Song L, Wang X, Liu X (2016) DRIMUX: dynamic rumor influence minimization with user experience in social networks

    Google Scholar 

  21. Arkalgud N (2018) Logistic regression for spam filtering, 14 Feb 2008

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to C. Vanmathi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Vanmathi, C., Mangayarkarasi, R. (2021). An Analysis of Machine Learning Approach for Detecting Automated Spammer in Twitter. In: Tripathy, A., Sarkar, M., Sahoo, J., Li, KC., Chinara, S. (eds) Advances in Distributed Computing and Machine Learning. Lecture Notes in Networks and Systems, vol 127. Springer, Singapore. https://doi.org/10.1007/978-981-15-4218-3_43

Download citation

Publish with us

Policies and ethics