Skip to main content

Text Mining Innovation for Business

  • Chapter
  • First Online:
New Trends in Business Information Systems and Technology

Abstract

This chapter reflects on the business innovation supported by developing text mining solutions to meet the business needs communicated by Swiss companies. Two related projects from different industries and with different challenges are discussed in order to identify common procedures and methodologies that can be used. One of the partners, in the gig work sector, offers a platform solution for employee recruitment for temporary work. The work assessment is performed using short reviews for which a method for sentiment assessment based on machine learning has been developed. The other partner, in the financial advice sector, operates an information extraction service for business documents, including insurance policies. This requires automation in the extraction of structured information from pdf-files. The common path to innovation in such projects includes business process modeling and the implementation of novel technological solutions, including text mining techniques.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 139.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 179.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 179.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Davidovski, V.: Exponential innovation through digital transformation. In: Proceedings of the 3rd International Conference on Applications in Information Technology, pp. 3–5. ACM, New York, NY, USA (2018). https://doi.org/10.1145/3274856.3274858

  2. Ivascu, L., Cirjaliu, B., Draghici, A.: Business model for the university-industry collaboration in open innovation. Procedia Econ. Finan. 39, 674–678 (2016). https://doi.org/10.1016/S2212-5671(16)30288-X

    Article  Google Scholar 

  3. Nambisan, S., Lyytinen, K., Majchrzak, A., Song, M.: Digital innovation management: reinventing innovation management research in a digital world. MIS Q. 41, (2017). https://doi.org/10.25300/misq/2017/41.1.01

  4. Weske, M.: Business Process Management: Concepts, Languages, Architectures. Springer (2012). https://doi.org/10.1007/978-3-642-28616-2_7

  5. Van der Aalst, W., Damiani, E.: Processes meet big data: connecting data science with process science. IEEE Trans. Serv. Comput. 8, 810–819 (2015). https://doi.org/10.1109/tsc.2015.2493732

  6. Pustulka-Hunt, E., Telesko, R., Hanne, T.: Gig work business process improvement. In: 2018 6th International Symposium on Computational and Business Intelligence (ISCBI), pp. 10–15 (2018). https://doi.org/10.1109/iscbi.2018.00013

  7. Calder, A.: EU GDPR: A Pocket Guide. IT Governance Publishing, Ely, Cambridgeshire, United Kingdom (2018). https://www.jstor.org/stable/j.ctt1cd0mkw

  8. Greenwood, B., Burtch, G., Carnahan, S.: Unknowns of the gig-economy. Commun. ACM 60(7), 27–29 (2017). https://doi.org/10.1145/3097349

    Article  Google Scholar 

  9. CNET. Your Next Insurance Agent Will Be a Robot. https://cacm.acm.org/careers/197572-your-next-insurance-agent-will-be-a-robot/fulltext. Accessed April 23, 2019

  10. Reich, M., Braasch, T.: Die Revolution der Prozessautomatisierung bei Versicherungsunternehmen: Robotic Process Automation (RPA). In: Handbuch Versicherungsmarketing, pp. 291–305. Springer (2019). https://doi.org/10.1007/978-3-662-57755-4_17

  11. smartFix. https://www.insiders-technologies.de/home/products/input-management/general-incoming-mail/smart-fix.html. Accessed 23 April 2019

  12. Weiss, S.M., Indurkhya, N., Zhang, T.: Fundamentals of Predictive Text Mining. Springer Publishing Company, Incorporated (2016). https://doi.org/10.1007/978-1-84996-226-1

  13. Baeza-Yates, R.A., Ribeiro-Neto, B.: Modern Information Retrieval. Addison-Wesley Longman Publishing Co., Inc, Boston, MA, USA (1999)

    Google Scholar 

  14. Kumar, B.S., Ravi, V.: A survey of the applications of text mining in financial domain. Know.-Based Syst. 114, 128–147 (2016). https://doi.org/10.1016/j.knosys.2016.10.003

  15. Baumgartner, R., Flesca, S., Gottlob, G.: Visual web information extraction with lixto. In: Proceedings of the 27th International Conference on Very Large Data Bases, pp. 119–128. Morgan Kaufmann Publishers Inc. (2001)

    Google Scholar 

  16. Piazza, F., Strohmeier, S.: Domain-driven data mining in human resource management: a review. In: Proceedings of the 2011 IEEE 11th International Conference on Data Mining Workshops, pp. 458–465. IEEE Computer Society (2011). https://doi.org/10.1109/icdmw.2011.68

  17. Ghani, R., Probst, K., Liu, Y., Krema, M., Fano, A.: Text mining for product attribute extraction. SIGKDD Explor. Newsl. 8, 41–48 (2006). https://doi.org/10.1145/1147234.1147241

    Article  Google Scholar 

  18. Burdick, D., Hernández, M.A., Ho, C.T.H., Koutrika, G., Krishnamurthy, R., Popa, L., Stanoi, I., Vaithyanathan, S., Das, S.R.: Extracting, linking and integrating data from public sources: a financial case study. IEEE Data Eng. Bull. 34, 60–67 (2011). https://doi.org/10.2139/ssrn.2666384

    Article  Google Scholar 

  19. Staar, P.W.J., Dolfi, M., Auer, C., Bekas, C.: Corpus conversion service: a machine learning platform to ingest documents at scale. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 774–782. ACM (2018). https://doi.org/10.1145/3219819.3219834

  20. Die Post. https://www.post.ch/de/geschaeftlich/themen-a-z/adressen-pflegen-und-geodaten-nutzen/adress-und-geodaten. Accessed 11 April 2019

  21. Vornamen in der Schweiz. Vornamen der Bevölkerung nach Geschlecht, Schweiz, 2017. BFS-Nummer: su-t-01.04.00.12. Accessed 23 April 2019

    Google Scholar 

  22. Pdfminer. Yusuke Shinyama. Pdfminer is a tool for extracting information from PDF documents. https://github.com/euske/pdfminer. Accessed 23 April 2019

  23. Pustulka-Hunt, E., Hanne, T., Blumer, E., Frieder, M.: Multilingual sentiment analysis for a swiss gig. In: 2018 6th International Symposium on Computational and Business Intelligence (ISCBI), pp. 94–98 (2018). https://doi.org/10.1109/iscbi.2018.00028

  24. Avila, J.: Scikit-learn cookbook: over 80 recipes for machine learning in python with scikit-learn. Packt Publishing, Birmingham, UK (2017). https://doi.org/10.1214/009053604000000067

  25. Matthews, B.W.: Comparison of the predicted and observed secondary structure of T4 phage lysozyme. Biochim. et Biophys. Acta (BBA)-Protein Structure 405, 442–451 (1975)

    Google Scholar 

  26. Yan, X., Guo, J., Lan, Y., Cheng, X.: A biterm topic model for short texts. In: Proceedings of the 22nd International Conference on World Wide Web, pp. 1445–1456. ACM (2013). https://doi.org/10.1145/2488388.2488514

  27. Xu, P., Barbosa, D.: Matching résumés to job descriptions with stacked models. In: Advances in Artificial Intelligence: 31st Canadian Conference on Artificial Intelligence, Canadian AI 2018, Toronto, ON, Canada, May 8–11, 2018, Proceedings 31, pp. 304–309. Springer (2018). https://doi.org/10.1007/978-3-319-89656-4_31

  28. Fua, P., Hanson, A.: An optimization framework for feature extraction. Mach. Vis. Appl. 4, 59–87 (1991)

    Article  Google Scholar 

Download references

Acknowledgements

Funding was provided by the Swiss Commission for Technology and Innovation CTI (now Innosuisse): 25826.1 PFES-ES and 34604.1 IP-ICT, and by the FHNW.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ela Pustulka .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Pustulka, E., Hanne, T. (2021). Text Mining Innovation for Business. In: Dornberger, R. (eds) New Trends in Business Information Systems and Technology. Studies in Systems, Decision and Control, vol 294. Springer, Cham. https://doi.org/10.1007/978-3-030-48332-6_4

Download citation

Publish with us

Policies and ethics