Abstract
This chapter reflects on the business innovation supported by developing text mining solutions to meet the business needs communicated by Swiss companies. Two related projects from different industries and with different challenges are discussed in order to identify common procedures and methodologies that can be used. One of the partners, in the gig work sector, offers a platform solution for employee recruitment for temporary work. The work assessment is performed using short reviews for which a method for sentiment assessment based on machine learning has been developed. The other partner, in the financial advice sector, operates an information extraction service for business documents, including insurance policies. This requires automation in the extraction of structured information from pdf-files. The common path to innovation in such projects includes business process modeling and the implementation of novel technological solutions, including text mining techniques.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Davidovski, V.: Exponential innovation through digital transformation. In: Proceedings of the 3rd International Conference on Applications in Information Technology, pp. 3–5. ACM, New York, NY, USA (2018). https://doi.org/10.1145/3274856.3274858
Ivascu, L., Cirjaliu, B., Draghici, A.: Business model for the university-industry collaboration in open innovation. Procedia Econ. Finan. 39, 674–678 (2016). https://doi.org/10.1016/S2212-5671(16)30288-X
Nambisan, S., Lyytinen, K., Majchrzak, A., Song, M.: Digital innovation management: reinventing innovation management research in a digital world. MIS Q. 41, (2017). https://doi.org/10.25300/misq/2017/41.1.01
Weske, M.: Business Process Management: Concepts, Languages, Architectures. Springer (2012). https://doi.org/10.1007/978-3-642-28616-2_7
Van der Aalst, W., Damiani, E.: Processes meet big data: connecting data science with process science. IEEE Trans. Serv. Comput. 8, 810–819 (2015). https://doi.org/10.1109/tsc.2015.2493732
Pustulka-Hunt, E., Telesko, R., Hanne, T.: Gig work business process improvement. In: 2018 6th International Symposium on Computational and Business Intelligence (ISCBI), pp. 10–15 (2018). https://doi.org/10.1109/iscbi.2018.00013
Calder, A.: EU GDPR: A Pocket Guide. IT Governance Publishing, Ely, Cambridgeshire, United Kingdom (2018). https://www.jstor.org/stable/j.ctt1cd0mkw
Greenwood, B., Burtch, G., Carnahan, S.: Unknowns of the gig-economy. Commun. ACM 60(7), 27–29 (2017). https://doi.org/10.1145/3097349
CNET. Your Next Insurance Agent Will Be a Robot. https://cacm.acm.org/careers/197572-your-next-insurance-agent-will-be-a-robot/fulltext. Accessed April 23, 2019
Reich, M., Braasch, T.: Die Revolution der Prozessautomatisierung bei Versicherungsunternehmen: Robotic Process Automation (RPA). In: Handbuch Versicherungsmarketing, pp. 291–305. Springer (2019). https://doi.org/10.1007/978-3-662-57755-4_17
smartFix. https://www.insiders-technologies.de/home/products/input-management/general-incoming-mail/smart-fix.html. Accessed 23 April 2019
Weiss, S.M., Indurkhya, N., Zhang, T.: Fundamentals of Predictive Text Mining. Springer Publishing Company, Incorporated (2016). https://doi.org/10.1007/978-1-84996-226-1
Baeza-Yates, R.A., Ribeiro-Neto, B.: Modern Information Retrieval. Addison-Wesley Longman Publishing Co., Inc, Boston, MA, USA (1999)
Kumar, B.S., Ravi, V.: A survey of the applications of text mining in financial domain. Know.-Based Syst. 114, 128–147 (2016). https://doi.org/10.1016/j.knosys.2016.10.003
Baumgartner, R., Flesca, S., Gottlob, G.: Visual web information extraction with lixto. In: Proceedings of the 27th International Conference on Very Large Data Bases, pp. 119–128. Morgan Kaufmann Publishers Inc. (2001)
Piazza, F., Strohmeier, S.: Domain-driven data mining in human resource management: a review. In: Proceedings of the 2011 IEEE 11th International Conference on Data Mining Workshops, pp. 458–465. IEEE Computer Society (2011). https://doi.org/10.1109/icdmw.2011.68
Ghani, R., Probst, K., Liu, Y., Krema, M., Fano, A.: Text mining for product attribute extraction. SIGKDD Explor. Newsl. 8, 41–48 (2006). https://doi.org/10.1145/1147234.1147241
Burdick, D., Hernández, M.A., Ho, C.T.H., Koutrika, G., Krishnamurthy, R., Popa, L., Stanoi, I., Vaithyanathan, S., Das, S.R.: Extracting, linking and integrating data from public sources: a financial case study. IEEE Data Eng. Bull. 34, 60–67 (2011). https://doi.org/10.2139/ssrn.2666384
Staar, P.W.J., Dolfi, M., Auer, C., Bekas, C.: Corpus conversion service: a machine learning platform to ingest documents at scale. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 774–782. ACM (2018). https://doi.org/10.1145/3219819.3219834
Die Post. https://www.post.ch/de/geschaeftlich/themen-a-z/adressen-pflegen-und-geodaten-nutzen/adress-und-geodaten. Accessed 11 April 2019
Vornamen in der Schweiz. Vornamen der Bevölkerung nach Geschlecht, Schweiz, 2017. BFS-Nummer: su-t-01.04.00.12. Accessed 23 April 2019
Pdfminer. Yusuke Shinyama. Pdfminer is a tool for extracting information from PDF documents. https://github.com/euske/pdfminer. Accessed 23 April 2019
Pustulka-Hunt, E., Hanne, T., Blumer, E., Frieder, M.: Multilingual sentiment analysis for a swiss gig. In: 2018 6th International Symposium on Computational and Business Intelligence (ISCBI), pp. 94–98 (2018). https://doi.org/10.1109/iscbi.2018.00028
Avila, J.: Scikit-learn cookbook: over 80 recipes for machine learning in python with scikit-learn. Packt Publishing, Birmingham, UK (2017). https://doi.org/10.1214/009053604000000067
Matthews, B.W.: Comparison of the predicted and observed secondary structure of T4 phage lysozyme. Biochim. et Biophys. Acta (BBA)-Protein Structure 405, 442–451 (1975)
Yan, X., Guo, J., Lan, Y., Cheng, X.: A biterm topic model for short texts. In: Proceedings of the 22nd International Conference on World Wide Web, pp. 1445–1456. ACM (2013). https://doi.org/10.1145/2488388.2488514
Xu, P., Barbosa, D.: Matching résumés to job descriptions with stacked models. In: Advances in Artificial Intelligence: 31st Canadian Conference on Artificial Intelligence, Canadian AI 2018, Toronto, ON, Canada, May 8–11, 2018, Proceedings 31, pp. 304–309. Springer (2018). https://doi.org/10.1007/978-3-319-89656-4_31
Fua, P., Hanson, A.: An optimization framework for feature extraction. Mach. Vis. Appl. 4, 59–87 (1991)
Acknowledgements
Funding was provided by the Swiss Commission for Technology and Innovation CTI (now Innosuisse): 25826.1 PFES-ES and 34604.1 IP-ICT, and by the FHNW.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Pustulka, E., Hanne, T. (2021). Text Mining Innovation for Business. In: Dornberger, R. (eds) New Trends in Business Information Systems and Technology. Studies in Systems, Decision and Control, vol 294. Springer, Cham. https://doi.org/10.1007/978-3-030-48332-6_4
Download citation
DOI: https://doi.org/10.1007/978-3-030-48332-6_4
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-48331-9
Online ISBN: 978-3-030-48332-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)