Abstract
Data mining is the process of deriving hidden, previously unknown, but potentially useful information from massive data. Big data refers to technologies for collecting, processing, analyzing, and extracting useful information from very large volumes of structured and unstructured data produced by various sources at high speed, and it has a great impact on scientific discovery and value creation. This paper defines the 5Vs of big data, distinguishes big data from big data analytics, and then describes the architecture of big data. Some tools representing the Hadoop ecosystem are also presented.
Copyright information
© 2021 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Prajapati, M., Patel, S. (2021). A Review on Big Data with Data Mining. In: Kotecha, K., Piuri, V., Shah, H., Patel, R. (eds) Data Science and Intelligent Applications. Lecture Notes on Data Engineering and Communications Technologies, vol 52. Springer, Singapore. https://doi.org/10.1007/978-981-15-4474-3_17
DOI: https://doi.org/10.1007/978-981-15-4474-3_17
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-4473-6
Online ISBN: 978-981-15-4474-3
eBook Packages: Intelligent Technologies and Robotics (R0)