eprintid: 4374 rev_number: 8 eprint_status: archive userid: 408 dir: disk0/00/00/43/74 datestamp: 2021-03-15 04:23:03 lastmod: 2021-03-15 04:23:03 status_changed: 2021-03-15 04:23:03 type: book_section succeeds: 4046 metadata_visibility: show creators_name: Le, Nguyen Tuan Thanh creators_name: Pham, Manh Linh creators_id: thanhlnt@tlu.edu.vn creators_id: linhmp@vnu.edu.vn title: Big Data Analytics and Machine Learning for Industry 4.0: An Overview ispublished: pub subjects: IT divisions: FIMO divisions: fac_fit keywords: Big Data Analytics, Industry 4.0, Machine Learning, Deep Learning abstract: The concept of “Big data” was mentioned for the first time by Roger Mougalas in 2005. Volume hints to the size and/or scale of datasets. Until now, there is no universal threshold for data volume to be considered as big data, because of the time and diversity of datasets. Velocity indicates the speed of processing data. It can fall into three categories: streaming processing, real-time processing, or batch processing. Value alludes to the usefulness of data for decision making. Veracity denotes the quality and trustworthiness of datasets. Parallelization allows one to improve computation time by dividing big problems into smaller instances, distributing smaller tasks across multiple threads and then performing them simultaneously. Feature selection is useful for preparing high scale datasets. Sampling is a method for data reducing that helps to derive patterns in big datasets by generating, manipulating, and analyzing subsets of the original data. date: 2021-01-31 date_type: published publisher: CRC Press - Taylor & Francis Group, LLC official_url: https://doi.org/10.1201/9781003048855 full_text_status: public series: Big Data for Industry 4.0: Challenges and Applications place_of_pub: Boca Raton, FL, USA pagerange: 1-11 pages: 12 refereed: TRUE isbn: 9781003048855 book_title: Industry 4.0 Interoperability, Analytics, Security, and Case studies editors_name: Rajesh, G. editors_name: X. Mercilin, Raajini editors_name: Dang, Thi Thu Hien editors_id: gr@annauniv.edu editors_id: raajii.mercy@gmail.com editors_id: hiendt@tlu.edu.vn related_url_url: https://sites.google.com/view/grajesh/iaas?authuser=0 related_url_url: https://www.taylorfrancis.com/chapters/big-data-analytics-machine-learning-industry-4-0-overview-nguyen-tuan-thanh-le-manh-linh-pham/e/10.1201/9781003048855-1?context=ubx&refId=138e5295-f15d-4ebb-a517-0cfb2b3f1bda related_url_type: org related_url_type: pub referencetext: 1. R. Magoulas and B. Lorica. Introduction to big data. O’Reilly Media, Sebastopol, CA, February 2009. 2. J. Gantz and D. Reinsel. Extracting value from chaos. IDC iview, 1142(2011):1–12, 2011. 3. R. H. Hariri, E. M. Fredericks, and K. M. Bowers. Uncertainty in big data analytics: survey, opportunities, and challenges. Journal of Big Data, 6(1):44, 2019. 4. M. Chen, S. Mao, and Y. Liu. Big data: A survey. Mobile Networks and Applications, 19(2):171–209, 2014. 5. J. Han, J. Pei, and M. Kamber. Data mining: concepts and techniques. Elsevier, 2011. 6. M. Schuldenfrei. Big data challenges of industry 4.0. Datanami, April 25 2019. 7. N. Golchha. Big data-the information revolution. International Journal of Advanced Research, 1(12):791–794, 2015. 8. C.-W. Tsai, C.-F. Lai, H.-C. Chao, and A. V. Vasilakos. Big data analytics: A survey. Journal of Big data, 2(1):21, 2015. 9. M. Hilbert. Big data for development: A review of promises and challenges. Development Policy Review, 34(1):135–174, 2016. 10. X. Wang and Y. He. Learning from uncertainty for big data: Future analytical challenges and strategies. IEEE Systems, Man, and Cybernetics Magazine, 2(2):26–31, 2016. 11. A. Bargiela and W. Pedrycz. Granular computing. In Handbook on Computational Intelligence: Volume 1: Fuzzy Logic, Systems, Artifcial Neural Networks, and Learning Systems, pages 43–66. World Scientifc, 2016. 12. J. Kacprzyk, D. Filev, and G. Beliakov. Granular, Soft and Fuzzy Approaches for Intelligent Systems. Springer International Publishing, New York City, NY, 2017. 13. R. R. Yager. Decision making under measure-based granular uncertainty. Granular Computing, 3(4):345–353, 2018. 14. H. Liu and H. Motoda. Computational Methods of Feature Selection. CRC Press, Boca Raton, FL, 2007. 15. J. A. Olvera-López, J. A. Carrasco-Ochoa, J. F. Martínez-Trinidad, and J. Kittler. A review of instance selection methods. Artifcial Intelligence Review, 34(2):133–143, 2010. 16. Y. LeCun, Y. Bengio, and G. Hinton. Deep learning. Nature, 521(7553):436–444, 2015. 17. A. Burkov. The Hundred-Page Machine Learning Book. Andriy Burkov, Quebec City, 2019. 18. V. Zubarev. Machine learning for everyone: In simple words. with real-world examples. yes, again, 2019. 19. F. Chollet. Deep Learning with Python. Manning Publications Co., 2017. 20. K. Weiss, T. M. Khoshgoftaar, and D. Wang. A survey of transfer learning. Journal of Big data, 3(1):9, 2016. 21. J. Qiu, Q. Wu, G. Ding, Y. Xu, and S. Feng. A survey of machine learning for big data processing. EURASIP Journal on Advances in Signal Processing, 2016(1):67, 2016. 22. L. M. Pham and T.-M. Pham. Autonomic fne-grained migration and replication of component-based applications across multi-clouds. In 2015 2nd National Foundation for Science and Technology Development Conference on Information and Computer Science (NICS), pages 5–10. IEEE, 2015. 23. S. Athmaja, M. Hanumanthappa, and V. Kavitha. A survey of machine learning algorithms for big data analytics. In 2017 International Conference on Innovations in Information, Embedded and Communication Systems (ICIIECS), pages 1–4. IEEE, 2017. 24. M. M. Najafabadi, F. Villanustre, T. M. Khoshgoftaar, N. Seliya, R. Wald, and E. Muharemagic. Deep learning applications and challenges in big data analytics. Journal of Big Data, 2(1):1, 2015. 25. M. Mohammadi, A. Al-Fuqaha, S. Sorour, and M. Guizani. Deep learning for iot big data and streaming analytics: A survey. IEEE Communications Surveys & Tutorials, 20(4):2923–2960, 2018. 26. J. Schmidhuber. Deep learning in neural networks: An overview. Neural Networks, 61:85–117, 2015. 27. I. Goodfellow, Y. Bengio, and A. Courville. Deep Learning. MIT Press, 2016. 28. A. Luckow, M. Cook, N. Ashcraft, E. Weill, E. Djerekarov, and B. Vorster. Deep learning in the automotive industry: Applications and tools. In 2016 IEEE International Conference on Big Data (Big Data), pages 3759–3768. IEEE, 2016. 29. H. Lee, Y. Kim, and C. O. Kim. A deep learning model for robust wafer fault monitoring with sensor measurement noise. IEEE Transactions on Semiconductor Manufacturing, 30(1):23–31, 2016. 30. W. Yan and L. Yu. On accurate and reliable anomaly detection for gas turbine combustors: A deep learning approach. arXiv preprint arXiv:1908.09238, 2019. 31. H. Shao, H. Jiang, F. Wang, and H. Zhao. An enhancement deep feature fusion method for rotating machinery fault diagnosis. Knowledge-Based Systems, 119:200–220, 2017. 32. H. Lee. Framework and development of fault detection classifcation using iot device and cloud environment. Journal of Manufacturing Systems, 43:257–270, 2017. funders: Vietnam National University, Hanoi (VNU) projects: QG.20.55 citation: Le, Nguyen Tuan Thanh and Pham, Manh Linh (2021) Big Data Analytics and Machine Learning for Industry 4.0: An Overview. In: Industry 4.0 Interoperability, Analytics, Security, and Case studies. Big Data for Industry 4.0: Challenges and Applications . CRC Press - Taylor & Francis Group, LLC, Boca Raton, FL, USA, pp. 1-11. ISBN 9781003048855 document_url: https://eprints.uet.vnu.edu.vn/eprints/id/eprint/4374/6/BookCRCPressThanhLinh.pdf