AI-Driven Big Data Analytics: Scalable Architectures and Real-Time Processing

Authors

  • An Li University of the East, Manila, Philippines Author

Keywords:

artificial intelligence, big data, real-time processing, scalability, machine learning

Abstract

With the rapid growth of big data, the integration of Artificial Intelligence (AI) has become crucial for enhancing the scalability and real-time processing capabilities of data systems. This paper explores how AI-driven models, including machine learning, deep learning, and reinforcement learning, are revolutionizing big data analytics by improving data processing efficiency and enabling immediate, data-driven decision-making. It discusses the role of scalable architectures like cloud computing, distributed systems, and edge computing in supporting AI's capabilities, and how platforms such as Kafka and Flink facilitate real-time data stream processing. Additionally, this study examines the challenges of data quality, model scalability, and ethical concerns in AI-powered big data systems. The paper concludes with insights on future trends, such as AutoML, TinyML, and federated learning, which promise to further enhance the integration of AI and big data in real-time analytics.

References

1. Y. Kumar, J. Marchena, A. H. Awlla, J. J. Li, and H. B. Abdalla, "The AI-Powered Evolution of Big Data," Appl. Sci., vol. 14, no. 22, p. 10176, 2024, doi: 10.3390/app142210176.

2. Y. Demchenko, C. De Laat, and P. Membrey, "Defining architecture components of the Big Data Ecosystem," in Proc. 2014 Int. Conf. Collabor. Technol. Syst. (CTS), 2014, pp. 104-112, doi: 10.1109/CTS.2014.6867550.

3. C. Ji, Y. Li, W. Qiu, Y. Jin, Y. Xu, U. Awada, K. Li, and W. Qu, "Big data processing: Big challenges and opportunities," J. Inter-connect. Netw., vol. 13, no. 03n04, p. 1250009, 2012, doi: 10.1142/S0219265912500090.

4. T. Nasser and R. S. Tariq, "Big data challenges," J. Comput. Eng. Inf. Technol., vol. 4, p. 3, 2015, doi: 10.4172/2324-9307.1000135.

5. J. S. Park and J. H. Park, "Enhanced machine learning algorithms: deep learning, reinforcement learning, and q-learning," J. Inf. Process. Syst., vol. 16, no. 5, pp. 1001-1007, 2020, doi: 10.3745/JIPS.02.0139.

6. R. K. Yadav, T. R. Mahesh, A. Kumar, R. Jayadurga, A. Vishal, and G. Sood, "Analyzing the benefits of applying machine learning to real-time data analysis," in Proc. 2023 3rd Int. Conf. Smart Gen. Comput., Commun. Networking (SMART GENCON), 2023, pp. 1-6, doi: 10.1109/SMARTGENCON60755.2023.10442377.

7. D. García-Gil, S. Ramírez-Gallego, S. García, and F. Herrera, "A comparison on scalability for batch big data processing on Apache Spark and Apache Flink," Big Data Anal., vol. 2, pp. 1-11, 2017, doi: 10.1186/s41044-016-0020-2.

8. K. B. Letaief, Y. Shi, J. Lu, and J. Lu, "Edge artificial intelligence for 6G: Vision, enabling technologies, and applications," IEEE J. Sel. Areas Commun., vol. 40, no. 1, pp. 5-36, 2021, doi: 10.1109/JSAC.2021.3126076.

9. A. Ahmadi and N. RabieNezhad Ganji, "AI-driven medical innovations: transforming healthcare through data intelligence," Int. J. BioLife Sci., vol. 2, no. 2, pp. 132-142, 2023, doi: 10.22034/ijbls.2023.185475.

10. S. Saxena and S. Gupta, Practical Real-Time Data Processing and Analytics: Distributed Computing and Event Processing Using Apache Spark, Flink, Storm, and Kafka, Packt Publishing Ltd., 2017. ISBN: 9781787281202.

11. M. Devan, S. Prakash, and S. Jangoan, "Predictive maintenance in banking: leveraging AI for real-time data analytics," J. Knowl. Learn. Sci. Technol., vol. 2, no. 2, pp. 483-490, 2023, doi: 10.60087/jklst.vol2.n2.p490.

12. E. Z. Snigdha, M. R. Hossain, and S. Mahabub, "AI-powered healthcare tracker development: advancing real-time patient monitoring and predictive analytics through data-driven intelligence," J. Comput. Sci. Technol. Stud., vol. 5, no. 4, pp. 229-239, 2023, doi: 10.32996/jcsts.2023.5.4.24.

13. D. Wiltshire and S. Alvanides, "Ensuring the ethical use of big data: lessons from secure data access," Heliyon, vol. 8, no. 2, 2022, doi: 10.1016/j.heliyon.2022.e08981.

14. P. Fusco, G. P. Rimoli, and M. Ficco, "TinyML and federated learning," in Artificial Intelligence Techniques for Analysing Sensitive Data in Medical Cyber-Physical Systems: System Protection and Data Analysis, p. 113, 2023. ISBN: 9783031707742.

Downloads

Published

15 April 2025

Issue

Section

Article

How to Cite

Li, A. (2025). AI-Driven Big Data Analytics: Scalable Architectures and Real-Time Processing. European Journal of AI, Computing & Informatics, 1(1), 33-41. http://pinnaclepubs.com/index.php/EJACI/article/view/39