BIG DATA ANALYSIS FOR DISEASE PREDICTION AND PREVENTION
Keywords:
Big Data, Prediction, Disease PreventionAbstract
Big Data analytics in the context of disease prediction and prevention stands out as a critical issue in today's digital age, given its unique capacity to process and analyze massive volumes of health data with unprecedented speed and precision. Through the collection of extensive data from various sources, including electronic medical records, wearable devices, and other digital inputs, Big Data analysis enables researchers and healthcare practitioners to identify patterns and trends before diseases develop, forecast outbreaks, and respond proactively to potential health crises. Its ability to integrate and map health data at scale opens up opportunities for smarter prevention and personalized approaches to disease management, significantly shifting the landscape of disease prevention from reactive to proactive, which in turn could save millions of lives and reduce the economic burden on the global health system. The study in this research uses the literature research method. The results show that the use of Big Data and machine learning has great potential in strengthening health systems through disease prediction and prevention. Key findings show that the integration of extensive health data enables more effective identification of disease patterns and trends. With these technologies in place, the ability to diagnose and forecast diseases becomes faster and more accurate, which in turn, can help in designing appropriate and evidence-based interventions. In addition, improved machine learning methods continue to push the boundaries of predictiveness, providing new insights into proactive disease management and prevention.
References
Abdussamad, Z. (2022). Buku Metode Penelitian Kualitatif. Query date: 2024-05-25 20:59:55. https://doi.org/10.31219/osf.io/juwxn
Adlini, M. N., Dinda, A. H., Yulinda, S., Chotimah, O., & Merliyana, S. J. (2022). Metode Penelitian Kualitatif Studi Pustaka. Edumaspul: Jurnal Pendidikan, 6(1), 974–980. https://doi.org/10.33487/edumaspul.v6i1.3394
Aljanobi, F. A., & Lee, J. (2021). Topological Data Analysis for Classification of Heart Disease Data. 2021 IEEE International Conference on Big Data and Smart Computing (BigComp), Query date: 2024-06-09 05:26:26. https://doi.org/10.1109/bigcomp51126.2021.00047
AL‐Rummana, G. A., Al‐Ahdal, A. H. A., & Shinde, G. N. (2021). The Role of Big Data Analysis in Increasing the Crime Prediction and Prevention Rates. Intelligent Data Analytics for Terror Threat Prediction, Query date: 2024-06-09 05:26:26, 209–220. https://doi.org/10.1002/9781119711629.ch10
Ardakani, S. P., & Cheshmehzangi, A. (2023). On-Board Unit Freight Transport Data Analysis and Prediction: Big Data Analysis for Data Pre-processing and Result Accuracy. Big Data Analytics for Smart Transport and Healthcare Systems, Query date: 2024-06-09 05:26:26, 45–61. https://doi.org/10.1007/978-981-99-6620-2_3
Arjaria, S. K., Rathore, A. S., & Cherian, J. S. (2021). Kidney disease prediction using a machine learning approach: A comparative and comprehensive analysis. Demystifying Big Data, Machine Learning, and Deep Learning for Healthcare Analytics, Query date: 2024-06-09 05:26:26, 307–333. https://doi.org/10.1016/b978-0-12-821633-0.00006-4
Asri, H., & Jarir, Z. (2022). Toward a Smart Health: Reality Mining and Big Data Analytics for Real-Time Disease Prediction. Query date: 2024-06-09 05:26:26. https://doi.org/10.21203/rs.3.rs-1621912/v1
Bonomo, M., Placa, A. L., & Rombo, S. E. (2021). Prediction of Disease–lncRNA Associations via Machine Learning and Big Data Approaches. Knowledge Modelling and Big Data Analytics in Healthcare, Query date: 2024-06-09 05:26:26, 203–226. https://doi.org/10.1201/9781003142751-14
Cahyadi, & Forshaw, M. (2021). Hard Disk Failure Prediction on Highly Imbalanced Data using LSTM Network. 2021 IEEE International Conference on Big Data (Big Data), Query date: 2024-06-09 05:26:26. https://doi.org/10.1109/bigdata52589.2021.9671555
Capobianco, E., & Deng, J. (2022). Editorial: Big Data Analytics for Precision Health and Prevention. Frontiers in Big Data, 4(Query date: 2024-06-09 05:26:26). https://doi.org/10.3389/fdata.2021.835353
Dai, P., Chen, Y., & Feng, Y. (2022). Big Data Analysis of Applying Artificial Intelligence to Criminal Justice and Their Prevention. 2022 International Conference on Computation, Big-Data and Engineering (ICCBE), Query date: 2024-06-09 05:26:26. https://doi.org/10.1109/iccbe56101.2022.9888156
Debal, D. A., & Sitote, T. M. (2022). Chronic kidney disease prediction using machine learning techniques. Journal of Big Data, 9(1). https://doi.org/10.1186/s40537-022-00657-5
Deshpande, U., & Pham, T. (2022). Storage Capacity Prediction using Population Analytics. 2022 IEEE International Conference on Big Data (Big Data), Query date: 2024-06-09 05:26:26. https://doi.org/10.1109/bigdata55660.2022.10020706
Dritsas, E., & Trigka, M. (2022). Machine Learning Techniques for Chronic Kidney Disease Risk Prediction. Big Data and Cognitive Computing, 6(3), 98–98. https://doi.org/10.3390/bdcc6030098
Duan, Y. (2022). Statistical Analysis and Prediction of Employee Turnover Propensity Based on Data Mining. 2022 International Conference on Big Data, Information and Computer Network (BDICN), Query date: 2024-06-09 05:26:26. https://doi.org/10.1109/bdicn55575.2022.00052
ED-DAOUDY, A., Maalmi, K., & ouaazizi, A. E. (2022). A scalable and real-time system for disease prediction using big data processing. Query date: 2024-06-09 05:26:26. https://doi.org/10.21203/rs.3.rs-1567163/v1
Farashah, M. V., Etebarian, A., Azmi, R., & Dastjerdi, R. E. (2021). A hybrid recommender system based-on link prediction for movie baskets analysis. Journal of Big Data, 8(1). https://doi.org/10.1186/s40537-021-00422-0
Golande, A. L., & Pavankumar, T. (2023). Optical electrocardiogram based heart disease prediction using hybrid deep learning. Journal of Big Data, 10(1). https://doi.org/10.1186/s40537-023-00820-6
Hansun, S., Wicaksana, A., & Khaliq, A. Q. M. (2022). Multivariate cryptocurrency prediction: Comparative analysis of three recurrent neural networks approaches. Journal of Big Data, 9(1). https://doi.org/10.1186/s40537-022-00601-7
Hu, H., Xin, Y., Wu, X., Bao, C., Yang, L., & Bai, L. (2021). Research on Analysis and Prediction of Big Data of Chinese Medicinal Materials in R+Hadoop. 2021 International Conference on Artificial Intelligence, Big Data and Algorithms (CAIBDA), Query date: 2024-06-09 05:26:26. https://doi.org/10.1109/caibda53561.2021.00044
Jordan, G., Brimicombe, A., & Li, Y. (2021). Big Data Quantitative Risk Analysis Method for Machine Health Indicator Prediction. Query date: 2024-06-09 05:26:26. https://doi.org/10.14293/s2199-1006.1.sor-.pp55dcw.v1
K.MANOHARI. (2023). An Efficient Exploration on Big Data Analysis in Adolescent Diabetic Prediction with Deep Learning Techniques. International Journal of Information Technology, Research and Applications, 2(2), 33–40. https://doi.org/10.59461/ijitra.v2i2.51
Kojima, R., Legaspi, R., & Wada, S. (2022). Trip Destination Prediction by Cross-City Exploratory Data Analysis Approach in People Flow Data. 2022 IEEE International Conference on Big Data (Big Data), Query date: 2024-06-09 05:26:26. https://doi.org/10.1109/bigdata55660.2022.10020611
Kumar, A., Singh, K. U., & Kumar, M. (2023). A Clinical Data Analysis Based Diagnostic Systems for Heart Disease Prediction Using Ensemble Method. Big Data Mining and Analytics, 6(4), 513–525. https://doi.org/10.26599/bdma.2022.9020052
Lbrini, S., Fadil, A., Aamir, Z., Khomali, M., Oulidi, H. J., & Rhinane, H. (2021). Big Health Data: Cardiovascular Disease Prevention Using Big Data and Machine Learning. Studies in Computational Intelligence, Query date: 2024-06-09 05:26:26, 311–327. https://doi.org/10.1007/978-3-030-72065-0_17
Lei, L. (2024). Research on Disease Prediction Based on Daily Data Analysis of the Elderly Based on Neural Network Algorithms. 2024 IEEE 3rd International Conference on Electrical Engineering, Big Data and Algorithms (EEBDA), Query date: 2024-06-09 05:26:26. https://doi.org/10.1109/eebda60612.2024.10485910
Leinonen, J. (2021). Improvements to short-term weather prediction with recurrent-convolutional networks. 2021 IEEE International Conference on Big Data (Big Data), Query date: 2024-06-09 05:26:26. https://doi.org/10.1109/bigdata52589.2021.9671869
Manjula, G., Gopi, R., Rani, S. S., Reddy, S. S., & Chelvi, E. D. (2021). Firefly—Binary Cuckoo Search Technique based heart disease prediction in Big Data Analytics. Applications of Big Data in Healthcare, Query date: 2024-06-09 05:26:26, 241–260. https://doi.org/10.1016/b978-0-12-820203-6.00007-2
Mishra, S., Pandey, M., Rautaray, S. S., & Chakraborty, S. (2022). Multiclass Prediction of Heart Disease Patients Using Big Data Analytics. Studies in Big Data, Query date: 2024-06-09 05:26:26, 177–193. https://doi.org/10.1007/978-981-19-5154-1_11
Miyoshi, T. (2021). Fusing Big Data and Big Computation in Numerical Weather Prediction. Query date: 2024-06-09 05:26:26. https://doi.org/10.52843/cassyni.n6qqk0
Montesi, D., Girdzijauskas, S., & Vlassov, V. (2020). Repeating Link Prediction over Dynamic Graphs. 2020 IEEE International Conference on Big Data (Big Data), Query date: 2024-06-09 05:26:26. https://doi.org/10.1109/bigdata50022.2020.9378360
Muthulakshmi, P., & Parveen, M. (2024). Big Data Analytics for Heart Disease Prediction using Regularized Principal and Quadratic Entropy Boosting. Indian Journal Of Science And Technology, 17(6), 533–547. https://doi.org/10.17485/ijst/v17i6.2928
Nibareke, T., & Laassiri, J. (2020). Using Big Data-machine learning models for diabetes prediction and flight delays analytics. Journal of Big Data, 7(1). https://doi.org/10.1186/s40537-020-00355-0
Nikiforakis, G. (2021). The Use of Data Collection and Big Data Analysis in Neurodegenerative Disease Prevention. GeNeDis 2020, Query date: 2024-06-09 05:26:26, 181–181. https://doi.org/10.1007/978-3-030-78775-2_21
Ning, F. (2021). Prediction and Detection of Urban Trajectory Using Data Mining and Deep Neural Network. 2021 International Conference on Big Data Analysis and Computer Science (BDACS), Query date: 2024-06-09 05:26:26. https://doi.org/10.1109/bdacs53596.2021.00017
Ordonez, C., Fund, I., & Bellatreche, L. (2022). Comparing Association Rules and Deep Neural Networks for Heart Disease Prediction. 2022 IEEE International Conference on Big Data (Big Data), Query date: 2024-06-09 05:26:26. https://doi.org/10.1109/bigdata55660.2022.10020522
Pejic-Bach, M., Pivar, J., & Krstić, Ž. (2022). Big Data for Prediction. Research Anthology on Big Data Analytics, Architectures, and Applications, Query date: 2024-06-09 05:26:26, 1192–1215. https://doi.org/10.4018/978-1-6684-3662-2.ch057
Pontes, E. L., & Benjannet, M. (2021). Contextual Sentence Analysis for the Sentiment Prediction on Financial Data. 2021 IEEE International Conference on Big Data (Big Data), Query date: 2024-06-09 05:26:26. https://doi.org/10.1109/bigdata52589.2021.9672027
Prasad, Dr. S. (2021). Machine Learning Era In Heart Disease Prediction- An Intense Learning Analysis With Big Data. Turkish Journal of Computer and Mathematics Education (TURCOMAT), 12(5), 203–208. https://doi.org/10.17762/turcomat.v12i5.874
Prehofer, C., & Mehmood, S. (2020). Big Data Architectures for Vehicle Data Analysis. 2020 IEEE International Conference on Big Data (Big Data), Query date: 2024-06-09 05:26:26. https://doi.org/10.1109/bigdata50022.2020.9378397
R., R., P., K., E., M., V., K., & Pon, H. (2022). Disease Analysis and Prediction Using Digital Twins and Big Data Analytics. New Approaches to Data Analytics and Internet of Things Through Digital Twin, Query date: 2024-06-09 05:26:26, 98–114. https://doi.org/10.4018/978-1-6684-5722-1.ch005
Rawat, V., Singh, D. P., Singh, N., & Negi, S. (2023). Heart Disease Prediction Using Machine Learning and Big Data. Big Data, Cloud Computing and IoT, Query date: 2024-06-09 05:26:26, 93–102. https://doi.org/10.1201/9781003298335-7
Roberts, H., & Segev, A. (2020). Animal Behavior Prediction with Long Short-Term Memory. 2020 IEEE International Conference on Big Data (Big Data), Query date: 2024-06-09 05:26:26. https://doi.org/10.1109/bigdata50022.2020.9378184
Ruan, J., Wu, W., & Luo, J. (2020). Stock Price Prediction Under Anomalous Circumstances. 2020 IEEE International Conference on Big Data (Big Data), Query date: 2024-06-09 05:26:26. https://doi.org/10.1109/bigdata50022.2020.9378030
Sasikala, P., & Sheela, L. M. I. (2020). Sentiment analysis of online product reviews using DLMNN and future prediction of online product using IANFIS. Journal of Big Data, 7(1). https://doi.org/10.1186/s40537-020-00308-7
Sewell, D. K. (2022). Network-Informed Constrained Divisive Pooled Testing Assignments. Frontiers in Big Data, 5(Query date: 2024-06-09 05:26:26). https://doi.org/10.3389/fdata.2022.893760
Shahrivari, H., Papapetrou, O., & Fletcher, G. (2022). Workload Prediction for Adaptive Approximate Query Processing. 2022 IEEE International Conference on Big Data (Big Data), Query date: 2024-06-09 05:26:26. https://doi.org/10.1109/bigdata55660.2022.10020614
Shankhdhar, A. (2022). Visualization and Prediction of Heart Disease using Big Data Analytics. 2022 11th International Conference on System Modeling & Advancement in Research Trends (SMART), Query date: 2024-06-09 05:26:26. https://doi.org/10.1109/smart55829.2022.10046877
Shinde, S., Satav, S., Shirole, U., & Oak, S. (2022). Comprehensive Analysis of Parkinson Disease Prediction using Vocal Parameters. 2022 International Conference on Machine Learning, Big Data, Cloud and Parallel Computing (COM-IT-CON), Query date: 2024-06-09 05:26:26. https://doi.org/10.1109/com-it-con54601.2022.9850857
Shivani, B., & Rao, S. P. G. (2021). Stock Market Analysis & Prediction. 2021 International Conference on Forensics, Analytics, Big Data, Security (FABS), Query date: 2024-06-09 05:26:26. https://doi.org/10.1109/fabs52071.2021.9702549
Simon, S. D. (2022). Centers for Disease Control and Prevention (CDC). Encyclopedia of Big Data, Query date: 2024-06-09 05:26:26, 158–161. https://doi.org/10.1007/978-3-319-32010-6_258
Singh, A., Vij, D., Jijja, A., & Verma, S. (2023). Prediction of Heart Disease Using Various Data Analysis and Machine Learning Techniques. Springer Proceedings in Mathematics & Statistics, Query date: 2024-06-09 05:26:26, 23–35. https://doi.org/10.1007/978-3-031-15175-0_3
Skretting, A., & Gronli, T.-M. (2020). Baseline for Performance Prediction of Android Applications. 2020 IEEE International Conference on Big Data (Big Data), Query date: 2024-06-09 05:26:26. https://doi.org/10.1109/bigdata50022.2020.9377882
Sorell, T. (2020). Policing with Big Data: DNA Matching vs Crime Prediction. Big Data and Democracy, Query date: 2024-06-09 05:26:26, 57–70. https://doi.org/10.3366/edinburgh/9781474463522.003.0005
Xu, J., Liu, J., Yao, T., & Li, Y. (2023). Prediction and Big Data Impact Analysis of Telecom Churn by Backpropagation Neural Network Algorithm from the Perspective of Business Model. Big Data, 11(5), 355–368. https://doi.org/10.1089/big.2021.0365
Yang, Y., Li, Y., Chen, R., Zheng, J., Cai, Y., & Fortino, G. (2021). Risk Prediction of Renal Failure for Chronic Disease Population Based on Electronic Health Record Big Data. Big Data Research, 25(Query date: 2024-06-09 05:26:26), 100234–100234. https://doi.org/10.1016/j.bdr.2021.100234