Bibliometric Analysis of Missing Value Imputation (MVI) Research Using Python and VOSviewer: Trends and Future Directions
- 1 Department of Data Science, Faculty of Engineering and Design, Institut Teknologi Sains Bandung, Bekasi, West Java, 17530, Indonesia
- 2 Department of Pulp and Paper Technology, Faculty of Vocational, Institut Teknologi Sains Bandung, Bekasi, West Java, 17530, Indonesia
- 3 Department of Palm Oil Processing Technology, Vocational Faculty, Institut Teknologi Sains Bandung, Bekasi, West Java, 17530, Indonesia
- 4 Department of Statistics, Faculty of Science, Computer and Mathematics, Matana University, Banten, 15810, Indonesia
- 5 Department of Data Science, Sains and Technology, Universitas Putra Bangsa, Kebumen, Jawa Tengah, 54361, Indonesia
Abstract
Incomplete data is a pervasive issue in data science, posing significant challenges for statistical analysis and machine learning applications. Missing Value Imputation (MVI) has thus emerged as a critical area of research aimed at mitigating data quality issues and improving decision-making accuracy. This study presents a comprehensive bibliometric analysis of MVI research using Scopus-indexed publications from 2000 to 2024. The analysis was conducted using Python and VOSviewer, with the support of Scopus AI to enhance the identification of research trends and thematic patterns. Our findings reveal a significant rise in publications on MVI in the last decade, driven by the increasing adoption of artificial intelligence and big data technologies. The results indicate that a few prolific research groups and institutions have contributed extensively to the field, particularly in healthcare, finance, and environmental sciences. We employed advanced preprocessing techniques, including keyword normalization and duplicate filtering, to ensure data quality. In addition, statistical validation methods, such as linear regression and Mann-Kendall tests, were applied to confirm trend significance. Visualizations include co-authorship networks, keyword co-occurrence maps, and citation impact distributions. This study highlights promising directions for future research, including real-time imputation for streaming data, applications in underrepresented domains, and comparative studies across bibliographic databases. The findings contribute not only to a deeper understanding of the evolution of MVI research but also offer actionable insights for researchers and practitioners seeking to navigate and advance this domain.
DOI: https://doi.org/10.3844/jcssp.2025.2816.2833
Copyright: © 2025 Samsul Arifin, Muhammad Faisal, Edwin Kristianto Sijabat, Ni Njoman Manik Susantini, Okta Nindita Priambodo, Idad Syaeful Haq, Wiwik Wiyanti and Lolanda Hamim Annisa. This is an open access article distributed under the terms of the
Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
- 64 Views
- 15 Downloads
- 0 Citations
Download
Keywords
- Missing Value Imputation
- Bibliometric Analysis
- Python
- VOSviewer
- Data Visualization
- Research Trends