Review Article Open Access

Bibliometric Analysis of Missing Value Imputation (MVI) Research Using Python and VOSviewer: Trends and Future Directions

Samsul Arifin1, Muhammad Faisal1, Edwin Kristianto Sijabat2, Ni Njoman Manik Susantini2, Okta Nindita Priambodo3, Idad Syaeful Haq3, Wiwik Wiyanti4 and Lolanda Hamim Annisa5
  • 1 Department of Data Science, Faculty of Engineering and Design, Institut Teknologi Sains Bandung, Bekasi, West Java, 17530, Indonesia
  • 2 Department of Pulp and Paper Technology, Faculty of Vocational, Institut Teknologi Sains Bandung, Bekasi, West Java, 17530, Indonesia
  • 3 Department of Palm Oil Processing Technology, Vocational Faculty, Institut Teknologi Sains Bandung, Bekasi, West Java, 17530, Indonesia
  • 4 Department of Statistics, Faculty of Science, Computer and Mathematics, Matana University, Banten, 15810, Indonesia
  • 5 Department of Data Science, Sains and Technology, Universitas Putra Bangsa, Kebumen, Jawa Tengah, 54361, Indonesia

Abstract

Incomplete data is a pervasive issue in data science, posing significant challenges for statistical analysis and machine learning applications. Missing Value Imputation (MVI) has thus emerged as a critical area of research aimed at mitigating data quality issues and improving decision-making accuracy. This study presents a comprehensive bibliometric analysis of MVI research using Scopus-indexed publications from 2000 to 2024. The analysis was conducted using Python and VOSviewer, with the support of Scopus AI to enhance the identification of research trends and thematic patterns. Our findings reveal a significant rise in publications on MVI in the last decade, driven by the increasing adoption of artificial intelligence and big data technologies. The results indicate that a few prolific research groups and institutions have contributed extensively to the field, particularly in healthcare, finance, and environmental sciences. We employed advanced preprocessing techniques, including keyword normalization and duplicate filtering, to ensure data quality. In addition, statistical validation methods, such as linear regression and Mann-Kendall tests, were applied to confirm trend significance. Visualizations include co-authorship networks, keyword co-occurrence maps, and citation impact distributions. This study highlights promising directions for future research, including real-time imputation for streaming data, applications in underrepresented domains, and comparative studies across bibliographic databases. The findings contribute not only to a deeper understanding of the evolution of MVI research but also offer actionable insights for researchers and practitioners seeking to navigate and advance this domain.

Journal of Computer Science
Volume 21 No. 12, 2025, 2816-2833

DOI: https://doi.org/10.3844/jcssp.2025.2816.2833

Submitted On: 9 February 2025 Published On: 20 January 2026

How to Cite: Arifin, S., Faisal, M., Sijabat, E. K., Susantini, N. N. M., Priambodo, O. N., Haq, I. S., Wiyanti, W. & Annisa, L. H. (2025). Bibliometric Analysis of Missing Value Imputation (MVI) Research Using Python and VOSviewer: Trends and Future Directions. Journal of Computer Science, 21(12), 2816-2833. https://doi.org/10.3844/jcssp.2025.2816.2833

  • 98 Views
  • 26 Downloads
  • 0 Citations

Download

Keywords

  • Missing Value Imputation
  • Bibliometric Analysis
  • Python
  • VOSviewer
  • Data Visualization
  • Research Trends