Automated DAPODIK Elementary School Data Extraction Using Selenium and BeautifulSoup
Keywords:
web scraping, Python, Data Systems, Data Analysis, Information Systems, Education Information System, Educational Data, Data Automation, Elementary School
Abstract
Educational data management is essential for policy planning and decision-making in the education sector. Collecting DAPODIK data manually, however, is time-consuming and prone to human error because the information is spread across multiple web pages. This study designs and evaluates an automated system for extracting elementary school DAPODIK data using Python-based web scraping. A quantitative experimental approach was applied, using Selenium WebDriver and BeautifulSoup to retrieve educational data from the official DAPODIK reference website. The dataset covers 40 elementary schools in Lais District, Musi Banyuasin Regency, with each record containing the school identity, NPSN, accreditation level, number of students, number of teachers, and school status. Preprocessing with the Pandas library included data cleaning, standardization, type conversion, and duplicate elimination. The experimental results indicate that the proposed system achieved a 100% extraction success rate with no detected errors and completed the scraping process in 133.61 seconds. The extracted dataset contained consistent and valid numerical and categorical values suitable for further analysis. These findings suggest that automated web scraping can improve the speed, accuracy, and consistency of DAPODIK data collection compared with conventional manual methods. The framework could also support large-scale educational data management and monitoring systems in the future.
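The preprocessing steps named in the abstract (cleaning, standardization, type conversion, and duplicate elimination) can be illustrated with a minimal Pandas sketch. This is not the authors' code: the column names, the sample rows, the NPSN values, and the `preprocess` helper are all hypothetical, and the choice to deduplicate on NPSN is an assumption made here for illustration.

```python
import pandas as pd

# Hypothetical raw rows as a scraper might return them. School names,
# NPSN values, and counts are placeholders, not data from the study.
raw_rows = [
    {"school_name": "  SD Contoh A ", "npsn": "00000001", "accreditation": "b",
     "students": "120", "teachers": "9", "status": "negeri"},
    {"school_name": "SD Contoh A", "npsn": "00000001", "accreditation": "B",
     "students": "120", "teachers": "9", "status": "Negeri"},
    {"school_name": "SD Contoh B", "npsn": "00000002", "accreditation": " A",
     "students": "85", "teachers": "-", "status": "SWASTA"},
]


def preprocess(df: pd.DataFrame) -> pd.DataFrame:
    """Clean, standardize, convert types, and drop duplicate schools."""
    df = df.copy()

    # Cleaning: trim stray whitespace from every text column.
    for col in ["school_name", "npsn", "accreditation", "status"]:
        df[col] = df[col].astype(str).str.strip()

    # Standardization: use one spelling per category.
    df["accreditation"] = df["accreditation"].str.upper()
    df["status"] = df["status"].str.capitalize()

    # Type conversion: counts become nullable integers; values that are
    # not numbers (such as "-") become missing instead of raising.
    for col in ["students", "teachers"]:
        df[col] = pd.to_numeric(df[col], errors="coerce").astype("Int64")

    # Duplicate elimination: assume NPSN uniquely identifies a school.
    return df.drop_duplicates(subset="npsn", keep="first").reset_index(drop=True)


clean = preprocess(pd.DataFrame(raw_rows))
print(clean)
```

Keeping NPSN as a string rather than a number preserves any leading zeros, and the nullable `Int64` type lets a missing teacher or student count remain visibly missing rather than being silently replaced with zero.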
License
Copyright (c) 2026 M.Beni Tanjung, Herri Setiawan, Indah Permatasari

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.


