Analisis Butir Soal Akhir Semester I Mata Pelajaran Bahasa Indonesia Kelas II Madrasah Ibtidaiyah
Analisis Butir Soal Ulangan Akhir Semester I Mata Pelajaran Bahasa Indonesia Kelas II Madrasah Ibtidaiyah
Keywords:
item analysis,, test qualityAbstract
This research aims to conduct an item analysis on the final semester I examinationof the Indonesian language for second-grade students at Madrasah Ibtidaiyah. Aquantitative analysis method was employed using the Anates version 4 program asan evaluation tool. The research findings revealed several key insights. First, the testexhibited a relatively high level of reliability (0.63), indicating good consistency inmeasurement results. The correlation between item scores and the total test scorevaried, suggesting the need for evaluation and improvement in the test structureto enhance predictive validity. Second, the high-achieving group demonstratedgood performance, while the low-achieving group indicated potential for expandingunderstanding among some participants. This analysis provided insights intothe diversity of abilities among students. Third, the majority of items fell into thecategories of "very easy" and "easy," indicating a manageable level of difficultyfor respondents. However, there was variability in the difficulty levels, offeringvaluable information about the relative difficulty of each item. Fourth, the qualityof distractors was considered high, with no options rated as poor or very poor. Thevariation in difficulty levels of answer choices presented a balanced challenge toparticipants, contributing to the test's validity. This research offers a comprehensiveoverview of test quality and item characteristics, providing a foundation for furtherresearch development and measurement instrument improvement. These findingscan serve as a reference for enhancing the validity, reliability, and sustainability oftests in the context of Madrasah Ibtidaiyah education.References
American Educational Research Association, A. P. A., & Education., N. C. on M. in. (2014). Standards
for Educational and Psychological Testing (Washington). American Educational Research
Association.
Anastasi, A., & Urbina, S. (1997). Psychological Testing. Upper Saddle River, NJ: Prentice Hall
Brown, G., & Race, P. (2002). Using effective questions. In The Lecturer’s Toolkit: A Practical Guide
to Assessment, Learning and Teaching, 121.
Bruner, J. (1983). Child’s Talk: Learning to Use Language. Norton.
Camilli, G., & Shepard, L. A. (1994). Methods for Identifying Biased Test Items. CA: Sage Publications.
Chomsky, N. (1957). Syntactic Structures. The Hague: Mouton.
Crocker, L., & Algina, J. (1986). Introduction to Classical and Modern Test Theory. FL: Holt, Rinehart
and Winston.
Derewianka, B. (1990). A Grammar for Writing. Primary English Teaching Association Australia.
Downing, S. M., & Haladyna, T. M. (2006). Handbook of test development. Routledge.
Dwipayani, A. A. (2013). Analisis Validitas dan Reliabilitas Butir Soal Ulangan Akhir Semester
Bidang Studi Bahasa Indonesia Kelas X.D SMA N 1 Terhadap Pencapaian Kompetensi. Jurnal
Pendidikan Bahasa Dan Sastra UNDIKSHA, 1(5).
Elviana. (2020). Analisis Butir Soal Evaluasi Pembelajaran Pendidikan Agama Islam Menggunakan
Program Anates. Jurnal Mudarirsuna, 10(2), 58–74.
Embretson, S. E., & Reise, S. P. (2000). Item response theory for psychologists. NJ: Lawrence Erlbaum
Associates.
Gronlund, N. E., & Linn, R. L. (1990). Measurement and Assessment in Teaching. Macmillan.
Haladyna, T. M., & Downing, S. M. (1989). alidity of a taxonomy of multiple-choice item-writing
rules. Applied Measurement in Education, 2(1), 51–78.
Haladyna, T. M., Downing, S. M., & Rodriguez, M. C. (2002). A review of multiple-choice item-writing
guidelines for classroom assessment. Applied Measurement in Education, 15(3), 309–334.
Hambleton, R. K., & Jones, R. W. (1993). Comparison of classical test theory and item response
theory and their applications to test development. Educational Measurement: Issues and
Practice, 12(3), 38–47.
Hambleton, R. K., & Swaminathan, H. (1985). Item Response Theory: Principles and Applications.
Kluwer-Nijhoff Publishing.
Ida, F. F., & Musyarofah, A. (2021). Validitas dan Reliabilitas dalam Analisis Butir Soal. Al-Mu’arrib:
Journal Of Arabic Education, 1(1). https://doi.org/10.32923/al-muarrib.v1i1.2100
Indonesia, D. A. R. (n.d.). Kurikulum Pendidikan Agama Islam (PAI) di Madrasah. www.kemenag.
go.id Ismail, M. I. (2020). Evaluasi pembelajaran. In Remaja Rosdakarya.
Lord, F. M. (1980). Applications of Item Response Theory to Practical Testing Problems. NJ: Lawrence
Erlbaum Associates.
Mulyasa, E. (2013). Kurikulum Tingkat Satuan Pendidikan: Konsep, Desain, dan Implementasi.
Remaja Rosdakarya.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2024 USWATUN Ni'mah , Indonesia , Ifada Novikasari

This work is licensed under a Creative Commons Attribution 4.0 International License.