Analisis Butir Soal Ulangan Akhir Semester I Mata Pelajaran Bahasa Indonesia Kelas II Madrasah Ibtidaiyah : Analisis Butir Soal Ulangan Akhir Semester I Mata Pelajaran Bahasa Indonesia Kelas II Madrasah Ibtidaiyah

USWATUN Ni'mah; Indonesia Indonesia; Ifada Novikasari

Authors

USWATUN Ni'mah UIN SAIZU PURWOKERTO
Indonesia Indonesia
Ifada Novikasari

Keywords:

item analysis,, test quality

Abstract

This research aims to conduct an item analysis on the final semester I examinationof the Indonesian language for second-grade students at Madrasah Ibtidaiyah. Aquantitative analysis method was employed using the Anates version 4 program asan evaluation tool. The research findings revealed several key insights. First, the testexhibited a relatively high level of reliability (0.63), indicating good consistency inmeasurement results. The correlation between item scores and the total test scorevaried, suggesting the need for evaluation and improvement in the test structureto enhance predictive validity. Second, the high-achieving group demonstratedgood performance, while the low-achieving group indicated potential for expandingunderstanding among some participants. This analysis provided insights intothe diversity of abilities among students. Third, the majority of items fell into thecategories of "very easy" and "easy," indicating a manageable level of difficultyfor respondents. However, there was variability in the difficulty levels, offeringvaluable information about the relative difficulty of each item. Fourth, the qualityof distractors was considered high, with no options rated as poor or very poor. Thevariation in difficulty levels of answer choices presented a balanced challenge toparticipants, contributing to the test's validity. This research offers a comprehensiveoverview of test quality and item characteristics, providing a foundation for furtherresearch development and measurement instrument improvement. These findingscan serve as a reference for enhancing the validity, reliability, and sustainability oftests in the context of Madrasah Ibtidaiyah education.

Author Biography

Indonesia , Indonesia

References

American Educational Research Association, A. P. A., & Education., N. C. on M. in. (2014). Standards

for Educational and Psychological Testing (Washington). American Educational Research

Association.

Anastasi, A., & Urbina, S. (1997). Psychological Testing. Upper Saddle River, NJ: Prentice Hall

Brown, G., & Race, P. (2002). Using effective questions. In The Lecturer’s Toolkit: A Practical Guide

to Assessment, Learning and Teaching, 121.

Bruner, J. (1983). Child’s Talk: Learning to Use Language. Norton.

Camilli, G., & Shepard, L. A. (1994). Methods for Identifying Biased Test Items. CA: Sage Publications.

Chomsky, N. (1957). Syntactic Structures. The Hague: Mouton.

Crocker, L., & Algina, J. (1986). Introduction to Classical and Modern Test Theory. FL: Holt, Rinehart

and Winston.

Derewianka, B. (1990). A Grammar for Writing. Primary English Teaching Association Australia.

Downing, S. M., & Haladyna, T. M. (2006). Handbook of test development. Routledge.

Dwipayani, A. A. (2013). Analisis Validitas dan Reliabilitas Butir Soal Ulangan Akhir Semester

Bidang Studi Bahasa Indonesia Kelas X.D SMA N 1 Terhadap Pencapaian Kompetensi. Jurnal

Pendidikan Bahasa Dan Sastra UNDIKSHA, 1(5).

Elviana. (2020). Analisis Butir Soal Evaluasi Pembelajaran Pendidikan Agama Islam Menggunakan

Program Anates. Jurnal Mudarirsuna, 10(2), 58–74.

Embretson, S. E., & Reise, S. P. (2000). Item response theory for psychologists. NJ: Lawrence Erlbaum

Associates.

Gronlund, N. E., & Linn, R. L. (1990). Measurement and Assessment in Teaching. Macmillan.

Haladyna, T. M., & Downing, S. M. (1989). alidity of a taxonomy of multiple-choice item-writing

rules. Applied Measurement in Education, 2(1), 51–78.

Haladyna, T. M., Downing, S. M., & Rodriguez, M. C. (2002). A review of multiple-choice item-writing

guidelines for classroom assessment. Applied Measurement in Education, 15(3), 309–334.

Hambleton, R. K., & Jones, R. W. (1993). Comparison of classical test theory and item response

theory and their applications to test development. Educational Measurement: Issues and

Practice, 12(3), 38–47.

Hambleton, R. K., & Swaminathan, H. (1985). Item Response Theory: Principles and Applications.

Kluwer-Nijhoff Publishing.

Ida, F. F., & Musyarofah, A. (2021). Validitas dan Reliabilitas dalam Analisis Butir Soal. Al-Mu’arrib:

Journal Of Arabic Education, 1(1). https://doi.org/10.32923/al-muarrib.v1i1.2100

Indonesia, D. A. R. (n.d.). Kurikulum Pendidikan Agama Islam (PAI) di Madrasah. www.kemenag.

go.id Ismail, M. I. (2020). Evaluasi pembelajaran. In Remaja Rosdakarya.

Lord, F. M. (1980). Applications of Item Response Theory to Practical Testing Problems. NJ: Lawrence

Erlbaum Associates.

Mulyasa, E. (2013). Kurikulum Tingkat Satuan Pendidikan: Konsep, Desain, dan Implementasi.

Remaja Rosdakarya.

Analisis Butir Soal Akhir Semester I Mata Pelajaran Bahasa Indonesia Kelas II Madrasah Ibtidaiyah