Improving Second
Language Proficiency Assessment
A Differential Item Functioning Study
National Institute for Testing and Evaluation,
Abstract
This
study investigates factors affecting knowledge and acquisition of a second language
(SL) by examining differential item functioning (DIF) on SL (Hebrew) test items
for two language groups: Arabic and Russian speakers. The results are
consistent with the literature on English as a SL with regard to performance in
grammar and vocabulary. Many items (42%) functioned differentially, indicating
a potential threat to validity. The most problematic item type was Sentence
Completion. To reduce the number of DIF items included in operational tests, we
suggest changing the balance between item types and performing DIF analysis on
piloted items. Further research, using the age of the examinees and the length
of time they have lived in