[1]汪文义,何韵玲,宋丽红*,等.匹配变量纯化的测验偏差检验方法[J].江西师范大学学报(自然科学版),2022,(05):447-452.[doi:10.16357/j.cnki.issn1000-5862.2022.05.02]
 WANG Wenyi,HE Yunling,SONG Lihong*,et al.The Matching Score Purification for Differential Item Functioning Method[J].Journal of Jiangxi Normal University:Natural Science Edition,2022,(05):447-452.[doi:10.16357/j.cnki.issn1000-5862.2022.05.02]
点击复制

匹配变量纯化的测验偏差检验方法()
分享到:

《江西师范大学学报》(自然科学版)[ISSN:1006-6977/CN:61-1281/TN]

卷:
期数:
2022年05期
页码:
447-452
栏目:
心理与教育测量
出版日期:
2022-09-25

文章信息/Info

Title:
The Matching Score Purification for Differential Item Functioning Method
文章编号:
1000-5862(2022)05-0447-06
作者:
汪文义1何韵玲1宋丽红2*黄 涛1
(1.江西师范大学计算机信息工程学院,江西 南昌 330022; 2.江西师范大学教育学院,江西 南昌 330022)
Author(s):
WANG Wenyi1HE Yunling1SONG Lihong2*HUANG Tao1
(1.School of Computer and Information Engineering,Jiangxi Normal University,Nanchang Jiangxi 330022,China; 2.School of Education,Jiangxi Normal University,Nanchang Jiangxi 330022,China)
关键词:
测验偏差 项目功能差异 CSIBTEST 信度 考试公平
Keywords:
test bias differential item functioning CSIBTEST reliability the fairness of testing
分类号:
B 841
DOI:
10.16357/j.cnki.issn1000-5862.2022.05.02
文献标志码:
A
摘要:
CSIBTEST方法是基于参照组和目标组2个测验信度对真分数进行估计,再按交叉位置分数将匹配分数划分为2类子样本,并分别计算其卡方统计量,然后将这2个独立的卡方统计量相加得到自由度为2的检验统计量.鉴于测验信度具有群体依赖性,即不同群体的测验信度可能不尽相同,而CSIBTEST方法将参照组和目标组分别划分为2类子样本,有必要对子样本上的测验信度也进行估计,由此拓展了CSIBTEST.新方法先使用CSIBTEST获得交叉位置参数,相当于进行DIF预分析,再使用子样本上的信度估计用于真分数估计,以在对匹配变量进行纯化后获得检测统计量.模拟研究结果显示:相比SIBTEST和CSIBTEST,匹配变量纯化的测验偏差检验方法对存在DIF试题有着更高的统计检验力.
Abstract:
The CSIBTEST method estimates the true scores based on the test reliability of the reference group and the focus group,then separates the matching scores into two kinds of sub-samples according to the crossing location,and computes the Chi-squared statistics respectively,and then adds the two independent statistics to obtain the test statistics with a degree of freedom of 2.From the view of the group dependence of test reliability,that is,the test reliability of different groups may be different,and the CSIBTEST method separates the reference group and the focus group into two sub-samples respectively,it is necessary to estimate the test reliability on the sub-samples for the extension of the CSIBTEST.The new method first uses the CSIBTEST to obtain the cross location,and then applies the reliability estimation on the sub-samples for the true score estimation as matching score purification to obtain the test statistic.The simulation study shows that the new method with matching score purification has higher statistical test power for the bias item than the SIBTEST and CSIBTEST.

参考文献/References:

[1] 司林波,裴索亚,王伟伟.新中国教育评价制度变迁的影响因素、基本规律与实践启示:基于教育评价相关政策文本的扎根理论研究[J].大学教育科学,2021(6):69-77.
[2] 宣小红,檀慧玲,曹宇新.教育学研究的热点与未来展望:基于2021年度人大复印报刊资料《教育学》转载论文的分析[J].教育研究,2022,43(2):70-82.
[3] 赵勇.教育评价的几大问题及发展方向[J].华东师范大学学报(教育科学版),2021,39(4):1-14.
[4] BITLER M,CORCORAN S,DOMINA T,et al.Teacher effects on student achievement and height:a cautionary tale[J].Journal of Research on Educational Effectiveness,2021,14(4):900-924.
[5] 张志祯,齐文鑫.教育评价中的信息技术应用:赋能、挑战与对策[J].中国远程教育,2021(3):1-11,76.
[6] 汪文义,张华华.统计测量视角下考试公平推动教育公平的对策[J].江西师范大学学报(自然科学版),2017,41(4):383-393.
[7] SHEALY R,STOUT W.A model-based standardization approach that separates true bias/DIF from group ability differences and detects test bias/DTF as well as item bias/DIF[J].Psychometrika,1993,58(2):159-194.
[8] CHANG Huahua,MAZZEO J,ROUSSOS L.Detecting DIF for polytomously scored items:an adaptation of the SIBTEST procedure[J].Journal of Educational Measurement,1996,33(3):333-353.
[9] LI H H,STOUT W.A new procedure for detection of crossing DIF[J].Psychometrika,1996,61(4):647-677.
[10] CHALMERS R P.Improving the crossing-SIBTEST statistic for detecting non-uniform DIF[J].Psychometrika,2018,83(2):376-386.
[11] CRONBACH L J.Coefficient alpha and the internal structure of tests[J].Psychometrika,1951,16(3):297-334.
[12] GREEN S B,YANG Yanyun.Commentary on coefficient alpha:a cautionary tale[J].Psychometrika,2009,74(1):121-135.
[13] ANDERSSON B,LUO Hao,MARCQ K.Reliability coefficients for multiple group item response theory models[J].British Journal of Mathematical and Statistical Psychology,2022,75(2):395-410.
[14] RAYKOV T.Examining group differences in reliability of multiple-component instruments[J].British Journal of Mathematical and Statistical Psychology,2002,55(1):145-158.
[15] BENTLER P M.Covariate-free and covariate-dependent reliability[J].Psychometrika,2016,81(4):907-920.
[16] LEE H S,GEISINGER K F.The matching criterion purification for differential item functioning analyses in a large-scale assessment[J].Educational and Psychological Measurement,2016,76(1):141-163.
[17] CHEN Chengte,HWU B S.Improving the assessment of differential item functioning in large-scale programs with dual-scale purification of Rasch models:the PISA example[J].Applied Psychological Measurement,2018,42(3):206-220.
[18] 漆书青,戴海崎,丁树良.现代教育与心理测量学原理[M].北京:高等教育出版社,2002.
[19] GUTTMAN L.A basis for analyzing test-retest reliability[J].Psychometrika,1945,10(4):255-282.
[20] REVELLE W,YOVEL I.Psych:procedures for psychological,psychometric,and personality research[EB/OL].[2022-01-06].https://cran.r-project.org/web/packages/psych/psych.pdf.
[21] MCDONALD R P.Test theory:a unified treatment[M].New Jersey:Erlbaum,1999.
[22] ZINBARG R E,REVELLE W,YOVEL I,et al.Cronbach's α,Revelle's β,and McDonald's ωH:their relations with each other and two alternative conceptualizations of reliability[J].Psychometrika,2005,70(1):123-133.
[23] ZINBARG R E,REVELLE W,YOVEL I.Estimating ωh for structures containing two group factors:perils and prospects[J].Applied Psychological Measurement,2007,31(2):135-157.
[24] JORGENSEN T D,PORNPRASERTMANIT S,SCHOEMANN A M,et al.semTools:useful tools for structural equation modeling[EB/OL].[2022-01-06].https://cran.r-project.org/web/packages/semTools/semTools.pdf.
[25] 温忠麟,叶宝娟.测验信度估计:从α系数到内部一致性信度[J].心理学报,2011,43(7):821-829.
[26] REVELLE W,ZINBARG R E.Coefficients alpha,beta,omega and the glb:comments on Sijtsma[J].Psychometrika,2009,74(1):145-154.
[27] BENTLER P M.Alpha,FACTT,and beyond[J].Psychometrika,2021,86(4):861-868.
[28] LORD F M,NOVICK M R.Statistical theory of mental test scores[M].Massachusetts:Addison-Wesley,1968.
[29] CHALMERS R P.mirt:a multidimensional item response theory package for the R environment[J].Journal of Statistical Software,2012,48(6):1-29.

相似文献/References:

[1]汪文义,张华华.统计测量视角下考试公平推动教育公平的对策[J].江西师范大学学报(自然科学版),2017,(04):383.
 WANG Wenyi,CHANG Hua-hua.A Practical View of Test Fairness to Improve Equity in Education from Statistical Measurement[J].Journal of Jiangxi Normal University:Natural Science Edition,2017,(05):383.

备注/Memo

备注/Memo:
收稿日期:2022-04-25
基金项目:江西省社会科学基金(17JY10)和国家自然科学基金(62267004,62067005,61967009)资助项目.
通信作者:宋丽红(1981—),女,江西新干人,副教授,博士,主要从事教育测量研究.E-mail:viviansong1981@163.com
更新日期/Last Update: 2022-09-25