参考文献/References:
[1] 司林波,裴索亚,王伟伟.新中国教育评价制度变迁的影响因素、基本规律与实践启示:基于教育评价相关政策文本的扎根理论研究[J].大学教育科学,2021(6):69-77.
[2] 宣小红,檀慧玲,曹宇新.教育学研究的热点与未来展望:基于2021年度人大复印报刊资料《教育学》转载论文的分析[J].教育研究,2022,43(2):70-82.
[3] 赵勇.教育评价的几大问题及发展方向[J].华东师范大学学报(教育科学版),2021,39(4):1-14.
[4] BITLER M,CORCORAN S,DOMINA T,et al.Teacher effects on student achievement and height:a cautionary tale[J].Journal of Research on Educational Effectiveness,2021,14(4):900-924.
[5] 张志祯,齐文鑫.教育评价中的信息技术应用:赋能、挑战与对策[J].中国远程教育,2021(3):1-11,76.
[6] 汪文义,张华华.统计测量视角下考试公平推动教育公平的对策[J].江西师范大学学报(自然科学版),2017,41(4):383-393.
[7] SHEALY R,STOUT W.A model-based standardization approach that separates true bias/DIF from group ability differences and detects test bias/DTF as well as item bias/DIF[J].Psychometrika,1993,58(2):159-194.
[8] CHANG Huahua,MAZZEO J,ROUSSOS L.Detecting DIF for polytomously scored items:an adaptation of the SIBTEST procedure[J].Journal of Educational Measurement,1996,33(3):333-353.
[9] LI H H,STOUT W.A new procedure for detection of crossing DIF[J].Psychometrika,1996,61(4):647-677.
[10] CHALMERS R P.Improving the crossing-SIBTEST statistic for detecting non-uniform DIF[J].Psychometrika,2018,83(2):376-386.
[11] CRONBACH L J.Coefficient alpha and the internal structure of tests[J].Psychometrika,1951,16(3):297-334.
[12] GREEN S B,YANG Yanyun.Commentary on coefficient alpha:a cautionary tale[J].Psychometrika,2009,74(1):121-135.
[13] ANDERSSON B,LUO Hao,MARCQ K.Reliability coefficients for multiple group item response theory models[J].British Journal of Mathematical and Statistical Psychology,2022,75(2):395-410.
[14] RAYKOV T.Examining group differences in reliability of multiple-component instruments[J].British Journal of Mathematical and Statistical Psychology,2002,55(1):145-158.
[15] BENTLER P M.Covariate-free and covariate-dependent reliability[J].Psychometrika,2016,81(4):907-920.
[16] LEE H S,GEISINGER K F.The matching criterion purification for differential item functioning analyses in a large-scale assessment[J].Educational and Psychological Measurement,2016,76(1):141-163.
[17] CHEN Chengte,HWU B S.Improving the assessment of differential item functioning in large-scale programs with dual-scale purification of Rasch models:the PISA example[J].Applied Psychological Measurement,2018,42(3):206-220.
[18] 漆书青,戴海崎,丁树良.现代教育与心理测量学原理[M].北京:高等教育出版社,2002.
[19] GUTTMAN L.A basis for analyzing test-retest reliability[J].Psychometrika,1945,10(4):255-282.
[20] REVELLE W,YOVEL I.Psych:procedures for psychological,psychometric,and personality research[EB/OL].[2022-01-06].https://cran.r-project.org/web/packages/psych/psych.pdf.
[21] MCDONALD R P.Test theory:a unified treatment[M].New Jersey:Erlbaum,1999.
[22] ZINBARG R E,REVELLE W,YOVEL I,et al.Cronbach's α,Revelle's β,and McDonald's ωH:their relations with each other and two alternative conceptualizations of reliability[J].Psychometrika,2005,70(1):123-133.
[23] ZINBARG R E,REVELLE W,YOVEL I.Estimating ωh for structures containing two group factors:perils and prospects[J].Applied Psychological Measurement,2007,31(2):135-157.
[24] JORGENSEN T D,PORNPRASERTMANIT S,SCHOEMANN A M,et al.semTools:useful tools for structural equation modeling[EB/OL].[2022-01-06].https://cran.r-project.org/web/packages/semTools/semTools.pdf.
[25] 温忠麟,叶宝娟.测验信度估计:从α系数到内部一致性信度[J].心理学报,2011,43(7):821-829.
[26] REVELLE W,ZINBARG R E.Coefficients alpha,beta,omega and the glb:comments on Sijtsma[J].Psychometrika,2009,74(1):145-154.
[27] BENTLER P M.Alpha,FACTT,and beyond[J].Psychometrika,2021,86(4):861-868.
[28] LORD F M,NOVICK M R.Statistical theory of mental test scores[M].Massachusetts:Addison-Wesley,1968.
[29] CHALMERS R P.mirt:a multidimensional item response theory package for the R environment[J].Journal of Statistical Software,2012,48(6):1-29.