[1]黎光明,王小婷.非等组锚题设计下IRT等值方法比较及其应用[J].江西师范大学学报(自然科学版),2017,(05):454-461.
 LI Guangming,WANG Xiaoting.The Comparison of Equating Methods in Non-Equivalent Group with Anchor Test Design Based on Item Response Theory and Its Application[J].,2017,(05):454-461.
点击复制

非等组锚题设计下IRT等值方法比较及其应用()
分享到:

《江西师范大学学报》(自然科学版)[ISSN:1006-6977/CN:61-1281/TN]

卷:
期数:
2017年05期
页码:
454-461
栏目:
出版日期:
2017-11-01

文章信息/Info

Title:
The Comparison of Equating Methods in Non-Equivalent Group with Anchor Test Design Based on Item Response Theory and Its Application
作者:
黎光明王小婷
华南师范大学心理学院,心理应用研究中心,广东 广州 510631
Author(s):
LI GuangmingWANG Xiaoting
School of Psychology,Center for Studies of Psychological Application,South China Normal University,Guangzhou Guangdong 510631,China
关键词:
项目反应理论 测验等值 非等组锚题设计
Keywords:
IRT test equating non-equivalent groups with anchor test
分类号:
TP 841
文献标志码:
A
摘要:
总结了基于非等组锚题设计下的两大类IRT等值方法:同时参数标定和分别参数标定.分别参数标定包含了线性参数转换和固定参数标定,以等值精度为评价标准对这3类等值方法的效果和适用条件进行归纳并做出相应的评析,为测验工作者选择合适的等值方法进行项目参数和测验等值提供参考依据.
Abstract:
Two kinds of methods in test equating has been commented:concurrent calibration method and separate calibration method.The second kind includes linking separate calibration methods(e.g.the moment methods and the characteristic curve methods)and FIPC(Fixed Item Parameter Calibration)method.Taking equating accuracy as the criterion,the effects and suitable conditions of each method are summarized and corresponding comments are provided.The reference for users will be prouided in selecting the appropriate methods to process test equating.

参考文献/References:

[1] 张敏强,胡晖.略论测验等值的理论、方法和应用 [J].华南师范大学学报:社会科学版,1988(4):113-118.
[2] 漆书青,戴海琦.项目反应理论及其应用研究 [M].南昌:江西高校出版社,1992.
[3] 漆书青,戴海琦,丁树良.现代教育与心理测量学原理 [M].北京:高等教育出版社,2002.
[4] Kang Taehoon,Petersen N S.Linking item parameters to a base scale [J].Asia Pacific Education Review,2012,13(2):311-321.
[5] Kolen M J,Brennan R L.Test equating,linking,and scaling:methods and practices [M].New York:Springer Verla,2004.
[6] 罗照盛.项目反应理论基础 [M].北京:北京师范大学出版社,2012.
[7] Kolen M J,Brennan R L.Test equating scaling and lingking:method and practices [M].3ed.New York:Springer Verlag,2014.
[8] Hanson A B,Beguin A A.Obtaining a common scale for item response theory item parameters using separate versus concurrent estimation in the common-item equating design [J].Applied Psychological Measurement,2002,26(1):3-24.
[9] Loyd B H,Hoover H D.Vertical equating using the rasch model [J].Journal of Educational Measurement,1980,17(3):179-193
[10] Marco G L.Item characteristic curves solutions to three intractable testing problems [J].Journal of Educational Measurement,1977,14(2):139-160
[11] Haebara T.Equating logistic ability scale by weighted least squares method [J].Japanese Psychological Research,1980,22(3):144-149.
[12] Stocking M L,Lord F M.Developing a common metric in item response theory [J].Applied Psychological Measurement,1983,7(2):201-210.
[13] Linn R L,Levine M V,Hastings C N,et al.Item Bias in a test of reading comprehension [J].Applied Psychological Measurement,1981,5(2):159-173.
[14] 丁树良,熊建华,毛萌萌.项目反应理论框架下的新等值方法:对数对比等值法 [J].心理学报,2003,35(6):835-841.
[15] 熊建华,丁树良.Haebara等值方法及其加权准则 [J].江西师范大学学报:自然科学版,2005,29(5):434-437.
[16] 程德巧.绝对值等值准则及求解算法的应用 [D].南昌:江西师范大学,2005.
[17] 吴锐,丁树良,甘登文.一种新的项目反应理论等值准则:余弦准则 [J].江西师范大学学报:自然科学版,2008,32(2):224-245.
[18] Li Yuanhua,Tam H P,Tompkins L J.A comparison of using the fixed common pre-calibrated parameter method and the matched characteristic curve method for linking multiple-test items [J].International Journal of Testing,2004,4(3):267-293.
[19] Paek I,Young M J.Investigation of student growth recovery in a fixed-item linking procedure with a fixed-person prior distribution for mixed-format test data [J].Applied Measurement in Education,2005,18(2):199-215.
[20] Kim S H,Cohen A S.A comparison of linking and concurrent calibration under item response theory [J].Applied Psychological Measurement,1996,22(2):131-143.
[21] Kim J S,Hanson B A.Test equating under the multiple-choice model [J].Applied Psychological Measurement,2002,26(3):255-270.
[22] Wingersky M S,Lord F M.An investigation of methods for reducing sampling error in certain IRT procedures [J].Applied Psychological Measurement,1983,8:347-364.
[23] Wingersky M S,Barton P,Lord F.LOGIS[EB/OL].
[2017-01-06].http://www.ets.org.
[24] Mislevy B,Bock D.BILOG[EB/OL].
[2017-01-09].http://www.ssicentral.com.
[25] Thissen D.MULTILOG:Multiple categorical item analysis and test scoring using item response theory(Version 6.0)[EB/OL].
[2017-01-10].http://www.chegg.com.
[26] Muraki E,Bock R D.PARSCALE:IRT analysis and scoring of rating scale data [J].Science,2014,343(6169):350.
[27] Ogasawara H.Asymptotic standard errors of IRT equating coefficients using moments [J].Economic Review,2000,51(1):1-23.
[28] Baker F B,Al-Karni A.A comparison of two procedures for computing IRT equating coefficients [J].Journal of Educational Measurement,1991,28(2):147-162.
[29] Ogasawara H.Item response theory true score equating and their standard errors [J].Journal of Educational Behavioral Statistics,2001,26(1):31-50.
[30] Ogasawara H.Least square estimations of item response theory linking coefficients [J].Applied Psychological Measurement,2001,25(4):3-21.
[31] Ban Jae-chun,Hanson B A,Wang Tianyou,et al.A comparative study of on-line pretest-item calibration/scaling methods in computerized adaptive testing [J].Journal of Educational Measurement,2001,38(3):191-212.
[32] Kim S.A comparative study of IRT fixed parameter calibration methods [J].Journal of Educational Measurement,2006,43(4),355-381.
[33] Zhang Z H,Ni Y J.A comparison of fixed pre-calibrated parameter method,linking separate calibration and concurrent calibration for linking different groups [C].Chicago:The Annual Meeting of American Education Research Association,2007.
[34] Petersen N S,Cook L L,Stocking M L.IRT versus conventional equating methods:a comparative study of scale stability [J].Journal of Educational Statistics,1983,8(2):137-156.
[35] Wingersky M S,Cook L L,Eignor D R.Specifying the characteristics of linking items used for item response theory item calibration [R].ETS Research Report,1987:87-24.
[36] Kim S H,Cohen A S.A comparison of linking and concurrent calibration under the graded response model [J].Applied Psychological Measurement,2002,26(1):25-41.
[37] Beguin A A,Hanson B A,Glas C A W.Effect of multidimensionality on separate and concurrent estimation in IRT equating [C].New Orleans:The National Council on Measurement in Education,2000.
[38] Beguin A A,Hanson B A.Effect of non-compensatory multidimensionality on separate and concurrent estimation in IRT observed score equating [C].Seattle:The National Council on Measurement in Education,2001.
[39] Kerkee T,Lewis D M,Hoskens M,et al.Separate versus concurrent calibration methods in vertical scaling [C].Chicago:The National Council on Measurement in Education,2003.
[40] Sayaka A,Shinichi M.A comparison of equating methods and linking designs for developing an item pool under item response theory [J].Behaviormetrika,2011,38(1):1-16.
[41] Zhang Zhonghua.Comparison of different equating methods and an application to link testlet-based tests [D].Hong Kong:Chinese University of Hong Kong,2010.
[42] 王菲,任杰,张泉慧,等.等级记分模型下几种等值方法的比较研究 [J].中国考试,2013(6):10-17.
[43] Tian Feng.A comparison of equating/ling using the stocking-Lord method and concurrent calibration with mixed-format test in the non-equivalent groups common-item design under IRT [D].Boston:Boston College,2011.
[44] Kim S H,Lee W C.An extension of four IRT linking methods for mixed-format tests [J].Journal of Educational Measurement,2006,43(1):53-76
[45] Yao Lihua,Schwarz R D.A multidimensional partial credit model with associated item and test statistics:an application to mixed-format tests [J].Applied Psychological Measurement,2006,30(6):469-492.
[46] Li Deping,Jiang Yanlin,Davier A A.The accuracy and consistency of a series of IRT true score equating [J].Journal of Educational Measurement Summer,2012,49(2):167-189.
[47] Thorndike R L.Educational measurement [M].Washington,DC:American Council on Education,1971:508-600.
[48] Kim S,von Davier A A,Haberman S.Equating with small samples [M].Princeton,NJ:Educational Testing Service,2006
[49] von Davier A A.Statistical models for test equating,scaling,and linking [M].New York:Springer,2011:89-107.
[50] Michela B.IRT test equating in complex linkage plans [J].Psychometrika,2013,78(3):464-480.

相似文献/References:

[1]关潮辉,丁树良.基于IRT模型的BP神经网络参数估计的进一步研究[J].江西师范大学学报(自然科学版),2014,(04):434.
 GUAN Chao-hui,DING Shu-liang.Further Research on the Paraneter Estination Method of BP Neural Network Based on Iten ResPonse Theory[J].,2014,(05):434.
[2]简小珠,戴海琦.4参数GRM对猜测现象和失误现象的纠正[J].江西师范大学学报(自然科学版),2016,40(02):116.
 JIAN Xiaozhu,DAI Haiqi.Four-Parameter GRM and the Countermeasure to Sleeping and Guessing Phenomena[J].,2016,40(05):116.

备注/Memo

备注/Memo:
收稿日期:2017-02-18基金项目:国家自然科学基金(1470050),广东省哲学社会科学发展“十三五”规划2017年度一般项目(GD17CXL01),广州市哲学社会科学发展“十三五”规划项目(2017GZYB111),广东省2015度高等教育教学改革项目(粤教高函〔2015〕173号)和华南师范大学2014年度校级高等教育教学研究和改革项目(教学[2014]52号)资助项目.作者简介:黎光明(1977-),男,江西广昌人,副教授,博士,主要从事心理统计与测量的研究.E-mail:Lgm20041
更新日期/Last Update: 1900-01-01