[1]康春花,杨亚坤,钟晓玲,等.4年级数学应用题Q矩阵的适宜性[J].江西师范大学学报(自然科学版),2016,40(04):369-376.
 KANG Chunhua,YANG Yakun,ZHONG Xiaoling,et al.The Suitability of Q-Matrix on the Primary School Grade Four Students’ Arithmetical Word Problem[J].,2016,40(04):369-376.
点击复制

4年级数学应用题Q矩阵的适宜性()
分享到:

《江西师范大学学报》(自然科学版)[ISSN:1006-6977/CN:61-1281/TN]

卷:
40
期数:
2016年04期
页码:
369-376
栏目:
出版日期:
2016-09-01

文章信息/Info

Title:
The Suitability of Q-Matrix on the Primary School Grade Four Students’ Arithmetical Word Problem
作者:
康春花杨亚坤钟晓玲曾平飞
1.浙江师范大学教师教育学院,浙江 金华 321004; 2.海云天教育测评有限公司, 广东 深圳 518000
Author(s):
KANG ChunhuaYANG YakunZHONG XiaolingZENG Pingfei
1.College of Teacher Education,Zhejiang Normal University,Jinhua Zhejiang 321004,China; 2.CN Test Company,Shenzhen Guangdong 518000,China
关键词:
数学应用题 测验Q矩阵 R矩阵 GRM-AHM-A方法
Keywords:
mathematical word problem Q-matrix reach-ability matrix GRM-AHM-A method
分类号:
B 841
文献标志码:
A
摘要:
在认知诊断评估中,Q矩阵的界定和挑选非常重要,因其关系到诊断测验的质量和诊断评估的准确性.在模拟研究中,Q矩阵可以任意设定,但在实践研究中,Q矩阵的界定和测验Q矩阵的选择确非易事.该研究基于已有理论和模拟研究关于Q矩阵选择的原则,以小学4年级数学应用题为例,阐述如何在实践认知评估中选择适宜的测验Q矩阵,并通过实证和模拟研究验证所选测验Q矩阵的适宜性.研究结果表明:测验Q矩阵在包含R矩阵的前提下,考核模式并非越多越好、测验长度并非越长越好,相比较而言,只包含R矩阵的测验Q矩阵均要好于考核模式太多的Q矩阵.
Abstract:
The definition and selection of Q-matrix are very important in cognitive diagnostic assessment(CDA),because these concern the quality of a test and accuracy of CDA.The Q-matrix of simulation study can be set arbitrarily,but it not always the case in practical research.Based on the principles of existing theory and related simulation studies,the primary school grade four students’ arithmetical word problem is taken as an example to illustrate how to choose a suitable testing Q-matrix in practice.Empirical and simulation studies are used toverify the appropriate of selected testing Q-matrix.The results suggested that increasing the pattern and number of test items not always improve the pattern match ratio(PMR)and marginal match ratio(MMR)when testing Q-matrix contains the reach-ability matrix; instead,a Q-matrix with reachability matrix is better than the Q-matrix which includes too many test patterns.

参考文献/References:

[1] Borsboom D,Mellenbergh G J,van Heerden J.The concept of validity [J].Psychological Review,2004,111(4):1061.
[2] de la Torre J.An empirically based method of Q-matrix validation for the DINA model:development and applications [J].Journal of Educational Measurement,2008,45(4):343-362.
[3] De Carlo L T.On the analysis of fraction subtraction data:the DINA model,classification,latent class sizes,and the Q-matrix [J].Applied Psychological Measurement,2011,35(1):8-26.
[4] Henson R,Douglas J.Test construction for cognitive diagnosis [J].Applied Psychological Measurement,2005,29(4):262-277.
[5] Rupp A A,Templin J L.The effects of Q-matrix misspecification on parameter estimates and classification accuracy in the DINA model [J].Educational and Psychological Measurement,2008,68:78-96.
[6] Im S,Corter J E.Statistical consequences of attribute misspecification in the rule space method [J].Educational and Psychological Measurement,2011,71(4):712-731.
[7] Kunina-Habenicht O,Rupp A A,Wilhelm O.The Impact of model misspecification on parameter estimation and item-fit assessment in log‐linear diagnostic classification models [J].Journal of Educational Measurement,2012,49(1):59-81.
[8]丁树良,杨淑群,汪文义.可达矩阵在认知诊断测验编制中的重要作用 [J].江西师范大学学报:自然科学版,2010,34(5):490-494.
[9] Gorin J S.Test construction and diagnostic testing [A].Leighton J P.Cognitive diagnostic assessment for education:theory and applications [M].Cambridge:Cambridge University Press,2007.
[10] Gierl M,Wang Changjiang,Zhou Jianwen.Using the attribute hierarchy method to make diagnostic inferences about examinees’ cognitive skills in algebra on the SAT(c)[J].Journal of Technology,Learning,and Assessment,2008,6(6):1-53.
[11] 丁树良,汪文义,杨淑群.认知诊断测验蓝图的设计 [J].心理科学,2011(2):258-265.
[12] 涂冬波,漆书青,戴海琦,等.教育考试中的认知诊断评估 [J].考试研究,2008(4):4-15.
[13] 祝玉芳,丁树良.基于等级反应模型的属性层级方法 [J].心理学报,2009(3),267-275.
[14] 康春花,辛涛,田伟.小学数学应用题认知诊断测验编制及效度验证 [J].考试研究,2013(6):24-43.
[15] Mayer R E.Different problem-solving strategies for algebra word and equation problems [J].Journal of Experimental Psychology:Learning,Memory,and Cognition,1982,8(5):448-462.
[16] Enright M K,Morley M,Sheehan K M.Items by design:the impact of systematic feature variation on item statistical characteristics [J].Applied Measurement in Education,2002,15(1):49-74.
[17] Arendasy G,Sommer M.Using psychometric technology in educational assessment:the case of a schema-based isomorphic approach to automatic generation of quatitative reasoning items [J].Learning and Individual Differences,2007,17:366-383.
[18] Arendasy M,Sommer M,Gittler G,et al.Automatic generation of quantitative reasoning items [J].Journal of Individual Differences,2006,27(1):2-14.
[19] Embretson S,Gorin J.Improving construct validity with cognitive psychology principles [J].Journal of Educational Measurement,2001,38(4):343-368.
[20] Tatsuoka K K,Corter J E,Tatsuoka C.Patterns of diagnosed mathematical content and process skills in TIMSS-R across a sample of 20 countries [J].American Educational Research Journal,2004,41(4):901.
[21] Cui Ying,Leighton J P.The hierarchy consistency index:evaluating person fit for cognitive diagnostic assessment [J].Journal of Educational Measurement,2009,46(4):429-449.
[22] 丁树良,毛萌萌,汪文义,等.教育认知诊断测验与认知模型一致性的评估 [J].心理学报,2012(11):1535-1546.

备注/Memo

备注/Memo:
收稿日期:2016-04-17基金项目:浙江省高校重大人文社科项目攻关计划(2013QN048)资助项目.通信作者:曾平飞(1963-),男,广西荔浦人,教授,博士,主要从事心理测量与评价方面的研究.
更新日期/Last Update: 1900-01-01