Item response theory

项目反应理论
  • 文章类型: Journal Article
    尽管项目反应理论(IRT)已被推荐用于帮助推进跨专业教育(IPE)研究,它的使用仍然有限。关于IRT对横截面数据的“限制”的潜在误解可能部分解释了这一点。这项研究的目的是证明如何在IPE研究的前后设计中有效地应用项目响应理论(IRT)。具体来说,使用扩展专业身份量表(EPIS)的调查方法进行为期两周的前后设计,一种跨专业的身份衡量标准,在n=146名混合健康科学学生中进行。结果表明,EPIS在干预前后显著增加了0.74个标准化平均差异,t146=7.73,p<0.05。IRT前后模型还给出了0.60的测试-重测可靠性估计,这被认为是可以接受的。IRT模型与常规配对t检验的比较表明,Cohen的d=.56和.54的效应大小估计值相似。我们证明了IRT对IPE前后研究的灵活性。该模型的应用可以在目标IPE构建中产生准确的变化,相对于基线差异,这对经典测试理论是有利的。
    Although Item Response Theory (IRT) has been recommended for helping advance interprofessional education (IPE) research, its use remains limited. This may be partly explained by potential misconceptions regarding IRT`s \"limitation\" to cross-sectional data. The aim of this study is to demonstrate how Item Response Theory (IRT) can be applied effectively in before-and-after designs in IPE research. Specifically, a two-week before-after design with survey methodology using the Extended Professional Identity Scale (EPIS), an interprofessional identity measure, was conducted among n = 146 mixed health-science students. Results indicated that EPIS increased significantly before-after intervention by .74 standardised mean differences, t146 = 7.73, p < .05. The before-after IRT model also gave a test-retest reliability estimate of .60 which was considered acceptable. Comparison of the IRT model with a conventional paired-t-test indicated similar effect size estimates of Cohen\'s d = .56 and .54, respectively. We demonstrate IRT`s flexibility to before-after studies in IPE. Application of this model can yield accurate changes in target IPE constructs, and it is advantageous to classical test theory vis-à-vis baseline differences.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    目的:牛津膝关节评分(OKS)和OKS活动和参与问卷(OKS-APQ)是患者报告的结果指标,用于评估接受膝关节置换手术的患者。他们没有明确地进行单维度测试(他们是否测量了一个潜在的特征,如“膝盖健康”)。这项研究应用了项目响应理论(IRT)来提高仪器的有效性,以优化持续使用。
    方法:接受初次全膝关节置换(TKR)的参与者提供了OKS和OKS-APQ的术前和术后反应。分别对OKS和OKS-APQ进行验证性因子分析(CFA),然后将两者合并为一个。将IRT模型拟合到数据。
    结果:分析了2972个个体反应模式。CFA证明,当将OKS和OKS-APQ组合为一个工具时,他们测量一个潜在的健康特征。一个用户友好的,免费使用,已经开发了Web应用程序,允许临床医生上传原始数据并立即接收IRT分数。
    结论:可以将OKS和OKS-APQ组合起来,以有效地用作单个仪器(产生单个分数)。对于单独的OKS和OKS-APQ,可以继续向患者提出原始项目和响应选项,这项研究证实了IRT加权评分的适用性。将IRT应用于现有响应,将传统的总和分数转换为具有更大粒度的连续测量,包括单个测量误差。
    OBJECTIVE: The Oxford Knee Score (OKS) and OKS Activity and Participation Questionnaire (OKS-APQ) are patient reported outcome measures used to assess people undergoing knee replacement surgery. They have not explicitly been tested for unidimensionality (whether they measure one underlying trait such as \'knee health\'). This study applied item response theory (IRT) to improve the validity of the instruments to optimise for ongoing use.
    METHODS: Participants undergoing primary total knee replacement (TKR) provided pre-operative and post-operative responses for OKS and OKS-APQ. Confirmatory factor analyses (CFA) were performed on the OKS and OKS-APQ separately, and then on both when pooled into one. An IRT model was fitted to the data.
    RESULTS: 2972 individual response patterns were analysed. CFA demonstrated that when combining OKS and OKS-APQ as one instrument, they measure one latent health trait. A user-friendly, free-to-use, web app has been developed to allow clinicians to upload raw data and instantly receive IRT scores.
    CONCLUSIONS: The OKS and OKS-APQ can be combined to use effectively as a single instrument (producing a single score). For the separate OKS and OKS-APQ the original items and response options can continue to be posed to patients, and this study has confirmed the suitability of IRT-weighted scoring. Applying IRT to existing responses, converts traditional sum scores into continuous measurements with greater granularity including individual measurement error.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    我们使用项目反应理论,以361名心理健康顾问为样本,研究了感知压力量表-4个项目版本(PSS-4)的纵向心理测量特性。参与者在一年内以六个月的间隔在三个时间点完成PSS-4。有290名参与者(80.3%)确定为女性,51(14.1%)确定为男性,八个(2.2%)被确定为性别变异/不符合,七个(1.9%)写了自己的性别认同(例如,genderqueer,性别膨胀),三名(0.8%)被确认为变性人男性,两个(0.6%)没有回应该项目。种族和族裔背景如下:白人(87.3%),多种族(5.5%),拉丁裔或西班牙裔或西班牙裔(2.8%),黑人或非裔美国人(1.4%)亚洲(0.8%),中东(0.8%),五人没有回应(1.4%)。我们发现PSS-4在所有三个时间点的单维证据和响应类别是单调排序的。我们还发现,在所有时间点,平均人员位置低于平均物品位置,这表明PSS-4可能不是这个心理健康咨询师样本的良好目标。我们观察到时间点之间没有显著的相互作用,每周工作小时数,和就业时间。调查结果的含义,包括讨论PSS-4作为压力的全球衡量标准和与心理健康顾问的效用。
    We examined the longitudinal psychometric properties of the Perceived Stress Scale - 4 items version (PSS-4) using item response theory with a sample of 361 mental health counsellors. Participants completed the PSS-4 at three timepoints at six-month intervals in a one-year period. There were 290 participants who (80.3%) identified as female, 51 (14.1%) identified as male, eight (2.2%) identified as gender variant/non-conforming, seven (1.9%) wrote in their own gender identity (e.g., genderqueer, gender expansive), three (0.8%) identified as Transgender male, and two (0.6%) did not respond to the item. The racial and ethnic backgrounds were as follows: White (87.3%), Multiracial (5.5%), Latino or Hispanic or Spanish (2.8%), Black or African American (1.4%), Asian (0.8%), Middle Eastern (0.8%), and five did not respond to the item (1.4%). We found unidimensionality evidence of the PSS-4 across all three timepoints and response categories were monotonically ordered. We also found that across all timepoints, the average person location was lower than the average item location, suggesting that the PSS-4 may not be well-targeted for this sample of mental health counsellors. We observed no significant interactions between timepoints, hours worked per week, and length of employment. Implications of the findings, including a discussion of the utility of the PSS-4 as a global measure of stress and with mental health counsellors.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:神经发育表现任务通常是单独分析的,即使他们采用了类似的结构。对于暴露-神经行为结果的关联,这可能会产生混合的发现。我们开发了一种项目响应理论(IRT)方法,将多个任务变量集成在一起,以提高底层结构的测量精度。我们应用这种方法来创建儿童抑制控制的综合措施,并研究产前/产后铅暴露的影响。
    方法:使用来自墨西哥的前瞻性队列的数据(N=533),我们创建了一个抑制控制量表,该量表整合了来自四个抑制控制任务的准确性和反应时间信息(Go/NoGoLetter,Go/NoGo中立,Go/NoGoHappy,Delis-Kaplan执行功能系统(D-KEFS)颜色词干扰测试,条件3)。使用广义的部分信用项目反应理论模型,我们估计了每个参与者的抑制控制指数.然后,我们评估了脐带血和4年铅与儿童抑制控制之间的调整相关性。我们开发了一种重采样方法,以纳入抑制控制变量的误差估计,以确认铅抑制控制关联的一致性。我们分别对铅与每个抑制控制措施的时变关联进行建模。
    结果:参与者的平均年龄为9岁;51.4%为男性。脐带血[-0.06(95%CI:-0.11,-0.01)]和4年铅[-0.07(95%CI:-0.12,-0.02)]与8-10年的抑制控制指数相关。重新采样方法证实,4年铅与儿童抑制控制指数始终相关。在单独的模型中,脐带血和4年导联分别与8个指标中的3个相关。
    结论:这是IRT在环境流行病学中的首次应用,以创建用于抑制控制的潜在变量,该变量整合了来自多个,相关任务。该框架可以应用于其他相关神经行为评估或其他表型数据。
    BACKGROUND: Neurodevelopmental performance tasks are often separately analyzed, even when they tap into a similar construct. This may yield mixed findings for associations of an exposure-neurobehavioral outcome. We develop an item response theory (IRT) approach to integrate multiple task variables together to improve measurement precision of the underlying construct. We apply this approach to create an integrative measure of childhood inhibitory control, and study impacts of pre/post-natal lead exposure.
    METHODS: Using data from a prospective cohort based in Mexico (N = 533), we created an inhibitory control scale that integrates accuracy and reaction time information from four inhibitory control tasks (Go/NoGo Letter, Go/NoGo Neutral, Go/NoGo Happy, Delis-Kaplan Executive Function System (D-KEFS) Color-Word Interference Test, Condition 3). Using a generalized partial credit item response theory model, we estimated an inhibitory control index for each participant. We then assessed adjusted associations between umbilical cord blood and 4-year lead and childhood inhibitory control. We developed a resampling approach to incorporate error estimates from the inhibitory control variable to confirm the consistency of the lead-inhibitory control associations. We modeled time-varying associations of lead with each inhibitory control measure separately.
    RESULTS: Participants had a median age of 9 years; 51.4% were males. Umbilical cord blood [-0.06 (95% CI: -0.11, -0.01)] and 4-year lead [-0.07 (95% CI: -0.12, -0.02)] were associated with inhibitory control index at 8-10 years. A resampling approach confirmed that 4-year lead was consistently associated with childhood inhibitory control index. Umbilical cord blood and 4-year lead were each associated with 3 out of 8 measures in separate models.
    CONCLUSIONS: This is the first application of IRT in environmental epidemiology to create a latent variable for inhibitory control that integrates accuracy and reaction time information from multiple, related tasks. This framework can be applied to other correlated neurobehavioral assessments or other phenotype data.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:计算机自适应测试(CAT)是减少时间的有效方法,重复冗余,和回应负担,并已被用于衡量许多疾病的结果。这项研究旨在开发和验证针对慢性阻塞性肺疾病(COPD)患者报告的综合疾病特异性CAT结果测量。
    方法:使用项目反应理论分析改良的COPD患者报告结果量表(mCOPD-PRO)中项目的区分度和难度。然后是最初的项目,项目选择方法,能力估计方法,并根据协奏曲平台进一步设定了停止标准以形成CAT。最后,信度和效度得到验证。
    结果:项目区分范围为1.05至2.71,项目难度范围为-3.08至3.65。使用随机方法,CAT的测量可靠性范围为0.910至0.922,而使用最大Fisher信息(MFI)方法的范围为0.910至0.924。内容有效性良好。CAT和COPD评估测试的θ与采用随机方法的改良医学研究理事会呼吸困难量表评分之间的相关系数分别为0.628和0.540(P<0.001;P<0.001)。而MFI法分别为0.347和0.328(P=0.007;P=0.010)。使用随机方法对大约11个项目(平均减少59.3%)进行了测试,而使用MFI方法平均约有7个项目(减少74.1%)。随机法检测CAT与mCOPD-PRO总分θ的相关系数为0.919(P<0.001),而MFI方法为0.760(P<0.001)。
    结论:COPD患者报告的综合疾病特异性CAT结果测量具有良好的心理测量学特性,它可以提供有效的,准确,和用户友好的测量患者报告的COPD的结果。
    BACKGROUND: Computerized adaptive testing (CAT) is an effective way to reduce time, repetitious redundancy, and respond burden, and has been used to measure outcomes in many diseases. This study aimed to develop and validate a comprehensive disease-specific CAT for chronic obstructive pulmonary disease (COPD) patient-reported outcome measurement.
    METHODS: The discrimination and difficulty of the items from the modified patient-reported outcome scale for COPD (mCOPD-PRO) were analyzed using item response theory. Then the initial item, item selection method, ability estimation method, and stopping criteria were further set based on Concerto platform to form the CAT. Finally, the reliability and validity were validated.
    RESULTS: The item discrimination ranged from 1.05 to 2.71, and the item difficulty ranged from - 3.08 to 3.65. The measurement reliability of the CAT ranged from 0.910 to 0.922 using random method, while that ranged from 0.910 to 0.924 using maximum Fisher information (MFI) method. The content validity was good. The correlation coefficient between theta of the CAT and COPD assessment test and modified Medical Research Council dyspnea scale scores using random method was 0.628 and 0.540 (P < 0.001; P < 0.001) respectively, while that using MFI method was 0.347 and 0.328 (P = 0.007; P = 0.010) respectively. About 11 items (reducing by 59.3%) on average were tested using random method, while about seven items (reducing by 74.1%) on average using MFI method. The correlation coefficient between theta of the CAT and mCOPD-PRO total scores using random method was 0.919 (P < 0.001), while that using MFI method was 0.760 (P < 0.001).
    CONCLUSIONS: The comprehensive disease-specific CAT for COPD patient-reported outcome measurement is well developed with good psychometric properties, which can provide an efficient, accurate, and user-friendly measurement for patient-reported outcome of COPD.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    目的:在非洲很少有精神病筛查工具经过测试,然而,适当的工具可以增加自我报告的精神病症状的检测,改善精神病的检测并影响其预后。
    方法:在1928名无精神病史的埃塞俄比亚成年人样本中,采用验证性因子分析(CFA)和项目反应理论(IRT)对精神病筛查问卷(PSQ)的结构效度和因子结构进行了检验。我们测试了一个有和没有躁狂症项目的一维模型。对于IRT,使用似然比检验测试并比较了一维潜在结构单参数逻辑(1PL)和两参数(2PL)逻辑模型的相对拟合。
    结论:在一般医学背景下的埃塞俄比亚成年人样本中,终生阳性筛查的患病率为2.8%。一维模型证明了对PSQ的良好拟合,(CFI=0.993,TLI=0.986和RMSEA=0.025)。对于IRT,2PL模型是最佳拟合模型。项目难度和判别参数的IRT测试表明,偏执狂的判别α=4.59$\左(\\alpha=4.59\\右)$$最高,难度最低(β=2.53$\\beta=2.53$$),可能在低水平的精神病特征上得到认可。思想插入具有最高的项目难度(β=2.98$\\β=2.98$$)。总的来说,该措施在潜在特征的较高水平上捕获了精神病结构,可能适合于检测中度至重度精神病。
    结论:发现PSQ在埃塞俄比亚成年人的精神病筛查中具有良好的结构效度。未来的研究可能集中在PSQ的诊断有效性上,将其与结构化临床访谈进行比较。
    OBJECTIVE: Few psychosis screening instruments have been tested for use in Africa, yet appropriate tools can increase the detection of self-reported psychotic symptoms, improve the detection of psychosis and impact its prognosis.
    METHODS: The construct validity and factor structure of Psychosis Screening Questionnaire (PSQ) were tested with confirmatory factor analysis (CFA) and item response theory (IRT) in a sample of 1928 Ethiopian adults without any history of psychosis. We tested a unidimensional model with and without an item on mania. For IRT, unidimensional latent structure one-parameter logistic (1PL) and two-parameter (2PL) logistic models were tested and compared for relative fit using a likelihood-ratio test.
    CONCLUSIONS: The prevalence of lifetime positive screens was 2.8% in an Ethiopian sample of adults from a general medical setting. A unidimensional model demonstrated good fit for the PSQ, (CFI = 0.993, TLI = 0.986 and RMSEA = 0.025). For IRT, a 2PL model was the best fitting one. IRT tests of item difficulty and discrimination parameters showed that paranoia had the highest discrimination α = 4.59 $$ \\left(\\alpha =4.59\\right) $$ and lowest difficulty ( β = 2.53 $$ \\beta =2.53 $$ ), likely to be endorsed at low levels of psychotic features. Thought insertion had the highest item difficulty ( β = 2.98 $$ \\beta =2.98 $$ ). Overall, the measure captures the psychosis construct at higher levels of the latent trait and may be suited for detecting moderate to severe levels of psychosis.
    CONCLUSIONS: The PSQ is found to have good construct validity in screening for psychosis among Ethiopian adults. Future studies may focus on the diagnostic validity of the PSQ comparing it with a structured clinical interview.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    分裂型涉及精神分裂症样特征和症状,威斯康星州分裂分型量表(WSS)在以前的研究中经常使用。有一些证据表明精神分裂症的症状水平,包括使用WSS时,可能因性别、种族和族裔而异。然而,以前的研究很少研究WSS在多大程度上表现出性别和种族偏见。Further,以前没有对认知滑移量表进行过检查,杂乱无章的分裂类型的度量。在这项研究中,我们在一个大样本的本科生(n=21,829)中检查了WSS项目子集的偏差。使用项目响应理论来测试差异项目功能(DIF)的水平,我们发现所有尺度的DIF都有问题的证据,包括负面的,积极的,和杂乱无章的分裂型尺度。有证据表明DIF存在问题,尤其是性别以及黑人和多种族参与者。总的来说,目前的结果表明,在使用这些量表时,性别和/或种族偏见应该是一个重要的考虑因素,我们的结果可能对评估精神分裂症-谱系症状有意义.
    Schizotypy involves schizophrenia-like traits and symptoms, with the Wisconsin Schizotypy Scales (WSS) being frequently used in previous research. There is some evidence that schizophrenia-spectrum symptom levels, including when using the WSS, might vary both by gender and by race and ethnicity. However, previous research has rarely examined to what extent the WSS show gender and racial bias. Further, this has not been previously examined for the Cognitive Slippage Scale, a measure of disorganized schizotypy. In this study, we examined biases for a subset of items from the WSS in a large sample of undergraduate students (n = 21,829). Using item response theory to test for levels of differential item functioning (DIF), we found some evidence of problematic DIF for all scales, including for negative, positive, and disorganized schizotypy scales. There was evidence of problematic DIF especially by gender and for Black and Multiracial participants. Overall, the current results suggest that gender and/or racial bias on these scales should be an important consideration in using these scales and our results could have implications for assessment of schizophrenia-spectrum symptoms.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:上个世纪匈牙利的自杀率异常高。根据欧盟统计局,匈牙利自杀死亡人数在欧盟中排名第二,并且是2020年自杀倾向增加的少数国家之一。自杀预防计划的主要任务是发展自杀知识,消除有关自杀的神话和误解。因此,我们的研究目标是匈牙利对26项自杀素养量表(LOSS)的验证.
    方法:749人(382名女性(51.0%),男性364人(48.6%),3识别为非二元或其他(0.4%);4识别为变性人(0.5%))参加了我们的在线横断面调查,平均年龄为32.4岁(SD=14.5岁)。在项目反应理论方法中,使用具有WLSE(加权最小二乘)估计的2PL(两参数logistic)模型对H-LOSS问卷进行了调整,类似于原来的英文版本。
    结果:确认了量表的单维性。模型拟合指数和内部可靠性指标均可接受。项目投入和装备价值足够,项目区分值在范围内,但是一个项目的难度参数极高,三个项目的难度参数极低。很少有物品因年龄而异,性别和自己的自杀尝试。
    结论:H-LOSS量表被认为适用于评估匈牙利语样本中的自杀素养。
    BACKGROUND: Suicide was exceptionally high in Hungary in the last century. According to Eurostat, Hungary ranks second in the EU in death by suicide and was among the few countries where the suicidal tendencies increased in 2020. Primary tasks of suicide prevention programs are to develop suicide literacy and dispel myths and misconceptions about suicide. Therefore, the goal of our research was the Hungarian validation of the 26-item Literacy of Suicide Scale (LOSS).
    METHODS: 749 people (382 female (51.0%), 364 male (48.6%), 3 identify as non-binary or other (0.4%); 4 identifying as transgender (0.5%)) participated in our online cross-sectional survey with a mean age of 32.4 years (SD = 14.5 years). The H-LOSS questionnaire was adapted using the 2PL (two-parameter logistic) model with WLSE (weighted least squares) estimation in item response theory method, similarly to the original English version.
    RESULTS: Scale unidimensionality was confirmed. Model fit indices and internal reliability indicators were acceptable. Item infit and outfit values were adequate, item discrimination values were within range, but one item had extremely high and three items had extremely low item difficulty parameters. Few items had differential item functioning by age, gender and own suidice attempt.
    CONCLUSIONS: The H-LOSS scale deemed to be appropriate for assessing suicide literacy in Hungarian speaking samples.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:进行性核上性麻痹(PSP)是一种神经退行性疾病,在评估方面具有挑战性的晚发性疾病。进行性核上性麻痹评定量表(PSPRS),临床医生报告的28项量表,是最成熟的临床结局评估方法。最近,美国食品和药物管理局(FDA)已经提出了10个项目的子量表作为完整PSPRS的替代方案。
    目的:使用项目反应理论定量评估和比较完整的PSPRS和FDA子量表的性质。开发疾病的进展模型并评估研究设计和分析选项的相对优点。
    方法:来自4项介入试验和2项注册的979名患者的数据可用于分析。我们的调查分为:(1)估计28个项目的信息量;(2)估计疾病进展;(3)比较量表,试验设计,以及关于检测临床相关治疗效果的功率的分析选项。
    结果:PSPRS项目得分与项目易怒性低成对相关性(r=0.17±0.14),睡眠困难,姿势性震颤与其他项目无关。FDA选择的项目显示出更高的相关性(r=0.35±0.14),并且是包括疾病进展在内的纵向项目反应模型的基础。试验模拟表明,如果分析基于纵向项目信息,与治疗结束时的总分相比,疾病改善治疗效果的识别需要不到研究规模的一半。
    结论:基于FDA选择的PSPRS项目的纵向项目反应模型是评估PSP治疗的有前途的工具。©2024作者(S)。由WileyPeriodicalsLLC代表国际帕金森症和运动障碍协会出版的运动障碍。
    BACKGROUND: Progressive supranuclear palsy (PSP) is a neurodegenerative, late-onset disease that is challenging in terms of assessment. The Progressive Supranuclear Palsy Rating Scale (PSPRS), a 28-item clinician-reported scale, is the most established clinical outcome assessment method. Recently, the U.S. Food and Drug Administration (FDA) has proposed a subscale of 10 items as an alternative to full PSPRS.
    OBJECTIVE: To quantitatively evaluate and compare the properties of full PSPRS and the FDA subscale using item response theory. To develop a progression model of the disease and assess relative merits of study designs and analysis options.
    METHODS: Data of 979 patients from four interventional trials and two registries were available for analysis. Our investigation was divided into: (1) estimating informativeness of the 28 items; (2) estimating disease progression; and (3) comparing the scales, trial designs, and analysis options with respect to power to detect a clinically relevant treatment effect.
    RESULTS: PSPRS item scores had a low pairwise correlation (r = 0.17 ± 0.14) and the items irritability, sleep difficulty, and postural tremor were uncorrelated with the other items. The FDA-selected items displayed higher correlation (r = 0.35 ± 0.14) and were the basis for a longitudinal item response model including disease progression. Trial simulations indicated that identification of a disease-modifying treatment effect required less than half the study size if the analysis was based on longitudinal item information compared with total scores at end-of-treatment.
    CONCLUSIONS: A longitudinal item response model based on the FDA-selected PSPRS items is a promising tool in evaluating treatments for PSP. © 2024 The Author(s). Movement Disorders published by Wiley Periodicals LLC on behalf of International Parkinson and Movement Disorder Society.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:当前和以前的运动员是饮食失调(DE)的高危人群之一,影响他们的饮食习惯,身体成分,在他们的运动生涯中和之后的表现和健康。该组很少有全面的DE筛选工具。为了帮助解决这个问题,本研究利用经典测试理论(CTT)和项目反应理论(IRT)的混合方法方法来开发和验证适用于当前和以前运动员的DE筛选工具。
    方法:使用新的量表开发方法来开发和评估效度(内容,脸,跨文化,construct),测试-重测可靠性,内部一致性可靠性,一种新的DE量表的因子分析和Rasch分析。
    结果:创建了一种新的经过验证的运动饮食失调(ADE)筛查工具,有17个项目和四个分量表(食品控制,暴饮暴食,身体控制,身体不满),具有0.91的内部一致性信度,优秀的内容和结构效度,0.97的组内相关系数和优异的Rasch模型拟合。
    结论:ADE筛查工具已被双重开发用于研究目的,并作为临床适用的筛查工具,用于检测当前和以前的运动员中的DE,适用于跨运动类别的全球使用。不同的性别和竞争水平。
    BACKGROUND: Current and former athletes are one of the most at-risk population groups for disordered eating (DE), impacting their dietary practices, body composition, performance and health during and following their athletic careers. Few comprehensive DE screening tools exist for this group. To help address this, the current study utilised a mixed-methods approach of Classic Test Theory (CTT) and Item Response Theory (IRT) to develop and validate a DE screening tool suitable for current and former athletes.
    METHODS: Novel scale development methodologies were used to develop and assess the validity (content, face, cross-cultural, construct), test-retest reliability, internal consistency reliability, factor analysis and Rasch analysis of a new DE scale.
    RESULTS: A new validated Athletic Disordered Eating (ADE) screening tool was created, with 17 items and four subscales (food control, bingeing, body control, body discontent), with an internal consistency reliability of 0.91, excellent content and construct validity, an Intraclass Correlation Coefficient of 0.97 and excellent Rasch model fit.
    CONCLUSIONS: The ADE screening tool has been dually developed for research purposes and as a clinically applicable screening tool to detect DE in current and former athletes and is suitable for a global use across sporting categories, diverse genders and levels of competition.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号