Psychometric properties

  • Article type: Journal Article
    BACKGROUND: This review provides an overview of the psychometric properties of the short physical performance battery (SPPB), timed up and go test (TUG), 4 m gait speed test (4 m GST) and the 400 m walk test (400 m WT) in community-dwelling older adults.
    METHODS: A systematic search was conducted in MEDLINE, CINAHL and EMBASE, resulting in the inclusion of 50 studies with data from a total of 19,266 participants (mean age 63.2-84.3). Data were extracted and properties were given a sufficient or insufficient overall rating following the COSMIN guideline for systematic reviews of patient-reported outcome measures. Quality of evidence (QoE) was rated using the Grading of Recommendations Assessment, Development and Evaluation (GRADE) approach.
    RESULTS: The SPPB was evaluated in 12 studies, TUG in 30, 4 m GST in 12 and 400 m WT in 2. Reliability of the SPPB, TUG and 4 m GST was rated sufficient (moderate to good QoE). The measurement error of the SPPB was rated insufficient (low QoE). Criterion validity for the SPPB was insufficient in indicating sarcopenia (moderate QoE), while the TUG was sufficient and insufficient for determining mobility limitations (low QoE) and activities of daily living disability (low QoE), respectively. Construct validity of the SPPB, TUG, 4 m GST and 400 m WT was rated insufficient in many constructs (moderate to high QoE). Responsiveness was rated as insufficient for SPPB (high QoE) and TUG (very low QoE), while 4 m GST was rated as sufficient (high QoE).
    CONCLUSIONS: Overall, the psychometric quality of commonly used physical performance tests in community-dwelling older adults was generally rated insufficient, except for reliability. These tests are widely used in daily practice and recommended in guidelines; however, users should be cautious when drawing conclusions such as sarcopenia severity and change in physical performance due to limited psychometric quality of the recommended measurement instruments. There is a need for a disease-specific physical performance test for people with sarcopenia. This research received no specific grant from any funding agency and was registered a priori using the International Prospective Register of Systematic Reviews (PROSPERO) (CRD42022359725).

  • Article type: Journal Article
    The Youth Anxiety Measure for DSM-5 (YAM-5) is a self- and parent-report scale specifically developed to assess symptoms of major anxiety disorders (part 1 or YAM-5-I) and specific phobias/agoraphobia (part 2 or YAM-5-II) in children and adolescents in terms of the contemporary psychiatric classification system. Since its introduction, the measure has been increasingly used in research, making it feasible to provide a summary of its psychometric properties. The present article presents a systematic review of 20 studies that employed the YAM-5, involving 5325 young participants. Overall, the results supported the hypothesized factor structure of both parts of the measure, although there were also some studies that could not fully replicate the original five-factor model of YAM-5-I. The internal consistency of the YAM-5 was generally high for the total scores of both parts, while reliability coefficients for the subscales were more variable across studies. Research also obtained evidence for other psychometric properties, such as test-retest reliability, parent-child agreement, convergent/divergent validity, and discriminant validity. Results further revealed that girls tend to show significantly higher anxiety levels on the YAM-5 than boys. Overall, these findings indicate that the YAM-5 is a promising tool for assessing symptoms of anxiety disorders including specific phobias in young people. Some directions for future research with the YAM-5 and recommendations regarding the use of the measure are given.

  • Article type: Journal Article
    Food allergies severely impact the health-related quality of life (HRQoL) of patients and their caregivers (family or informal caregivers). Currently there is no comprehensive review that provides an overview and critical assessment of the instruments in the field. Six databases were searched from inception until 10 August 2023, and a combination of subject terms and free words was used to search the literature. We used the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) methodology to evaluate the measurement properties of the instruments. Forty-one studies reported on ten eligible instruments. Based on COSMIN guidelines, one instrument was recommended for Grade A, and the remaining nine instruments were recommended for Grade B. The Grade A instrument identified, the Food Allergy Quality of Life Questionnaire-Parent Form (FAQLQ-PF), can help researchers assess the effectiveness of treatment for patients with food allergy and understand the psychosocial impact of the disease on patients.

  • Article type: Journal Article
    Heart failure (HF) is a prevalent global disease, particularly impacting developed countries. With the world's aging population, HF's impact on the quantity and quality of life is expected to grow. This review aims to ascertain the frequency, characteristics, and properties of all patient-reported outcome measures (PROMs) studied in HF patients. We searched Ovid/Medline and Web of Science for original articles about PROMs performed in adults with HF. Using pre-established quality criteria for measurement properties, an overall rating was assigned to evaluate and compare different instruments. The quality of evidence was assessed with the COSMIN risk of bias checklist. Of 4283 records identified, we reviewed 296 full-text documents and included 64 papers, involving 30,185 participants. Thirty different PROMs were identified, with 14 specifically designed for HF being the most commonly used. Minnesota Living with Heart Failure (MLHF) and Kansas City Cardiomyopathy Questionnaire (KCCQ) were evaluated 16 and 13 times, respectively, demonstrating good psychometric properties. The MacNew Heart Disease Health-Related Quality of Life Questionnaire, a common heart disease-specific instrument, exhibited negative performances across various psychometric measures. Evidence for generic instruments was scant and unremarkable, and they proved to be less responsive in HF populations. MLHF and KCCQ emerged as the most commonly used and well-supported PROMs, with robust overall evidence. They are comprehensive and accurate instruments, particularly suitable for application in clinical practice and research. Future research should explore how computer-adapted instruments can enhance precision, reduce respondent burden, and improve communication between clinicians and patients, thereby promoting more efficient and patient-centered services.

  • Article type: Journal Article
    A substantial portion of patients presenting to healthcare settings exhibit physical symptoms lacking clear, demonstrable organic causes. Accurate assessment of symptom severity is crucial for documenting outcomes and establishing treatment efficacy. This systematic review and narrative synthesis aims to provide researchers with insights into available and validated tools for assessing medically unexplained physical symptoms (MUPS). It involved comprehensive searches across electronic databases, including PubMed, Wiley, and Cochrane, adhering to PRISMA and COSMIN guidelines. The study comprised two phases: Phase 1 systematically reviewed tools for assessing MUPS symptoms and severity, while Phase 2 conducted a narrative synthesis of their measurement properties, focusing on validity and reliability. Out of 14,459 records, 191 articles were identified, leading to the recognition of 16 validated tools for assessing MUPS symptoms and severity. Most tools demonstrated excellent internal consistency and structural validity. However, the majority lacked cross-cultural validity. The choice of tools for the assessment of MUPS will assist clinicians and researchers in determining the severity of MUPS and developing a tailored treatment plan to improve the physical and psychological functioning of these patients.

  • Article type: Journal Article
    Parkinson's disease (PD) affects Quality of Life (QoL), since it is responsible for cognitive impairment, non-motor, and motor symptoms. Outcome measures are fundamental for evaluating a treatment's effect on QoL over time. This systematic review aimed to identify the psychometric properties of PDQ-39 and PDQ-8 in the different populations in which they were validated. The electronic databases systematically searched were MEDLINE (via PubMed), CINAHL, SCOPUS, and Web of Science; the search was conducted in July 2023. The psychometric properties considered were those of the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) checklist. Risk of bias was assessed using the COSMIN checklist. The search identified 1306 articles. 398 duplicates were eliminated; the titles and abstracts of 908 articles were analyzed; 799 were excluded because they used PDQ-39 and PDQ-8 as outcome measures or did not deal with psychometric properties; 66 articles were excluded after reading the full text. 43 articles were included in the review; meta-analysis showed that all the Cronbach's alpha values were statistically significant for all the subscales of PDQ-39 and PDQ-8. PDQ-39 proved to be a specific HRQoL questionnaire that is correlated with generic HRQoL questionnaires; in many studies included in the review, correlations with SF-36 were found. The most recent studies of the psychometric properties of PDQ-8 indicated that it is a practical and informative instrument that can be easily used in clinical settings, especially busy ones, but also in large-scale studies in which a brief instrument would be preferred.
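    The screening counts reported in this abstract can be checked with simple arithmetic. The sketch below is only an illustration: the function name and dictionary keys are hypothetical, and the number of articles assessed in full text (109) is derived here rather than stated in the abstract.

```python
def screening_flow(identified, duplicates, excluded_on_abstract, excluded_on_full_text):
    """Recompute the stages of a review's selection flow from its reported counts."""
    screened = identified - duplicates                   # records left after de-duplication
    full_text = screened - excluded_on_abstract          # derived value, not stated in the abstract
    included = full_text - excluded_on_full_text         # articles retained for the review
    return {"screened": screened, "full_text_assessed": full_text, "included": included}


# Counts taken from the abstract: 1306 identified, 398 duplicates,
# 799 excluded on title/abstract, 66 excluded after full-text reading.
flow = screening_flow(1306, 398, 799, 66)
print(flow)
```

    With the reported numbers, the derived totals agree with the 908 screened and 43 included articles given in the abstract.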

  • Article type: Journal Article
    Stigma against sexual and gender minority (SGM) populations has serious negative health effects for those populations. Despite the growing need for accurate stigma measurement in SGM, there are insufficient valid measurement instruments. Moreover, the lack of consistency in construct usage makes comparisons across studies particularly challenging. A critical review and comparative evaluation of the psychometric properties of the various stigma measures for SGM is necessary to advance our understanding regarding stigma measurement against/among SGMs. Based on the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines, a comprehensive search was conducted in 4 bibliographic databases (MEDLINE, PsycINFO, CINAHL, and Web of Science) for empirical articles published from 2010 to 2022 that evaluated the psychometric properties of measurement instruments assessing stigma against SGMs. The screening, extraction, and scoring of the psychometric properties and methodological quality of selected instruments were performed by following the established standards and COSMIN (COnsensus-based Standards for the selection of health Measurement INstruments) checklist, respectively. Of the 2031 studies identified, 19 studies were included that reported psychometric properties of 17 measurement instruments. All instruments, except two, were developed for SGMs (n = 15/17). Most instruments included men who have sex with men (MSM) or gay men (n = 11/15), whereas less than half of the instruments assessed stigma among SGM women (n = 6/15). Internal consistency (Cronbach's alpha) and content validity were reported for all instruments (n = 17); construct and structural validity were also reported for the majority of the instruments (n = 15 and 10, respectively). However, test-retest reliability and criterion validity were reported for very few instruments (n = 5 each). Based on the COSMIN checklist, we identified the most psychometrically and methodologically robust instruments for each of the five stigma types: combined stigma, enacted stigma, internalized stigma, intersectional stigma, and perceived stigma. For each stigma type, except anticipated stigma, at least one instrument demonstrated strong promise for use in empirical research; however, the selection of instrument depends on the target population and context of the study. Findings indicated a growing use of instruments assessing multiple stigma types. Future studies need to develop intersectional stigma instruments that account for the multiple and intersecting social identities of SGMs. Additionally, most existing instruments would benefit from further psychometric testing, especially on test-retest reliability, criterion validity, and adaptability to different LGBTQIA+ populations and cultures.
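    Internal consistency is reported here as Cronbach's alpha. As a rough illustration of how that coefficient is computed, the sketch below applies the standard formula, alpha = k/(k-1) × (1 − sum of item variances / variance of total scores). The function name and the response matrix are hypothetical and are not drawn from any of the reviewed instruments.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(scores[0])
    if k < 2:
        raise ValueError("Cronbach's alpha needs at least two items")
    # Sample variance of each item across respondents.
    item_variances = [variance([row[i] for row in scores]) for i in range(k)]
    # Sample variance of each respondent's total score.
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# Hypothetical responses: 5 respondents answering 4 Likert-type items.
responses = [
    [3, 4, 3, 4],
    [2, 2, 3, 2],
    [5, 4, 5, 5],
    [1, 2, 1, 2],
    [4, 4, 3, 4],
]
alpha = cronbach_alpha(responses)
print(round(alpha, 3))
```

    Values closer to 1 indicate that the items vary together; which thresholds count as acceptable depends on the rating criteria a given review adopts.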

  • Article type: Journal Article
    This study aimed to identify clinical measures that have been used to evaluate function, health-related quality of life (HRQoL), and/or satisfaction in children who use lower limb prostheses (LLP). The data reported on psychometric properties for children who use LLP were collected for each measure.
    First, PubMed, CINAHL, and Web of Science databases were searched using broad search terms to identify standardized outcome measures of function, HRQoL, and/or satisfaction with treatment used in pediatric LLP research published in 2001 or after. For each of the eligible measures found, a second search was performed to identify psychometric properties (e.g., validity, reliability) assessed with children who use LLP.
    Forty-four standardized outcome measures were identified from 41 pediatric LLP research articles. Five measures (i.e., Gait Outcomes Assessment for Lower Limb Differences, Functional Mobility Assessment, Child Amputee Prosthetics Project-Prosthesis Satisfaction Inventory, Child Amputee Prosthetics Project-Functional Scale Index, and Lower Limb Function Questionnaire) had data on psychometric properties for children who use LLP.
    Few studies report psychometric data for assessing the overall HRQoL, function, and/or satisfaction for children who use LLP. Further research is needed to validate or create new outcome measures that assess the HRQoL, satisfaction, and/or function of children who use LLP.

  • Article type: Journal Article
    OBJECTIVE: Accurate and reliable balance measures are important for prescribing fall prevention treatments and monitoring their effectiveness. Thus, we aimed to systematically review the psychometric properties of the Six-Spot Step Test, an increasingly used measure of dynamic balance.
    METHODS: A literature search using the free-text term "Six-Spot Step Test" was performed on 12 February 2024, in Medline, Embase, Rehabilitation & Sports Medicine and SPORTDiscus. Eligibility criteria were adults aged 18 or more, trials evaluating the psychometric properties of the Six-Spot Step Test, and English-language articles. Conference abstracts were excluded.
    METHODS: Two investigators screened and selected data independently and assessed the methodological quality and evidence using the COSMIN Risk of Bias checklist and modified GRADE approach. One investigator extracted study characteristics such as design, population and psychometric properties.
    RESULTS: Of the 159 articles identified, 16, evaluating multiple measurement properties, were included in the final analysis. A total of 1319 people participated, including people affected by stroke, multiple sclerosis, Parkinson's disease, chronic inflammatory polyneuropathy and older adults with balance problems. Eight articles assessing reliability (n = 618, intraclass correlation coefficient ≥0.7, minimal detectable change = 22%) and 12 assessing construct validity (n = 1082, 83% of hypotheses confirmed, area under the curve >0.7) exhibited sufficient methodological quality with high-level evidence, while two studies (n = 167) examining responsiveness showed very low evidence.
    CONCLUSIONS: Apart from responsiveness, robust evidence supports the reliability and validity of the Six-Spot Step Test for assessing dynamic balance in a specific group of individuals with neurological diseases and older adults. Further, it is considered feasible for clinical use.

  • Article type: Journal Article
    BACKGROUND: Neuropsychological assessments traditionally include tests of executive functioning (EF) because of its critical role in daily activities and link to mental disorders. Established traditional EF assessments, although robust, lack ecological validity and are limited to single cognitive processes. These methods, which are suitable for clinical populations, are less informative regarding EF in healthy individuals. With these limitations in mind, immersive virtual reality (VR)-based assessments of EF have garnered interest because of their potential to increase test sensitivity, ecological validity, and neuropsychological assessment accessibility.
    OBJECTIVE: This systematic review aims to explore the literature on immersive VR assessments of EF focusing on (1) EF components being assessed, (2) how these assessments are validated, and (3) strategies for monitoring potential adverse (cybersickness) and beneficial (immersion) effects.
    METHODS: EBSCOhost, Scopus, and Web of Science were searched in July 2022 using keywords that reflected the main themes of VR, neuropsychological tests, and EF. Articles had to be peer-reviewed manuscripts written in English and published after 2013 that detailed empirical, clinical, or proof-of-concept studies in which a virtual environment using a head-mounted display was used to assess EF in an adult population. A tabular synthesis method was used in which validation details from each study, including comparative assessments and scores, were systematically organized in a table. The results were summed and qualitatively analyzed to provide a comprehensive overview of the findings.
    RESULTS: The search retrieved 555 unique articles, of which 19 (3.4%) met the inclusion criteria. The reviewed studies encompassed EF and associated higher-order cognitive functions such as inhibitory control, cognitive flexibility, working memory, planning, and attention. VR assessments commonly underwent validation against gold-standard traditional tasks. However, discrepancies were observed, with some studies lacking reported a priori planned correlations, omitting detailed descriptions of the EF constructs evaluated using the VR paradigms, and frequently reporting incomplete results. Notably, only 4 of the 19 (21%) studies evaluated cybersickness, and 5 of the 19 (26%) studies included user experience assessments.
    CONCLUSIONS: Although it acknowledges the potential of VR paradigms for assessing EF, the evidence has limitations. The methodological and psychometric properties of the included studies were inconsistently addressed, raising concerns about their validity and reliability. Infrequent monitoring of adverse effects such as cybersickness and considerable variability in sample sizes may limit interpretation and hinder psychometric evaluation. Several recommendations are proposed to improve the theory and practice of immersive VR assessments of EF. Future studies should explore the integration of biosensors with VR systems and the capabilities of VR in the context of spatial navigation assessments. Despite considerable promise, the systematic and validated implementation of VR assessments is essential for ensuring their practical utility in real-world applications.