Matching

匹配
  • 文章类型: Journal Article
    这里使用深度优先搜索来实现EdmondsBlossom算法,本质上是串行的。通过精简代码,我们的串行实现始终比以前最快的通用图匹配代码快三到五倍。通过在算法的迭代中提取并行性,带有粗粒锁定,我们能够进一步将随机规则图上的运行时间减少四倍,并在具有相似拓扑的现实世界图上获得两倍的运行时间减少。求解具有八个线程的社区结构的非常稀疏的图(平均度小于4)导致减慢三倍,但是一旦平均程度大于4,这种减速就会被边际加速所取代。我们得出的结论是,当从这种基于增强路径的算法中提取并行性时,我们的并行粗粒度锁定实现效果良好,并且对于类似的算法也可以很好地工作。
    The Edmonds Blossom algorithm is implemented here using depth-first search, which is intrinsically serial. By streamlining the code, our serial implementation is consistently three to five times faster than the previously fastest general graph matching code. By extracting parallelism across iterations of the algorithm, with coarse-grain locking, we are able to further reduce the run time on random regular graphs four-fold and obtain a two-fold reduction of run time on real-world graphs with similar topology. Solving very sparse graphs (average degree less than four) exhibiting community structure with eight threads led to a slow down of three-fold, but this slow down is replaced by marginal speed up once the average degree is greater than four. We conclude that our parallel coarse-grain locking implementation performs well when extracting parallelism from this augmenting-path-based algorithm and may work well for similar algorithms.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    被表述为通用图中的最大基数匹配(MCMGG)的匹配问题在没有限制的图上找到最大的匹配。当图稀疏时,Micali-Vazirani算法具有求解MCMGG的最佳渐近复杂度。由于多种原因,在GPU上的一般图形中的并行化匹配是困难的。首先,增强路径过程是高度递归的,和NVIDIAGPU使用寄存器来存储内核参数,最终溢出到缓存的设备内存中,表现惩罚。第二,从匹配过程中提取并行性需要对图形进行分区以避免任何重叠的扩展路径。我们提出了一种Micali-Vazirani算法的实现,该算法使用线程并行广度优先搜索来识别桥边,其次是块状平行路径扩张和开花收缩。增强路径和联合查找方法被实现为基于堆栈的迭代方法,与在共享内存中分配的堆栈。我们的实验表明,与串行实现相比,对于非常稀疏的规则图,我们的方法可以加速15倍,对于更密集的规则图,减速高达5倍,最后是幂律分布Kronecker图的50倍减速。此实现已开源,可进一步研究在GPU上开发组合图算法。
    The matching problem formulated as Maximum Cardinality Matching in General Graphs (MCMGG) finds the largest matching on graphs without restrictions. The Micali-Vazirani algorithm has the best asymptotic complexity for solving MCMGG when the graphs are sparse. Parallelizing matching in general graphs on the GPU is difficult for multiple reasons. First, the augmenting path procedure is highly recursive, and NVIDIA GPUs use registers to store kernel arguments, which eventually spill into cached device memory, with a performance penalty. Second, extracting parallelism from the matching process requires partitioning the graph to avoid any overlapping augmenting paths. We propose an implementation of the Micali-Vazirani algorithm which identifies bridge edges using thread-parallel breadth-first search, followed by block-parallel path augmentation and blossom contraction. Augmenting path and Union-find methods were implemented as stack-based iterative methods, with a stack allocated in shared memory. Our experimentation shows that compared to the serial implementation, our approach results in up to 15-fold speed-up for very sparse regular graphs, up to 5-fold slowdown for denser regular graphs, and finally a 50-fold slowdown for power-law distributed Kronecker graphs. This implementation has been open-sourced for further research on developing combinatorial graph algorithms on GPUs.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    临床研究中经常使用观察性研究来估计治疗或暴露对结果的影响。为了在估计治疗效果时减少混杂因素的影响,协变量平衡方法经常被实施。这项研究评估,使用广泛的蒙特卡罗模拟,协变量平衡的几种方法,和两种倾向得分估计方法,用于使用Cox比例风险模型的风险比估计对治疗者的平均治疗效果。关于最小化偏差和最大化治疗效果的准确性(通过均方误差衡量),对处理后权重的平均处理效果,精细分层,与传统逻辑回归模型的最佳完全匹配在所有模拟条件下的倾向得分表现最佳。其他方法在特定情况下表现良好,例如,当样本量较大(n=5000)且处理的比例<0.25时,配对匹配。加权方法的统计能力通常高于匹配方法,对于具有无偏治疗效果估计的平衡方法,I型错误率等于或低于标称水平。随着地层数量的增加,有效样本量也在减少,因此,对于基于分层的加权方法,考虑更少的地层可能很重要。一般来说,我们推荐在我们的模拟中表现良好的方法,尽管确定性能良好的方法必然受到我们模拟的特定特征的限制。使用现实世界的例子说明了这些方法,该例子比较了有中风风险的高血压患者中的β受体阻滞剂和血管紧张素转换酶抑制剂。
    Observational studies are frequently used in clinical research to estimate the effects of treatments or exposures on outcomes. To reduce the effects of confounding when estimating treatment effects, covariate balancing methods are frequently implemented. This study evaluated, using extensive Monte Carlo simulation, several methods of covariate balancing, and two methods for propensity score estimation, for estimating the average treatment effect on the treated using a hazard ratio from a Cox proportional hazards model. With respect to minimizing bias and maximizing accuracy (as measured by the mean square error) of the treatment effect, the average treatment effect on the treated weighting, fine stratification, and optimal full matching with a conventional logistic regression model for the propensity score performed best across all simulated conditions. Other methods performed well in specific circumstances, such as pair matching when sample sizes were large (n = 5000) and the proportion treated was < 0.25. Statistical power was generally higher for weighting methods than matching methods, and Type I error rates were at or below the nominal level for balancing methods with unbiased treatment effect estimates. There was also a decreasing effective sample size with an increasing number of strata, therefore for stratification-based weighting methods, it may be important to consider fewer strata. Generally, we recommend methods that performed well in our simulations, although the identification of methods that performed well is necessarily limited by the specific features of our simulation. The methods are illustrated using a real-world example comparing beta blockers and angiotensin-converting enzyme inhibitors among hypertensive patients at risk for incident stroke.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    尽管射血分数正常,但多达30%的严重主动脉瓣狭窄(SAS)患者(主动脉瓣面积指数[AVAi]<0.6cm2/m2)表现出低的跨瓣梯度。关于该实体的预后意义存在激烈的争论。
    本研究的目的是比较不一致低梯度SAS(DLG-SAS)与中度主动脉瓣狭窄(MAS)和高梯度SAS(HG-SAS)患者的预后。
    我们使用BEL-F-ASt(比利时-法国-主动脉狭窄)注册表,包括连续的AS患者。比较了临床和影像学变量的总体和匹配(逆概率加权和倾向评分匹配)后的生存率。该分析首先在总体人群(n=2,582)中进行,然后在未手术患者人群(n=1,812)中进行。
    后逆概率加权匹配,3组平衡。MAS的五年生存率优于DLG-SAS和HG-SAS患者(58.9%vs47%vs41.2%,P<0.001)。在未手术的患者中获得了类似的结果(54.1%vs37.9%vs28.1%,P<0.001)。探讨MG(≤40vs>40mmHg)和AVAi(<0.6vs≥0.6cm2/m2)对结局的影响,比较了HG-vsDLG-SAS和MAS-DLG-SAS的倾向评分匹配队列的生存率.与MG匹配后,MAS的生存率优于DLG-SAS(52%vs40%,P<0.001)。在匹配AVAi之后,DLG-SAS患者的生存率优于HG-SAS患者(45%vs33%,P<0.001)。
    DLG-SAS患者的生存率优于HG-SAS,比MAS患者的生存率差。在可比的MG,AVAi越低,预后越差,而在类似的AVAi,MG越高,预后越差.这些数据表明DLG-SAS是疾病连续体中的中间形式。
    UNASSIGNED: Up to 30% of patients with severe aortic stenosis (SAS) (indexed aortic valve area [AVAi] <0.6 cm2/m2) exhibit low-transvalvular gradient despite normal ejection fraction. There is intense debate regarding the prognostic significance of this entity.
    UNASSIGNED: The purpose of this study was to compare the outcome of patients with discordant low-gradient SAS (DLG-SAS) vs moderate aortic stenosis (MAS) and high-gradient SAS (HG-SAS).
    UNASSIGNED: We used the BEL-F-ASt (Belgium-France-Aortic Stenosis) registry including consecutive patients with AS. Survival was compared overall and after matching (inverse probability weighting and propensity-score matching) for clinical and imaging variables. The analysis was first performed in the overall population (n = 2,582) and then in the population of unoperated patients (n = 1,812).
    UNASSIGNED: After-inverse probability weighting-matching, the 3 groups were balanced. Five-year survival was better in MAS than in DLG-SAS and HG-SAS-patients (58.9% vs 47% vs 41.2%, P < 0.001). Similar results were obtained in unoperated patients (54.1% vs 37.9% vs 28.1%, P < 0.001). To explore the impact of MG (≤40 vs >40 mmHg) and AVAi (<0.6 vs ≥0.6 cm2/m2) on outcomes, survival of propensity score-matched cohorts of HG-vs DLG-SAS and MAS vs DLG-SAS were compared. After matching for MG, survival was better in MAS than in DLG-SAS (52% vs 40%, P < 0.001). After matching for AVAi, survival was better in DLG-SAS than in HG-SAS patients (45% vs 33%, P < 0.001).
    UNASSIGNED: Survival of DLG-SAS is better than that of HG-SAS and worse than that of MAS patients. At comparable MG, the lower the AVAi, the worse the prognosis, whereas at comparable AVAi, the higher the MG, the worse the prognosis. These data argue that DLG-SAS is an intermediate form in the disease continuum.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    保护区保护生物多样性和生态系统功能,但可能会阻碍当地经济增长。了解保护区有效性与邻近社区经济增长之间的全球格局和预测因素,可以更好地实施《昆明-蒙特利尔全球生物多样性框架》。我们用匹配的样本评估了全球10,143个保护区,以解决保护区的非随机位置。我们的结果表明,保护区可以抵抗人为引起的土地覆盖变化,并且不会限制邻近定居点的夜灯增加。这个结果是强大的,使用不同的匹配技术,参数设置,和协变量的选择。我们确定了每个保护区的土地覆盖变化和夜灯变化之间的四种类型的关系:“协同,\"\"撤退,\"和两个权衡关系。尽管邻近社区的夜灯增加,但约有一半的保护区(47.5%)仍保留其自然土地覆盖。这种协同关系在全球范围内最常见,但在生物群落和大陆之间有所不同。协同作用在亚马逊地区不太频繁,东南亚,和一些发展中地区,大多数生物多样性居住在那里,遭受更多的贫困。较小的保护区和更好地进入城市的地区,适度的道路密度,和更好的基线经济条件有更高的概率达到协同作用。我们的结果很有希望,随着保护区的扩大和物种保护的增加,将更多地依赖于用较小的保护区保护人类改造的景观。未来的干预措施应同时解决地方发展和生物多样性保护问题,以实现更多的共同利益。
    Protected areas conserve biodiversity and ecosystem functions but might impede local economic growth. Understanding the global patterns and predictors of different relationships between protected area effectiveness and neighboring community economic growth can inform better implementation of the Kunming-Montreal Global Biodiversity Framework. We assessed 10,143 protected areas globally with matched samples to address the non-random location of protected areas. Our results show that protected areas resist human-induced land cover changes and do not limit nightlight increases in neighboring settlements. This result is robust, using different matching techniques, parameter settings, and selection of covariates. We identify four types of relationships between land cover changes and nightlight changes for each protected area: \"synergy,\" \"retreat,\" and two tradeoff relationships. About half of the protected areas (47.5%) retain their natural land cover and do so despite an increase of nightlights in the neighboring communities. This synergy relationship is the most common globally but varies between biomes and continents. Synergy is less frequent in the Amazon, Southeast Asia, and some developing areas, where most biodiversity resides and which suffer more from poverty. Smaller protected areas and those with better access to cities, moderate road density, and better baseline economic conditions have a higher probability of reaching synergy. Our results are promising, as the expansion of protected areas and increased species protection will rely more on conserving the human-modified landscape with smaller protected areas. Future interventions should address local development and biodiversity conservation together to achieve more co-benefits.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    最近,在开发医疗领域的新技术和设备方面取得了长足的进步,包括微创手术.评估这些治疗的有效性需要研究设计,如随机对照试验。然而,由于某些治疗的性质,随机化并不总是可行的,导致观察性研究的使用。从观察性研究中估计的效应大小受到混杂因素造成的选择偏差的影响。减少这种偏差的一种方法是倾向评分。本研究旨在使用R的实际例子在两组之间引入倾向得分匹配过程。雷克斯,Excel插件图形用户界面统计程序,提供给不熟悉R编程的研究人员。进一步的技术,例如与三个或更多的组匹配,倾向得分加权和分层,以及缺失值的估算,被总结为提供本教程中未涵盖的更复杂研究的方法。
    Recently, there has been considerable progress in developing new technologies and equipment for the medical field, including minimally invasive surgeries. Evaluating the effectiveness of these treatments requires study designs like randomized controlled trials. However, due to the nature of certain treatments, randomization is not always feasible, leading to the use of observational studies. The effect size estimated from observational studies is subject to selection bias caused by confounders. One method to reduce this bias is propensity scoring. This study aimed to introduce a propensity score matching process between two groups using a practical example with R. Additionally, Rex, an Excel add-in graphical user interface statistical program, is provided for researchers unfamiliar with R programming. Further techniques, such as matching with three or more groups, propensity score weighting and stratification, and imputation of missing values, are summarized to offer approaches for more complex studies not covered in this tutorial.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    这项研究评估了针对乌干达弱势和贫困孕妇的代金券项目的影响。获得高度补贴的代金券可获得一揽子安全分娩服务,包括四次产前检查,安全交付,一次产后访视,与妊娠相关的医疗条件和并发症的治疗和管理,紧急运输。优惠券在2016年至2019年的项目运营期间出售。这项研究涵盖了乌干达25个项目受益地区中的8个,共有1881次怀孕,包括受益人和非受益人母亲。使用匹配的设计,结果显示对新生婴儿的生存有积极影响。对照组与治疗组生存率差异为5.4%分,这表明代金券项目将婴儿死亡率降低了65%以上。
    This study assesses the impact of a voucher project that targeted vulnerable and poor pregnant women in Uganda. Highly subsidised vouchers gave access to a package of safe delivery services consisting of four antenatal visits, safe delivery, one postnatal visit, the treatment and management of selected pregnancy-related medical conditions and complications, and emergency transport. Vouchers were sold during the project\'s operational period from 2016 to 2019. This study covers 8 out of 25 project-benefiting districts in Uganda and a total of 1,881 pregnancies, including both beneficiary and non-beneficiary mothers. Using a matching design, the results show a positive effect on the survival of new-born babies. The difference in the survival rate between the control group and the treatment group is 5.4% points, indicating that the voucher project reduced infant mortality by more than 65 per cent.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:初次中风或短暂性脑缺血发作(TIA)的患者有进一步中风的高风险,死亡或心血管事件。即使是第一次中风也与残疾和需要帮助的高机会有关。长期医疗保健需求的风险随着每次后续事件的发生而增加。尽管德国的住院部门已经提供了高标准的护理,很难获得跨部门的善后护理。因此,该研究调查了结构化病例管理计划是否可以避免卒中复发.
    方法:该研究是在北莱茵-威斯特法伦州的三个地区进行的准实验研究设计。首次中风或TIA的患者有资格参加。在一年的随访期间,前瞻性地招募了干预组,并得到了病例经理的支持。最佳完全匹配用于基于法定索赔数据生成控制组。主要结果是卒中复发。使用Cox回归分析复发和死亡率;其他次要结果使用基于测试的程序和逻辑回归检查。此外,进行亚组分析.
    结果:从2018年6月至2020年3月,干预组纳入了1,512例患者。已传输了19,104名患者的索赔数据,以建立对照组。在匹配过程之后,每组1,167例患者纳入分析。干预组复发70例(6.0%),对照组复发67例(5.7%)。风险比为1.06(95%CI:[1.42-0.69];p=0.69),对主要结局无显著影响.关于次要结果死亡率,干预组死亡36例,对照组死亡46例(3.1%vs.3.9%)。再一次,无显著影响(HR:0.86;95%CI:[0.58-1.28],p=0.46)。
    结论:根据目前的发现,此处评估的卒中患者病例管理方法无法证明医疗保健有所改善.病例管理的潜在影响可能无法在短时间内得到充分描述。因此,未来的研究应该考虑更长的观察期.
    结论:专家小组应讨论支持成本密集型个体病例的病例管理的核心方法是否与对卒中患者进行一刀切的干预的广泛实施相反。在这种情况下,进一步的研究应该集中在更具体的研究人群上。
    BACKGROUND: Patients with initial stroke or transient ischemic attack (TIA) are at high risk for further strokes, death or cardiovascular events. Even the first-ever stroke is associated with a high chance of disability and need for assistance. The risk of long-term health care demands increases with each subsequent event. Although the inpatient sector already provides a high standard of care in Germany, it can be difficult to obtain cross-sectoral aftercare. Thus, the study investigated whether a structured case management program can avoid stroke recurrences.
    METHODS: The study was conducted with a quasi-experimental study design in three regions in North Rhine-Westphalia. Patients with first-ever stroke or TIA were eligible to participate. The intervention group was prospectively recruited and supported by a case manager during a one-year follow-up. Optimal Full Matching was used to generate a control group based on statutory claims data. The primary outcome was the stroke recurrence. Recurrence and mortality were analysed by using Cox regression; other secondary outcomes were examined with test-based procedures and with logistic regressions. Additionally, subgroup analyses were performed.
    RESULTS: From June 2018 to March 2020, 1,512 patients were enrolled in the intervention group. Claims data from 19,104 patients have been transmitted for establishing the control group. After the matching process, 1,167 patients of each group were included in the analysis. 70 recurrences (6.0%) occurred in the intervention group and 67 recurrences (5.7%) in the control group. With a hazard ratio of 1.06 (95% CI: [1.42-0.69]; p=0.69), no significant effect was found for the primary outcome. With regard to the secondary outcome mortality, 36 patients in the intervention group and 46 in the control group died (3.1% vs. 3.9%). Again, there was no significant effect (HR: 0.86; 95% CI: [0.58-1.28], p=0.46).
    CONCLUSIONS: Based on the present findings, the case management approach for stroke patients evaluated here was unable to demonstrate an improvement in health care. Potential effects of case management might not be adequately depicted in short observation periods. Thus, future studies should consider longer observation periods.
    CONCLUSIONS: A panel of experts should discuss whether the core approach of case management to support cost-intensive individual cases is contrary to a broad implementation with a one-size-fits-all intervention for stroke patients. In this case, further research should focus on more specific study populations.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    完整的人类行为科学需要对人类表现出的言语行为进行全面说明。这种言语行为的现有行为理论对语言的基本功能产生了令人信服的洞察力,但是不幸的是,这些理论应得的广泛研究计划发展缓慢。我们认为,现状的手动实现和特定于研究的编码系统太资源密集,对于大多数行为分析师来说是值得的。这些高投入成本反过来阻碍了对整体言语行为的研究。我们提出了基于词典的情感分析,作为研究人类言语产品的一种更现代,更有效的方法,尤其是自然发生的(例如,心理治疗记录,社交媒体帖子)。在目前的讨论中,我们向读者介绍情感分析的原理,强调其作为一种行为分析工具的有用性,用于研究言语行为。最后,我们概述了处理一些更复杂的语音形式的方法,像否定,讽刺,和猜测。附录还提供了一个有效的示例,说明如何将情感分析应用于行为分析中的现有问题,完成代码,读者可以纳入自己的工作。
    A complete science of human behavior requires a comprehensive account of the verbal behavior those humans exhibit. Existing behavioral theories of such verbal behavior have produced compelling insight into language\'s underlying function, but the expansive program of research those theories deserve has unfortunately been slow to develop. We argue that the status quo\'s manually implemented and study-specific coding systems are too resource intensive to be worthwhile for most behavior analysts. These high input costs in turn discourage research on verbal behavior overall. We propose lexicon-based sentiment analysis as a more modern and efficient approach to the study of human verbal products, especially naturally occurring ones (e.g., psychotherapy transcripts, social media posts). In the present discussion, we introduce the reader to principles of sentiment analysis, highlighting its usefulness as a behavior analytic tool for the study of verbal behavior. We conclude with an outline of approaches for handling some of the more complex forms of speech, like negation, sarcasm, and speculation. The appendix also provides a worked example of how sentiment analysis could be applied to existing questions in behavior analysis, complete with code that readers can incorporate into their own work.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    在具有层次结构的大规模观测数据中,集群和干预措施通常都有两个以上的层次。二元治疗文献中的流行方法不会自然地扩展到分层多级治疗案例。例如,在COVID-19大流行期间,大多数K-12和大学都转向了前所未有的混合学习模块,其中的学习模式包括混合学习和完全远程学习,而学生聚集在一个班级和学校区域内。在分层数据结构中评估多级治疗的学习结果的有效性是具有挑战性的。在本文中,我们研究了协变量匹配方法,并开发了广义倾向得分匹配方法,以减少干预效果估计的偏差.我们还提出了简单的算法来评估每种方法的协变量平衡。我们通过仿真研究检查了这些方法的有限样本性能,并应用所提出的方法来分析COVID-19大流行期间学习模式的有效性。
    In large-scale observational data with a hierarchical structure, both clusters and interventions often have more than two levels. Popular methods in the binary treatment literature do not naturally extend to the hierarchical multilevel treatment case. For example, most K-12 and universities have moved to an unprecedented hybrid learning module during the COVID-19 pandemic where learning modes include hybrid and fully remote learning, while students were clustered within a class and school region. It is challenging to evaluate the effectiveness of the learning outcomes of the multilevel treatments in a hierarchically data structured. In this paper, we study a covariates matching method and develop a generalized propensity score matching method to reduce the bias of estimation in the intervention effect. We also propose simple algorithms to assess the covariates balance for each approach. We examine the finite sample performance of the methods via simulation studies and apply the proposed methods to analyze the effectiveness of learning modes during the COVID-19 pandemic.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号