benchmarking

基准
  • 文章类型: Journal Article
    背景:单细胞染色质可及性测定,例如scATAC-seq,越来越多地用于单细胞的个体和联合多维谱分析。随着scATAC-seq和多组学数据集的不断积累,分析这种稀疏的挑战,嘈杂,高维数据变得紧迫。具体来说,一个挑战涉及优化染色质水平测量的处理和有效地提取信息以辨别细胞异质性。这是至关重要的,因为细胞类型的识别是当前单细胞数据分析实践中的基本步骤。
    结果:我们对来自5种最新方法的8个特征工程管道进行了基准测试,以评估它们发现和区分细胞类型的能力。通过使用在单元格嵌入时计算的10个度量,共享最近邻居图,或分区级别,我们评估每种方法在不同数据处理阶段的性能。这种全面的方法使我们能够彻底了解每种方法的优缺点以及参数选择的影响。
    结论:我们的分析为选择不同数据集的分析方法提供了指导。总的来说,特征聚合,SnapATAC,和SnapATAC2优于基于潜在语义索引的方法。对于具有复杂细胞类型结构的数据集,SnapATAC和SnapATAC2是优选的。对于大型数据集,SnapATAC2和ArchR是最具可扩展性的。
    BACKGROUND: Single-cell chromatin accessibility assays, such as scATAC-seq, are increasingly employed in individual and joint multi-omic profiling of single cells. As the accumulation of scATAC-seq and multi-omics datasets continue, challenges in analyzing such sparse, noisy, and high-dimensional data become pressing. Specifically, one challenge relates to optimizing the processing of chromatin-level measurements and efficiently extracting information to discern cellular heterogeneity. This is of critical importance, since the identification of cell types is a fundamental step in current single-cell data analysis practices.
    RESULTS: We benchmark 8 feature engineering pipelines derived from 5 recent methods to assess their ability to discover and discriminate cell types. By using 10 metrics calculated at the cell embedding, shared nearest neighbor graph, or partition levels, we evaluate the performance of each method at different data processing stages. This comprehensive approach allows us to thoroughly understand the strengths and weaknesses of each method and the influence of parameter selection.
    CONCLUSIONS: Our analysis provides guidelines for choosing analysis methods for different datasets. Overall, feature aggregation, SnapATAC, and SnapATAC2 outperform latent semantic indexing-based methods. For datasets with complex cell-type structures, SnapATAC and SnapATAC2 are preferred. With large datasets, SnapATAC2 and ArchR are most scalable.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    这项横断面研究评估了患者与临床因素之间的关联以及急诊科医生使用电子健康记录(EHR)系统花费的时间变化。
    This cross-sectional study assesses the associations between patient and clinical factors and variations in time emergency department physicians spend using electronic health record (EHR) systems.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    分子性质预测(MPP)对于药物发现至关重要,作物保护,和环境科学。在过去的几十年里,已经开发了各种各样的计算技术,从在统计模型和经典机器学习中使用简单的物理和化学性质以及分子指纹到高级深度学习方法。在这次审查中,我们的目标是从当前关于采用变压器模型进行MPP的研究中提取见解。我们分析了当前可用的模型,并探讨了在为MPP训练和微调变压器模型时出现的关键问题。这些问题包括预训练数据的选择和规模,最优架构选择,和有前途的培训前目标。我们的分析突出了当前研究尚未涵盖的领域,邀请进一步探索,以增进对该领域的理解。此外,我们应对比较不同模型的挑战,强调需要标准化的数据拆分和稳健的统计分析。
    Molecular Property Prediction (MPP) is vital for drug discovery, crop protection, and environmental science. Over the last decades, diverse computational techniques have been developed, from using simple physical and chemical properties and molecular fingerprints in statistical models and classical machine learning to advanced deep learning approaches. In this review, we aim to distill insights from current research on employing transformer models for MPP. We analyze the currently available models and explore key questions that arise when training and fine-tuning a transformer model for MPP. These questions encompass the choice and scale of the pretraining data, optimal architecture selections, and promising pretraining objectives. Our analysis highlights areas not yet covered in current research, inviting further exploration to enhance the field\'s understanding. Additionally, we address the challenges in comparing different models, emphasizing the need for standardized data splitting and robust statistical analysis.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景用于检测有临床意义的前列腺癌(csPCa)的前列腺MRI是由前列腺成像报告和数据系统(PI-RADS)标准化的。目前在2.1版本中。建立了具有12个月更新周期的系统回顾和荟萃分析基础设施,以评估PI-RADS随时间的诊断性能。目的提供PI-RADS2.1类前列腺MRI诊断准确性和癌症检出率(CDR)的估计,这是进一步的循证患者管理所必需的。材料与方法PubMed的系统搜索,Embase,科克伦图书馆,并进行了多项试验登记(2019年3月1日至2022年8月30日发表的英语研究).纳入了以csPCa为主要结果的PI-RADS2.1版的诊断准确性或CDR数据的研究。对于荟萃分析,敏感性的汇总估计,特异性,和CDRs来自病变水平和患者水平的提取数据。研究了PI-RADS大于或等于3和PI-RADS大于或等于4被认为是测试阳性的敏感性和特异性。除了单个PI-RADS类别1-5外,子类别的亚组分析(即,进行2+1、3+0)。结果共70项研究(11686个病灶,包括13330名患者)。在患者层面,PI-RADS大于或等于3时认为是阳性,荟萃分析发现,总敏感度为96%(95%CI:95,98),特异性为43%(95%CI:33,54),总受试者工作特征(SROC)曲线下面积为0.86(95%CI:0.75,0.93)。对于PI-RADS大于或等于4,荟萃分析发现,敏感性为89%(95%CI:85,92)和特异性为66%(95%CI:58,74)。SROC曲线下面积为0.89(95%CI:0.85,0.92)。CDR如下:PI-RADS1,6%;PI-RADS2,5%;PI-RADS3,19%;PI-RADS4,54%;和PI-RADS5,84%。过渡区2+1病变的CDR为12%(95%CI:7,19),3+0病变的CDR为19%(95%CI:12,29)(P=.12)。结论PI-RADS2.1类的诊断准确性和CDR估计可用于质量基准测试,并指导进一步的循证患者管理。©RSNA,2024补充材料可用于本文。另见本期Tammisetti和Jacobs的社论。
    Background Prostate MRI for the detection of clinically significant prostate cancer (csPCa) is standardized by the Prostate Imaging Reporting and Data System (PI-RADS), currently in version 2.1. A systematic review and meta-analysis infrastructure with a 12-month update cycle was established to evaluate the diagnostic performance of PI-RADS over time. Purpose To provide estimates of diagnostic accuracy and cancer detection rates (CDRs) of PI-RADS version 2.1 categories for prostate MRI, which is required for further evidence-based patient management. Materials and Methods A systematic search of PubMed, Embase, Cochrane Library, and multiple trial registers (English-language studies published from March 1, 2019, to August 30, 2022) was performed. Studies that reported data on diagnostic accuracy or CDRs of PI-RADS version 2.1 with csPCa as the primary outcome were included. For the meta-analysis, pooled estimates for sensitivity, specificity, and CDRs were derived from extracted data at the lesion level and patient level. Sensitivity and specificity for PI-RADS greater than or equal to 3 and PI-RADS greater than or equal to 4 considered as test positive were investigated. In addition to individual PI-RADS categories 1-5, subgroup analyses of subcategories (ie, 2+1, 3+0) were performed. Results A total of 70 studies (11 686 lesions, 13 330 patients) were included. At the patient level, with PI-RADS greater than or equal to 3 considered positive, meta-analysis found a 96% summary sensitivity (95% CI: 95, 98) and 43% specificity (95% CI: 33, 54), with an area under the summary receiver operating characteristic (SROC) curve of 0.86 (95% CI: 0.75, 0.93). For PI-RADS greater than or equal to 4, meta-analysis found an 89% sensitivity (95% CI: 85, 92) and 66% specificity (95% CI: 58, 74), with an area under the SROC curve of 0.89 (95% CI: 0.85, 0.92). CDRs were as follows: PI-RADS 1, 6%; PI-RADS 2, 5%; PI-RADS 3, 19%; PI-RADS 4, 54%; and PI-RADS 5, 84%. The CDR was 12% (95% CI: 7, 19) for transition zone 2+1 lesions and 19% (95% CI: 12, 29) for 3+0 lesions (P = .12). Conclusion Estimates of diagnostic accuracy and CDRs for PI-RADS version 2.1 categories are provided for quality benchmarking and to guide further evidence-based patient management. © RSNA, 2024 Supplemental material is available for this article. See also the editorial by Tammisetti and Jacobs in this issue.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    暂无摘要。
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    微卫星不稳定性(MSI)是在几种癌症类型中看到的现象,它可以作为生物标志物来帮助指导免疫检查点抑制剂的治疗。为了促进这一点,研究人员已经开发了计算工具来将样本分类为具有高微卫星不稳定性,或者使用下一代测序数据作为微卫星稳定。这些工具中的大多数都发布了不清楚的范围和用法,他们还没有独立的基准。为了解决这些问题,我们评估了8种领先的MSI工具在几个独特的数据集的性能,这些数据集涵盖了多种测序方法.虽然我们能够在整个外显子组测序数据上复制每个工具的原始发现,在全基因组测序数据上,大多数工具的接受者工作特性和精确召回率曲线下面积较差.我们还发现,他们彼此之间以及与商业MSI软件在基因面板数据上缺乏共识,和最佳阈值截止因测序类型而异。最后,我们测试了专门用于RNA测序数据的工具,发现这些工具在设计用于DNA测序数据的工具中表现优异.最重要的是,两个工具(MSISensor2,MANTIS)在几乎所有数据集上都表现良好,但是当所有数据集合并时,精度下降。我们的结果警告说,与最初评估的数据集相比,MSI工具在数据集上的性能可能要低得多。就RNA测序工具而言,甚至可能对创建它们的数据类型表现不佳。
    Microsatellite instability (MSI) is a phenomenon seen in several cancer types, which can be used as a biomarker to help guide immune checkpoint inhibitor treatment. To facilitate this, researchers have developed computational tools to categorize samples as having high microsatellite instability, or as being microsatellite stable using next-generation sequencing data. Most of these tools were published with unclear scope and usage, and they have yet to be independently benchmarked. To address these issues, we assessed the performance of eight leading MSI tools across several unique datasets that encompass a wide variety of sequencing methods. While we were able to replicate the original findings of each tool on whole exome sequencing data, most tools had worse receiver operating characteristic and precision-recall area under the curve values on whole genome sequencing data. We also found that they lacked agreement with one another and with commercial MSI software on gene panel data, and that optimal threshold cut-offs vary by sequencing type. Lastly, we tested tools made specifically for RNA sequencing data and found they were outperformed by tools designed for use with DNA sequencing data. Out of all, two tools (MSIsensor2, MANTIS) performed well across nearly all datasets, but when all datasets were combined, their precision decreased. Our results caution that MSI tools can have much lower performance on datasets other than those on which they were originally evaluated, and in the case of RNA sequencing tools, can even perform poorly on the type of data for which they were created.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Dataset
    当使用宏基因组学鸟枪测序时,分类分类对于识别不同微生物群落中的生物至关重要。虽然第二代Illumina测序仍然占主导地位,第三代纳米孔测序有望通过更长的读数改善分类。然而,缺乏对纳米孔数据的广泛基准研究。我们系统地评估了几种常用分类器的宏基因组学纳米孔测序数据的细菌分类学分类性能,使用标准化的参考序列数据库,关于迄今为止定义的模拟社区最大的公开数据收集(九个样本),代表不同的研究领域和应用范围。我们的结果将分类器分为三类:低精度/高召回率;中等精度/中等召回率,和高精度/中等召回率。大多数属于第一组,尽管通过适当的丰度过滤可以在不过度惩罚召回的情况下提高精度。没有明确的“最佳”分类器出现,分类器的选择取决于应用范围和实际需求。尽管很少有为长读取设计的分类器,它们通常表现出更好的性能。我们全面的基准提供了具体的建议,由其他科学家重新评估和微调的公开可用代码支持。
    Taxonomic classification is crucial in identifying organisms within diverse microbial communities when using metagenomics shotgun sequencing. While second-generation Illumina sequencing still dominates, third-generation nanopore sequencing promises improved classification through longer reads. However, extensive benchmarking studies on nanopore data are lacking. We systematically evaluated performance of bacterial taxonomic classification for metagenomics nanopore sequencing data for several commonly used classifiers, using standardized reference sequence databases, on the largest collection of publicly available data for defined mock communities thus far (nine samples), representing different research domains and application scopes. Our results categorize classifiers into three categories: low precision/high recall; medium precision/medium recall, and high precision/medium recall. Most fall into the first group, although precision can be improved without excessively penalizing recall with suitable abundance filtering. No definitive \'best\' classifier emerges, and classifier selection depends on application scope and practical requirements. Although few classifiers designed for long reads exist, they generally exhibit better performance. Our comprehensive benchmarking provides concrete recommendations, supported by publicly available code for reassessment and fine-tuning by other scientists.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    药物基因组学(PGx)研究遗传学对药物反应的影响,为个性化医疗保健提供量身定制的治疗方法。这项研究评估了使用四种不同的计算工具和各种测序深度的全基因组测序对六种基因进行基因分型的准确性。还探索了使用不同参考基因组(GRCh38和GRCh37)和序列比对(BWA-MEM和Bowtie2)的效果。结果表明,大多数基因的工具性能通常存在较小的差异;然而,在复杂CYP2D6基因的分析中观察到更显著的差异.Cyrius,CYP2D6专用工具,展示了最强大的性能,在所有情况下实现CYP2D6的最高一致率,在大多数情况下,与共识方法相当。具有20倍覆盖深度的样本与具有较高深度的样本之间存在相当小的差异,但是在较低的深度表现下降更明显,特别是在5×此外,当使用相同的方法将样品与不同的参考基因组比对时,观察到CYP2D6结果的变化,或者使用不同的对齐器对相同的基因组,这导致在一些情况下报告不正确的罕见恒星等位基因。这些发现为选择最佳的PGx工具和方法提供了信息,并表明采用两种或多种工具的共识方法对于某些基因和工具组合可能更可取。尤其是在较低的测序深度,确保结果准确。此外,我们展示了上游对齐如何影响工具的性能,一个需要考虑的重要因素。
    Pharmacogenomics (PGx) investigates the influence of genetics on drug responses, enabling tailored treatments for personalized healthcare. This study assessed the accuracy of genotyping six genes using whole genome sequencing with four different computational tools and various sequencing depths. The effects of using different reference genomes (GRCh38 and GRCh37) and sequence aligners (BWA-MEM and Bowtie2) were also explored. The results showed generally minor variations in tool performance across most genes; however, more notable discrepancies were observed in the analysis of the complex CYP2D6 gene. Cyrius, a CYP2D6-specific tool, demonstrated the most robust performance, achieving the highest concordance rates for CYP2D6 in all instances, comparable to the consensus approach in most cases. There were rather small differences between the samples with 20× coverage depth and those with higher depth, but the decreased performance was more evident at lower depths, particularly at 5×. Additionally, variations in CYP2D6 results were observed when samples were aligned to different reference genomes using the same method, or to the same genome using different aligners, which led to reporting incorrect rare star alleles in several cases. These findings inform the selection of optimal PGx tools and methodologies as well as suggest that employing a consensus approach with two or more tools might be preferable for certain genes and tool combinations, especially at lower sequencing depths, to ensure accurate results. Additionally, we show how the upstream alignment can affect the performance of tools, an important factor to take into account.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:空间转录组学(ST)正在促进我们对复杂组织和生物体的理解。然而,构建稳健的聚类算法以在单个组织切片中定义空间相干区域,并对齐或整合源自不同来源的多个组织切片以进行必要的下游分析仍然具有挑战性。众多的集群,对齐,集成方法是通过利用ST数据的空间信息专门为ST数据设计的。缺乏全面的基准研究使方法的选择和未来的方法开发变得复杂。
    结果:在这项研究中,我们系统地对各种最先进的算法进行基准测试,使用各种大小不同的真实和模拟数据集,技术,物种,和复杂性。我们使用不同的定量和定性指标和分析来分析每种方法的优缺点,包括空间聚类准确性和连续性的八个指标,均匀流形逼近和投影可视化,逐层和逐点对准精度,和三维重建,旨在评估方法性能和数据质量。用于评估的代码可在我们的GitHub上找到。此外,我们提供在线笔记本教程和文档,以促进所有基准测试结果的复制,并支持新方法和新数据集的研究。
    结论:我们的分析得出了涵盖多个方面的全面建议,帮助用户为他们的特定需求选择最佳工具,并指导未来的方法开发。
    BACKGROUND: Spatial transcriptomics (ST) is advancing our understanding of complex tissues and organisms. However, building a robust clustering algorithm to define spatially coherent regions in a single tissue slice and aligning or integrating multiple tissue slices originating from diverse sources for essential downstream analyses remains challenging. Numerous clustering, alignment, and integration methods have been specifically designed for ST data by leveraging its spatial information. The absence of comprehensive benchmark studies complicates the selection of methods and future method development.
    RESULTS: In this study, we systematically benchmark a variety of state-of-the-art algorithms with a wide range of real and simulated datasets of varying sizes, technologies, species, and complexity. We analyze the strengths and weaknesses of each method using diverse quantitative and qualitative metrics and analyses, including eight metrics for spatial clustering accuracy and contiguity, uniform manifold approximation and projection visualization, layer-wise and spot-to-spot alignment accuracy, and 3D reconstruction, which are designed to assess method performance as well as data quality. The code used for evaluation is available on our GitHub. Additionally, we provide online notebook tutorials and documentation to facilitate the reproduction of all benchmarking results and to support the study of new methods and new datasets.
    CONCLUSIONS: Our analyses lead to comprehensive recommendations that cover multiple aspects, helping users to select optimal tools for their specific needs and guide future method development.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    分子对接的性能可以通过比较灵活采样的姿势与靶蛋白的反向结合腔的形状相似性来提高。通过执行富集驱动的优化,可以进一步提高这些基于伪配体或阴性图像的模型在对接重新评分中的有效性。这里,我们引入了一种新颖的以形状为中心的药效团建模算法O-LAP,该算法通过成对距离图聚类将重叠的原子含量聚集在一起,从而生成一类新的空腔填充模型。灵活对接的活性配体的排名靠前的姿势被用作建模输入,并且使用随机训练/测试划分对五个苛刻的药物靶标进行了彻底的基准测试,对多个替代聚类设置进行了彻底测试。在对接记录中,O-LAP建模通常在默认对接富集上有很大改进;此外,结果表明,聚类模型在刚性对接中效果良好。基于C++/Qt5的算法O-LAP通过GitHub(https://github.com/jvlehtonen/overlap-toolkit)在GNU通用公共许可证v3.0下发布。科学贡献:这项研究引入了O-LAP,一个基于C++/Qt5的图聚类软件,用于生成新型的形状聚焦药效团模型。在O-LAP建模中,靶蛋白腔内充满灵活对接的活性配体,重叠的配体原子是成簇的,并将所得模型的形状/静电势与灵活采样的分子对接姿势进行比较。基于全面的基准测试,O-LAP建模可确保对接评分和刚性对接的高度富集。
    The performance of molecular docking can be improved by comparing the shape similarity of the flexibly sampled poses against the target proteins\' inverted binding cavities. The effectiveness of these pseudo-ligands or negative image-based models in docking rescoring is boosted further by performing enrichment-driven optimization. Here, we introduce a novel shape-focused pharmacophore modeling algorithm O-LAP that generates a new class of cavity-filling models by clumping together overlapping atomic content via pairwise distance graph clustering. Top-ranked poses of flexibly docked active ligands were used as the modeling input and multiple alternative clustering settings were benchmark-tested thoroughly with five demanding drug targets using random training/test divisions. In docking rescoring, the O-LAP modeling typically improved massively on the default docking enrichment; furthermore, the results indicate that the clustered models work well in rigid docking. The C+ +/Qt5-based algorithm O-LAP is released under the GNU General Public License v3.0 via GitHub ( https://github.com/jvlehtonen/overlap-toolkit ). SCIENTIFIC CONTRIBUTION: This study introduces O-LAP, a C++/Qt5-based graph clustering software for generating new type of shape-focused pharmacophore models. In the O-LAP modeling, the target protein cavity is filled with flexibly docked active ligands, the overlapping ligand atoms are clustered, and the shape/electrostatic potential of the resulting model is compared against the flexibly sampled molecular docking poses. The O-LAP modeling is shown to ensure high enrichment in both docking rescoring and rigid docking based on comprehensive benchmark-testing.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号