关键词: biodiversity informatics online flora plant name identification species name matching spelling errors taxonomic databases

来  源:   DOI:10.1002/aps3.11388   PDF(Sci-hub)   PDF(Pubmed)

Abstract:
OBJECTIVE: The standardization of plant names is a critical step in various fields of biology, including biodiversity, biogeography, and vegetation research. The WorldFlora package is introduced here to help achieve this goal by matching lists of plant names with a static copy from World Flora Online (WFO), an ongoing global effort to complete an online flora of all known vascular plants and bryophytes by 2020.
RESULTS: Based on direct and fuzzy matching, WorldFlora inserts matching cases from the WFO to a submitted data set containing taxonomic names. The results and success rates for selecting the expected best single matches are presented for four data sets, including two data sets used in recent comparisons of software tools for correcting taxon names.
CONCLUSIONS: WorldFlora offers a straightforward pipeline for semi-automatic plant name checking. For the four data sets, the success rate of credible matches ranged from 94.7% to 99.9%.
摘要:
目的:植物名称的标准化是生物学各个领域的关键一步,包括生物多样性,生物地理学,和植被研究。此处介绍了WorldFlora软件包,以通过将植物名称列表与来自WorldFloraOnline(WFO)的静态副本进行匹配来帮助实现这一目标,正在进行的全球努力,到2020年完成所有已知维管植物和苔藓植物的在线植物区系。
结果:基于直接和模糊匹配,WorldFlora将来自WFO的匹配案例插入到包含分类名称的提交数据集。为四个数据集提供了选择预期最佳单个匹配的结果和成功率,包括最近比较用于纠正分类单元名称的软件工具的两个数据集。
结论:WorldFlora为半自动工厂名称检查提供了一条简单的管道。对于四个数据集,可信比赛的成功率从94.7%到99.9%不等。
公众号