K-means clustering algorithm

k - means 聚类算法
  • 文章类型: Journal Article
    本文强调了pH或质子活性测量在环境研究中的关键作用,并强调了在处理pH数据时应用适当统计方法的重要性。这允许做出更明智的决策,以有效地管理环境数据,例如采矿受影响的水。同一系统的pH和{H+}显示出不同的分布,pH值主要显示正常或双峰分布,{H}显示对数正态分布。因此,是否使用pH或{H+}来计算用于进一步环境统计分析的集中趋势的平均值或测量是一个挑战。在这项研究中,应用不同的统计技术来了解来自四个不同矿区的pH和{H+}的分布,Metsämonttu在芬兰,FelsendomeRabenstein在德国,南非的Eastrand和Westrand矿山水处理厂。根据统计结果,如果分布是单峰的,则几何平均值可用于计算pH的平均值。对于多峰pH数据分布,峰识别方法可用于提取每个数据群体的平均值,并将其用于进一步的统计分析。
    This paper highlights the critical role of pH or proton activity measurements in environmental studies and emphasises the importance of applying proper statistical approaches when handling pH data. This allows for more informed decisions to effectively manage environmental data such as from mining influenced water. Both the pH and {H+} of the same system display different distributions, with pH mostly displaying a normal or bimodal distribution and {H+} showing a lognormal distribution. It is therefore a challenge of whether to use pH or {H+} to compute the mean or measures of central tendency for further environmental statistical analyses. In this study, different statistical techniques were applied to understand the distribution of pH and {H+} from four different mine sites, Metsämonttu in Finland, Felsendome Rabenstein in Germany, Eastrand and Westrand mine water treatment plants in South Africa. Based on the statistical results, the geometric mean can be used to calculate the average of pH if the distribution is unimodal. For a multimodal pH data distribution, peak identifying methods can be applied to extract the mean for each data population and use them for further statistical analyses.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    鼻旁窦,由八个充满空气的空腔组成的两侧对称系统,代表马身体最复杂的部分之一。这项研究旨在从马头的计算机断层扫描(CT)图像中提取形态测量,并实施聚类分析,以计算机辅助识别与年龄相关的变化。18匹尸体马的头,2-25岁,被CT成像和分割以提取它们的体积,表面积,额窦(FS)的相对密度,背甲窦(DCS),腹侧耳廓窦(VCS),鼻端上颌窦(RMS),上颌窦(CMS),蝶窦(SS),腭窦(PS),和中耳窦(MCS)。数据分为年轻,中年,和老马群,并使用K-means聚类算法进行聚类。形态测量根据马匹的鼻窦位置和年龄而变化,而不是身体侧。VCS的体积和表面积,RMS,CMS随着马龄的增加而增加。RMS的精度值为0.72,CMS为0.67,VCS为0.31,RMS和CMS证实了基于CT的马鼻旁窦3D图像的年龄相关聚类的可能性,但VCS证明了这一可能性.
    The paranasal sinuses, a bilaterally symmetrical system of eight air-filled cavities, represent one of the most complex parts of the equine body. This study aimed to extract morphometric measures from computed tomography (CT) images of the equine head and to implement a clustering analysis for the computer-aided identification of age-related variations. Heads of 18 cadaver horses, aged 2-25 years, were CT-imaged and segmented to extract their volume, surface area, and relative density from the frontal sinus (FS), dorsal conchal sinus (DCS), ventral conchal sinus (VCS), rostral maxillary sinus (RMS), caudal maxillary sinus (CMS), sphenoid sinus (SS), palatine sinus (PS), and middle conchal sinus (MCS). Data were grouped into young, middle-aged, and old horse groups and clustered using the K-means clustering algorithm. Morphometric measurements varied according to the sinus position and age of the horses but not the body side. The volume and surface area of the VCS, RMS, and CMS increased with the age of the horses. With accuracy values of 0.72 for RMS, 0.67 for CMS, and 0.31 for VCS, the possibility of the age-related clustering of CT-based 3D images of equine paranasal sinuses was confirmed for RMS and CMS but disproved for VCS.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    山区地形粗糙,极易发生堰塞湖灾害,很少的植被,夏季降雨量高。通过测量水位变化,当泥石流阻塞河流或提高水位时,监测系统可以检测堰塞湖事件。因此,提出了一种基于混合分割算法的自动监控报警方法。该算法使用k-means聚类算法在RGB颜色空间中分割图片场景,在图像绿色通道上使用区域生长算法从分割的场景中选择河流目标。像素水位变化被用于在已经检索到水位之后触发针对堰塞湖事件的警报。在中国西藏自治区雅鲁藏布江流域,拟议的自动湖泊监测系统已安装。我们收集了2021年4月至11月的数据,在此期间河流经历了低谷,高,低水位。与传统的区域生长算法不同,该算法不依赖于工程知识来拾取种子点参数。使用我们的方法,准确率为89.29%,漏检率为11.76%,比传统的区域生长算法高29.12%,低17.65%,分别。监测结果表明,该方法是一种适应性强、准确度高的无人堰塞湖监测系统。
    Mountainous regions are prone to dammed lake disasters due to their rough topography, scant vegetation, and high summer rainfall. By measuring water level variation, monitoring systems can detect dammed lake events when mudslides block rivers or boost water level. Therefore, an automatic monitoring alarm method based on a hybrid segmentation algorithm is proposed. The algorithm uses the k-means clustering algorithm to segment the picture scene in the RGB color space and the region growing algorithm on the image green channel to select the river target from the segmented scene. The pixel water level variation is used to trigger an alarm for the dammed lake event after the water level has been retrieved. In the Yarlung Tsangpo River basin of the Tibet Autonomous Region of China, the proposed automatic lake monitoring system was installed. We pick up data from April to November 2021, during which the river experienced low, high, and low water levels. Unlike conventional region growing algorithms, the algorithm does not rely on engineering knowledge to pick seed point parameters. Using our method, the accuracy rate is 89.29% and the miss rate is 11.76%, which is 29.12% higher and 17.65% lower than the traditional region growing algorithm, respectively. The monitoring results indicate that the proposed method is a highly adaptable and accurate unmanned dammed lake monitoring system.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    National parks provide a considerable number of co-benefits to society, including the balance of ecosystems, conservation of heritage values, and tourism. However, studies on zoning approaches for the management of national parks are lacking. The landscape characterization approach is a holistic method for identifying regional landscapes and helps improve zoning management, thus promoting sustainable planning. Here, we propose a landscape character classification (LCC) approach for national parks by integrating a k-means clustering algorithm and geographic information system (GIS). We used Laoshan National Park (LNP) as a case study and aimed to (1) quantify the major landscape factors (altitude, topography relief, soil type, and heritage impact intensity) that influence the landscape classification of mountainous protected areas; (2) create a map of landscape character types and areas to guide a zoning boundary; and (3) further examine how decision makers assign different conservation strategies to each landscape character area. Our results indicate that different landscape character areas reflect distinct ecological environments and heritage values and that differentiated zoning management can effectively mitigate the impact of natural disasters and human activities. Our study suggests that national parks require scientific landscape character zoning, rational descriptions of landscape character types, and targeted management measures to achieve the dual objectives of zoning and landscape conservation.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    检测机器人在复杂环境下对指针式仪表的检测过程中经常会出现反射现象,这可能导致指针表读数失败。在本文中,提出了一种改进的k-means聚类自适应检测指针表反射区域的方法和一种基于深度学习的机器人姿态控制策略。它主要包括三个步骤:(1)YOLOv5s(YouOnlyLookOncev5-small)深度学习网络用于指针仪表的实时检测。通过使用透视变换对检测到的反射指针仪表进行预处理。然后,将检测结果和深度学习算法与透视变换相结合。(2)基于采集的指针表图像的YUV(亮度-带宽-色度)颜色空间信息,得到亮度分量直方图及其峰谷信息的拟合曲线。然后,k-means算法根据该信息进行改进,自适应地确定其最优聚类数和初始聚类中心。此外,基于改进的k-means聚类算法对指针仪表图像进行反射检测。(3)机器人位姿控制策略,包括它的移动方向和距离,可以确定消除反射区域。最后,搭建了检测机器人检测平台,对所提检测方法的性能进行了实验研究。实验结果表明,该方法不仅具有较好的检测精度,达到0.809,而且检测时间短,与文献中可用的其他方法相比,仅为0.6392s。本文的主要贡献是为检测机器人避免周向反射提供了理论和技术参考。它可以自适应和准确地检测指针仪表的反射区域,并可以通过控制检测机器人的运动来快速去除它们。该检测方法对于实现复杂环境下巡检机器人指针仪表的实时反射检测和识别具有潜在的应用价值。
    Reflective phenomena often occur in the detecting process of pointer meters by inspection robots in complex environments, which can cause the failure of pointer meter readings. In this paper, an improved k-means clustering method for adaptive detection of pointer meter reflective areas and a robot pose control strategy to remove reflective areas are proposed based on deep learning. It mainly includes three steps: (1) YOLOv5s (You Only Look Once v5-small) deep learning network is used for real-time detection of pointer meters. The detected reflective pointer meters are preprocessed by using a perspective transformation. Then, the detection results and deep learning algorithm are combined with the perspective transformation. (2) Based on YUV (luminance-bandwidth-chrominance) color spatial information of collected pointer meter images, the fitting curve of the brightness component histogram and its peak and valley information is obtained. Then, the k-means algorithm is improved based on this information to adaptively determine its optimal clustering number and its initial clustering center. In addition, the reflection detection of pointer meter images is carried out based on the improved k-means clustering algorithm. (3) The robot pose control strategy, including its moving direction and distance, can be determined to eliminate the reflective areas. Finally, an inspection robot detection platform is built for experimental study on the performance of the proposed detection method. Experimental results show that the proposed method not only has good detection accuracy that achieves 0.809 but also has the shortest detection time, which is only 0.6392 s compared with other methods available in the literature. The main contribution of this paper is to provide a theoretical and technical reference to avoid circumferential reflection for inspection robots. It can adaptively and accurately detect reflective areas of pointer meters and can quickly remove them by controlling the movement of inspection robots. The proposed detection method has the potential application to realize real-time reflection detection and recognition of pointer meters for inspection robots in complex environments.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    在这些年里,更加贴近实际应用环境的异构无线传感器网络三维节点覆盖成为研究的重点。然而,传统的二维平面覆盖方法直接应用于三维空间,覆盖率低,和短生命周期。大多数方法在考虑覆盖时都忽略了网络生命周期。网络覆盖和生命周期决定了异构无线传感器网络中的服务质量(QoS)。因此,节能覆盖增强是一项非常关键和具有挑战性的任务。为了解决上述任务,一种节能的覆盖增强方法,VKECE-3D,提出了基于3D-Voronoi分割和K-means算法的算法。活动节点的数量保持在最低限度,同时保证覆盖。首先,基于随机的节点部署,使用高破坏性多项式变异策略对节点进行两次部署,以提高节点的均匀性。其次,利用K-means算法和3D-Voronoi分区计算最优感知半径,以提高网络覆盖质量。最后,提出了一种多跳通信和轮询工作机制,以降低节点的能耗并延长网络的寿命。其仿真结果表明,与其他节能覆盖增强解决方案相比,VKECE-3D提高了网络覆盖并大大延长了网络的寿命。
    During these years, the 3D node coverage of heterogeneous wireless sensor networks that are closer to the actual application environment has become a strong focus of research. However, the direct application of traditional two-dimensional planar coverage methods to three-dimensional space suffers from high application complexity, a low coverage rate, and a short life cycle. Most methods ignore the network life cycle when considering coverage. The network coverage and life cycle determine the quality of service (QoS) in heterogeneous wireless sensor networks. Thus, energy-efficient coverage enhancement is a significantly pivotal and challenging task. To solve the above task, an energy-efficient coverage enhancement method, VKECE-3D, based on 3D-Voronoi partitioning and the K-means algorithm is proposed. The quantity of active nodes is kept to a minimum while guaranteeing coverage. Firstly, based on node deployment at random, the nodes are deployed twice using a highly destructive polynomial mutation strategy to improve the uniformity of the nodes. Secondly, the optimal perceptual radius is calculated using the K-means algorithm and 3D-Voronoi partitioning to enhance the network coverage quality. Finally, a multi-hop communication and polling working mechanism are proposed to lower the nodes\' energy consumption and lengthen the network\'s lifetime. Its simulation findings demonstrate that compared to other energy-efficient coverage enhancement solutions, VKECE-3D improves network coverage and greatly lengthens the network\'s lifetime.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    早期准确诊断急性心肌梗死(AMI)可显著降低患者死亡率。发现多种miRNA在AMI患者中失调,但特定miRNA的上调或下调在早期可能并不明显,很难做到准确诊断。这里,提出了没有光谱串扰的DNA光子线(PW)将成为miRNA联合分析的极好模板的设计,我们报道了一个miRNA添加探针的构建,用于两种上调的miRNA(miR-133a和miR-208a)的加性分析,用于临床血清样本中AMI的早期诊断.构建了三染料非串扰DNAPW,形成两步荧光共振能量转移(FRET)级联系统,其中三个路径可以阻断FRET级联,用于两个miRNA的单独或相加分析。进一步利用K-Means聚类算法对miRNA添加探针的输出信号进行分类,在临床血清样品的训练(n=40)和验证(n=19)队列中实现早期AMI的100%准确诊断。
    Early and accurate diagnosis of acute myocardial infarction (AMI) can significantly reduce patient mortality. A variety of miRNAs are found to dysregulate in AMI patients, but the up- or down-regulation of a specific miRNA may not be evident in the early stage, making it difficult to achieve accurate diagnosis. Here, proposing the design that DNA photonic wire (PW) with no spectral crosstalk would make an excellent template for miRNA conjoint analysis, we report the construction of a miRNA addition probe for the additive analysis of two up-regulated miRNAs (miR-133a and miR-208a) for early diagnosis of AMI in clinical serum samples. A three-dye non-crosstalk DNA PW is built to form the two-step fluorescence resonance energy transfer (FRET) cascade system, in which three paths can blocking the FRET cascade for separate or additive analysis of the two miRNAs. K-Means clustering algorithm is further utilized to classify the output signals of the miRNA addition probe, achieving a 100% accurate diagnosis of early AMI in both the training (n = 40) and validation (n = 19) cohorts of clinical serum samples.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    了解降雨特征对城市雨水水质的影响对雨水管理具有重要意义。尽管已经进行了大量尝试来研究降雨与城市雨水质量之间的关系,开发的知识可能难以应用于商业雨水管理模型。提出了一种数据挖掘框架来研究降雨特征对雨水水质的影响。开发了一种基于降雨类型的校准方法,以提高水质模型的性能。具体来说,利用主成分分析和相关分析研究了降雨特征与雨水水质的关系。根据选定的降雨特征,使用K均值聚类方法对降雨事件进行分类。针对每种降雨类型独立校准了基于降雨类型(RTB)的模型,以获得雨水水质模型的最佳参数集。结果表明,之前的干旱天数,平均降雨强度,降雨持续时间是影响总悬浮固体事件平均浓度(EMC)的最关键降雨特征,总氮,和总磷,而总降雨量被发现的重要性微不足道。K-means方法有效地将降雨事件分为四种类型,可以代表研究区域的降雨特征。基于降雨类型的校准方法可以大大提高水质模型的准确性。与传统的连续仿真模型相比,在校准期间,RTB模型的相对误差降低了11.4%至16.4%。校准后的雨水水质参数可以传输到具有类似特征的相邻集水区。
    Understanding the impact of rainfall characteristics on urban stormwater quality is important for stormwater management. Even though significant attempts have been undertaken to study the relationship between rainfall and urban stormwater quality, the knowledge developed may be difficult to apply in commercial stormwater management models. A data mining framework was proposed to study the impacts of rainfall characteristics on stormwater quality. A rainfall type-based calibration approach was developed to improve water quality model performance. Specifically, the relationship between rainfall characteristics and stormwater quality was studied using principal component analysis and correlation analysis. Rainfall events were classified using a K-means clustering method based on the selected rainfall characteristics. A rainfall type-based (RTB) model was independently calibrated for each rainfall type to obtain optimal parameter sets of stormwater quality models. The results revealed that antecedent dry days, average rainfall intensity, and rainfall duration were the most critical rainfall characteristics affecting the event mean concentrations (EMCs) of total suspended solids, total nitrogen, and total phosphorus, while total rainfall was found to be of negligible importance. The K-means method effectively clustered the rainfall events into four types that could represent the rainfall characteristics in the study areas. The rainfall type-based calibration approach can considerably improve water quality model accuracy. Compared to the traditional continuous simulation model, the relative error of the RTB model was reduced by 11.4 % to 16.4 % over the calibration period. The calibrated stormwater quality parameters can be transferred to adjacent catchments with similar characteristics.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    由于大量的多径效应和非视距(NLOS)信号接收,在高度城市化的地区,GNSS定位解决方案的精度和可靠性会严重下降,对GNSS/INS组合导航性能产生负面影响。因此,提出了一种基于K-means聚类算法的车辆GNSS/INS组合定位的多径/NLOS检测方法。它综合考虑了从GNSS原始观测中得出的不同特征参数,比如卫星仰角,载波噪声比,伪距残差,和伪距率一致性,以有效地对GNSS信号进行分类。针对不同GNSS信号对定位结果的影响,利用K-means聚类算法将观测数据分为两大类:直接信号和间接信号(包括多径信号和NLOS信号)。然后,多径/NLOS信号与观测数据分离。最后,本文利用实测车辆GNSS/INS观测数据,包括离线数据集和在线数据集,验证基于双差伪距定位的信号分类的准确性。在典型的城市场景中进行的一系列实验表明,与传统的GNSS/INS组合导航相比,该方法可以显着提高定位精度。排除GNSS异常值后,离线数据集的定位精度在水平和垂直方向上分别提高了16%和85%,分别,在线数据集的定位精度在两个方向上分别提高了21%和41%。这种方法不依赖于外部地理信息数据和其他传感器,具有较好的实用性和环境适应性。
    Due to the massive multipath effects and non-line-of-sight (NLOS) signal receptions, the accuracy and reliability of GNSS positioning solution can be severely degraded in a highly urbanized area, which has a negative impact on the performance of GNSS/INS integrated navigation. Therefore, this paper proposes a multipath/NLOS detection method based on the K-means clustering algorithm for vehicle GNSS/INS integrated positioning. It comprehensively considers different feature parameters derived from GNSS raw observations, such as the satellite-elevation angle, carrier-to-noise ratio, pseudorange residual, and pseudorange rate consistency to effectively classify GNSS signals. In view of the influence of different GNSS signals on positioning results, the K-means clustering algorithm is exploited to divide the observation data into two main categories: direct signals and indirect signals (including multipath and NLOS signals). Then, the multipath/NLOS signal is separated from the observation data. Finally, this paper uses the measured vehicle GNSS/INS observation data, including offline dataset and online dataset, to verify the accuracy of signal classification based on double-differenced pseudorange positioning. A series of experiments conducted in typical urban scenarios demonstrate that the proposed method could ameliorate the positioning accuracy significantly compared with the conventional GNSS/INS integrated navigation. After excluding GNSS outliers, the positioning accuracy of the offline dataset is improved by 16% and 85% in the horizontal and vertical directions, respectively, and the positioning accuracy of the online dataset is improved by 21% and 41% in the two directions. This method does not rely on external geographic information data and other sensors, which has better practicability and environmental adaptability.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    自2019年以来,COVID-19在全球范围内蔓延,并对整个社会造成严重破坏。从宏观角度研究不同国家的一些社会相关指标的变化是有意义的。
    我们收集了9项社会相关指标和COVID安全性评估得分。使用三个时间序列模型进行数据分析。特别是,采用预测校正程序来探索大流行对发达国家和发展中国家指数的影响。
    这表明COVID-19的流行对居民的生活产生了多方面的影响,特别是在生活质量方面,购买力,和安全。聚类分析和双变量统计分析进一步表明,发达国家和发展中国家受大流行影响的指标不同。
    这场流行病在许多方面改变了居民的生活。我们进一步的研究表明,发达国家和发展中国家的社会相关指数的影响是不同的,这与他们的流行病严重程度和控制措施有关。另一方面,气候对控制COVID-19至关重要。因此,探索社会相关指标的变化具有重要意义,有利于为各国提供有针对性的治理战略。我们的文章将有助于不同发展水平的国家在努力控制这一流行病的同时,更加关注社会变化,并采取及时有效的措施来调整社会变化。
    The COVID-19 has been spreading globally since 2019 and causes serious damage to the whole society. A macro perspective study to explore the changes of some social-related indexes of different countries is meaningful.
    We collected nine social-related indexes and the score of COVID-safety-assessment. Data analysis is carried out using three time series models. In particular, a prediction-correction procedure was employed to explore the impact of the pandemic on the indexes of developed and developing countries.
    It shows that COVID-19 epidemic has an impact on the life of residents in various aspects, specifically in quality of life, purchasing power, and safety. Cluster analysis and bivariate statistical analysis further indicate that indexes affected by the pandemic in developed and developing countries are different.
    This pandemic has altered the lives of residents in many ways. Our further research shows that the impacts of social-related indexes in developed and developing countries are different, which is bounded up with their epidemic severity and control measures. On the other hand, the climate is crucial for the control of COVID-19. Consequently, exploring the changes of social-related indexes is significative, and it is conducive to provide targeted governance strategies for various countries. Our article will contribute to countries with different levels of development pay more attention to social changes and take timely and effective measures to adjust social changes while trying to control this pandemic.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号