Oversampling

过采样
  • 文章类型: Journal Article
    背景:支气管肺发育不良相关性肺动脉高压(BPD-PH)仍然是严重影响早产儿治疗结果的严重临床并发症。因此,早期预防和病理改变前的及时诊断是降低发病率和改善预后的关键。我们的主要目标是利用机器学习技术来建立预测模型,以准确识别患有PH风险的BPD婴儿。
    方法:本研究使用的数据来自中国四家三级医院的新生儿科。为了解决数据不平衡的问题,过采样算法采用合成少数过采样技术(SMOTE)对模型进行了改进。
    结果:在我们的研究中收集了761条临床记录。在数据预处理和特征选择之后,46个特征中有5个用于构建模型,包括有创呼吸支持的持续时间(天),BPD的严重程度,呼吸机相关性肺炎,肺出血,和早发性PH。四种机器学习模型被应用于预测学习,经过综合选择,最终选择了一个模型。该模型实现了93.8%的灵敏度,准确率85.0%,和0.933AUC。逻辑回归公式的得分大于0被识别为BPD-PH的警告信号。
    结论:我们综合比较了不同的机器学习模型,最终获得了良好的预后模型,足以支持儿科临床医生对BPD-PH患儿进行早期诊断和制定更好的治疗方案。
    BACKGROUND: Bronchopulmonary dysplasia-associated pulmonary hypertension (BPD-PH) remains a devastating clinical complication seriously affecting the therapeutic outcome of preterm infants. Hence, early prevention and timely diagnosis prior to pathological change is the key to reducing morbidity and improving prognosis. Our primary objective is to utilize machine learning techniques to build predictive models that could accurately identify BPD infants at risk of developing PH.
    METHODS: The data utilized in this study were collected from neonatology departments of four tertiary-level hospitals in China. To address the issue of imbalanced data, oversampling algorithms synthetic minority over-sampling technique (SMOTE) was applied to improve the model.
    RESULTS: Seven hundred sixty one clinical records were collected in our study. Following data pre-processing and feature selection, 5 of the 46 features were used to build models, including duration of invasive respiratory support (day), the severity of BPD, ventilator-associated pneumonia, pulmonary hemorrhage, and early-onset PH. Four machine learning models were applied to predictive learning, and after comprehensive selection a model was ultimately selected. The model achieved 93.8% sensitivity, 85.0% accuracy, and 0.933 AUC. A score of the logistic regression formula greater than 0 was identified as a warning sign of BPD-PH.
    CONCLUSIONS: We comprehensively compared different machine learning models and ultimately obtained a good prognosis model which was sufficient to support pediatric clinicians to make early diagnosis and formulate a better treatment plan for pediatric patients with BPD-PH.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:全基因组关联研究已成功鉴定出与人类疾病相关的遗传变异。最近已经提出了基于惩罚和机器学习方法的各种统计方法用于疾病预测。在这项研究中,我们使用韩国基因组和流行病学研究(KoGES)的韩国芯片(KORV1.1)评估了几种此类方法预测哮喘的性能.
    结果:首先,通过单变异检测,采用logistic回归分析并调整了几个流行病学因素,筛选出单核苷酸多态性.接下来,我们评估了以下疾病预测方法:里奇,最小绝对收缩和选择运算符,弹性网,平滑地削减绝对偏差,支持向量机,随机森林,升压,装袋,天真贝叶斯,和k最近的邻居。最后,我们根据接收器工作特性曲线的曲线下面积比较了它们的预测性能,精度,召回,F1分数,Cohen\'sKappa,平衡精度,错误率,马修斯相关系数,和精确召回率曲线下的面积。此外,三种过采样算法用于处理不平衡问题。
    结论:我们的结果表明,与通过机器学习方法相比,惩罚方法对哮喘表现出更好的预测性能。另一方面,在过抽样研究中,随机森林和增强方法总体上显示出比惩罚方法更好的预测性能。
    BACKGROUND: Genome-wide association studies have successfully identified genetic variants associated with human disease. Various statistical approaches based on penalized and machine learning methods have recently been proposed for disease prediction. In this study, we evaluated the performance of several such methods for predicting asthma using the Korean Chip (KORV1.1) from the Korean Genome and Epidemiology Study (KoGES).
    RESULTS: First, single-nucleotide polymorphisms were selected via single-variant tests using logistic regression with the adjustment of several epidemiological factors. Next, we evaluated the following methods for disease prediction: ridge, least absolute shrinkage and selection operator, elastic net, smoothly clipped absolute deviation, support vector machine, random forest, boosting, bagging, naïve Bayes, and k-nearest neighbor. Finally, we compared their predictive performance based on the area under the curve of the receiver operating characteristic curves, precision, recall, F1-score, Cohen\'s Kappa, balanced accuracy, error rate, Matthews correlation coefficient, and area under the precision-recall curve. Additionally, three oversampling algorithms are used to deal with imbalance problems.
    CONCLUSIONS: Our results show that penalized methods exhibit better predictive performance for asthma than that achieved via machine learning methods. On the other hand, in the oversampling study, randomforest and boosting methods overall showed better prediction performance than penalized methods.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Multicenter Study
    背景:机器学习为预测威胁生命提供了新的解决方案,不可预知的胺碘酮引起的甲状腺功能障碍。没有时间序列考虑特征的用于不利影响预测的传统回归方法产生了次优预测。在不同时间点具有多个数据集的机器学习算法可以在预测不利影响方面产生更好的性能。
    目的:我们旨在开发和验证用于预测个体化胺碘酮诱发的甲状腺功能障碍风险的机器学习模型,并通过重采样方法和重新调整临床得出的决策阈值来优化基于机器学习的风险分层方案。
    方法:这项研究使用多中心开发了机器学习模型,删除电子健康记录。包括2013年1月至2017年12月接受胺碘酮治疗的患者。训练集由台北医学大学医院和万芳医院的数据组成,而台北医科大学双河医院的数据被用作外部测试集。该研究首先收集了基线的固定特征和动态特征,第二,第三,第六,第九,12th,15th,18日,胺碘酮启动后21个月。我们使用了16个机器学习模型,包括极端梯度增强,自适应提升,k-最近邻,和逻辑回归模型,以及原始的重采样方法和其他3种重采样方法,包括用边界合成的少数过采样技术进行过采样,欠采样编辑的最近邻,以及过采样和欠采样混合方法。根据精度比较了模型性能;精度,召回,F1分数,几何平均值,接收器工作特性曲线(AUROC)的曲线下面积,以及精确召回率曲线下的面积(AUPRC)。特征重要性由最佳模型确定。重新调整决策阈值以确定最佳临界值,并进行Kaplan-Meier生存分析。
    结果:训练集包含台北医学大学医院和万方医院的4075名患者,其中583人(14.3%)出现胺碘酮诱发的甲状腺功能异常,而外部测试装置包括台北医学大学双河医院的2422名患者,其中275人(11.4%)发生胺碘酮诱导的甲状腺功能障碍。极端梯度提升过采样机器学习模型在所有16个模型中表现出最佳的预测结果。准确性;精度,召回,F1分数,G-mean,AUPRC,AUROC分别为0.923、0.632、0.756、0.688、0.845、0.751和0.934。在重新调整截止线后,最佳值为0.627,F1评分达到0.699。最佳阈值能够将2422名患者中的286名(11.8%)归类为高危受试者,其中275例为检测组中的真阳性患者.治疗时间较短;促甲状腺激素和高密度脂蛋白胆固醇水平较高;游离甲状腺素水平较低,碱性磷酸酶,低密度脂蛋白是最重要的特征。
    结论:机器学习模型结合重采样方法可以预测胺碘酮引起的甲状腺功能障碍,并可作为个体化风险预测和临床决策支持的支持工具。
    Machine learning offers new solutions for predicting life-threatening, unpredictable amiodarone-induced thyroid dysfunction. Traditional regression approaches for adverse-effect prediction without time-series consideration of features have yielded suboptimal predictions. Machine learning algorithms with multiple data sets at different time points may generate better performance in predicting adverse effects.
    We aimed to develop and validate machine learning models for forecasting individualized amiodarone-induced thyroid dysfunction risk and to optimize a machine learning-based risk stratification scheme with a resampling method and readjustment of the clinically derived decision thresholds.
    This study developed machine learning models using multicenter, delinked electronic health records. It included patients receiving amiodarone from January 2013 to December 2017. The training set was composed of data from Taipei Medical University Hospital and Wan Fang Hospital, while data from Taipei Medical University Shuang Ho Hospital were used as the external test set. The study collected stationary features at baseline and dynamic features at the first, second, third, sixth, ninth, 12th, 15th, 18th, and 21st months after amiodarone initiation. We used 16 machine learning models, including extreme gradient boosting, adaptive boosting, k-nearest neighbor, and logistic regression models, along with an original resampling method and 3 other resampling methods, including oversampling with the borderline-synthesized minority oversampling technique, undersampling-edited nearest neighbor, and over- and undersampling hybrid methods. The model performance was compared based on accuracy; Precision, recall, F1-score, geometric mean, area under the curve of the receiver operating characteristic curve (AUROC), and the area under the precision-recall curve (AUPRC). Feature importance was determined by the best model. The decision threshold was readjusted to identify the best cutoff value and a Kaplan-Meier survival analysis was performed.
    The training set contained 4075 patients from Taipei Medical University Hospital and Wan Fang Hospital, of whom 583 (14.3%) developed amiodarone-induced thyroid dysfunction, while the external test set included 2422 patients from Taipei Medical University Shuang Ho Hospital, of whom 275 (11.4%) developed amiodarone-induced thyroid dysfunction. The extreme gradient boosting oversampling machine learning model demonstrated the best predictive outcomes among all 16 models. The accuracy; Precision, recall, F1-score, G-mean, AUPRC, and AUROC were 0.923, 0.632, 0.756, 0.688, 0.845, 0.751, and 0.934, respectively. After readjusting the cutoff, the best value was 0.627, and the F1-score reached 0.699. The best threshold was able to classify 286 of 2422 patients (11.8%) as high-risk subjects, among which 275 were true-positive patients in the testing set. A shorter treatment duration; higher levels of thyroid-stimulating hormone and high-density lipoprotein cholesterol; and lower levels of free thyroxin, alkaline phosphatase, and low-density lipoprotein were the most important features.
    Machine learning models combined with resampling methods can predict amiodarone-induced thyroid dysfunction and serve as a support tool for individualized risk prediction and clinical decision support.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    从可穿戴设备收集的加速度计数据最近已用于监测日常生活中的身体活动(PA)。虽然可以用截止方法区分PA的强度,用相似的加速度测量模式区分不同的行为来估计能量消耗是很重要的。我们的目标是通过提取明确定义的特征并应用欠采样和过采样方法来克服对基于机器学习的PA分类产生负面影响的数据不平衡问题。我们提取了各种时间,光谱,和来自腕部的非线性特征-,hip-,和脚踝磨损的加速度计数据。然后,使用各种ML和DL方法比较了欠采样和过采样的影响。在各种ML和DL模型中,包括随机森林(RF)和自适应增强(AdaBoost)在内的集成方法在区分久坐行为(驾驶)和三种步行类型(在水平地面上行走,上升楼梯,和下降楼梯),即使在跨主题范式中也是如此。欠抽样方法,计算成本低,表现出与多数类无偏见的分类结果。此外,我们发现,RF可以通过检查多个决策树(DT)中每个节点的重要性,根据传感器位置自动选择PA分类的相关特征.这项研究提出,使用定义明确的特征集结合欠采样方法的集成学习对于PA分类中的不平衡数据集具有鲁棒性。这种方法将有助于在自由生活情况下进行PA分类,类之间的数据不平衡问题很常见。
    Accelerometer data collected from wearable devices have recently been used to monitor physical activities (PAs) in daily life. While the intensity of PAs can be distinguished with a cut-off approach, it is important to discriminate different behaviors with similar accelerometry patterns to estimate energy expenditure. We aim to overcome the data imbalance problem that negatively affects machine learning-based PA classification by extracting well-defined features and applying undersampling and oversampling methods. We extracted various temporal, spectral, and nonlinear features from wrist-, hip-, and ankle-worn accelerometer data. Then, the influences of undersampilng and oversampling were compared using various ML and DL approaches. Among various ML and DL models, ensemble methods including random forest (RF) and adaptive boosting (AdaBoost) exhibited great performance in differentiating sedentary behavior (driving) and three walking types (walking on level ground, ascending stairs, and descending stairs) even in a cross-subject paradigm. The undersampling approach, which has a low computational cost, exhibited classification results unbiased to the majority class. In addition, we found that RF could automatically select relevant features for PA classification depending on the sensor location by examining the importance of each node in multiple decision trees (DTs). This study proposes that ensemble learning using well-defined feature sets combined with the undersampling approach is robust for imbalanced datasets in PA classification. This approach will be useful for PA classification in the free-living situation, where data imbalance problems between classes are common.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:作为主要的健康危害,冠心病的发病率逐年上升。虽然冠状动脉血运重建,主要是经皮冠状动脉介入治疗,在冠心病的治疗中发挥了重要作用,冠状动脉血运重建后的复发或持续性心绞痛等主要不良心血管事件(MACE)在临床实践中仍然是一个非常困难的问题.
    目的:鉴于冠状动脉血运重建后发生MACE的概率较高,本研究的目的是开发并验证基于机器学习算法的6个月内MACE发生的预测模型.
    方法:回顾性研究纳入2019年6月至2020年12月在辽宁省人民医院和辽宁中医药大学附属医院行冠状动脉血运重建的1004例患者。根据现有数据的特点,初始预处理采用过采样策略。然后我们使用了六种机器学习算法,包括决策树,随机森林,逻辑回归,天真贝叶斯,支持向量机,和极端梯度提升(XGBoost),根据临床信息和6个月随访信息开发MACE预测模型。在所有样本中,随机选择70%进行训练,其余30%用于模型验证。模型性能是根据准确性进行评估的,精度,召回,F1分数,混淆矩阵,接收器工作特征(ROC)曲线(AUC)下面积,和可视化的ROC曲线。
    结果:单变量分析显示,无MACE和有MACE的组之间有21个患者特征变量有统计学意义(P<0.05)。加上这些重要因素,在六种机器学习算法中,XGBoost的准确度为0.7788,精确度为0.8058,召回率为0.7345,F1评分为0.7685,AUC为0.8599。对模型的进一步探索以确定影响MACE发生的因素表明,在三个开发的模型中,抗凝药物的使用和疾病的病程始终排在前两个预测因素中。
    结论:本研究中构建的机器学习风险模型可以实现可接受的MACE预测性能,与XGBoost表现最好的,为MACE预防的针对性干预和临床决策提供有价值的参考。
    BACKGROUND: As a major health hazard, the incidence of coronary heart disease has been increasing year by year. Although coronary revascularization, mainly percutaneous coronary intervention, has played an important role in the treatment of coronary heart disease, major adverse cardiovascular events (MACE) such as recurrent or persistent angina pectoris after coronary revascularization remain a very difficult problem in clinical practice.
    OBJECTIVE: Given the high probability of MACE after coronary revascularization, the aim of this study was to develop and validate a predictive model for MACE occurrence within 6 months based on machine learning algorithms.
    METHODS: A retrospective study was performed including 1004 patients who had undergone coronary revascularization at The People\'s Hospital of Liaoning Province and Affiliated Hospital of Liaoning University of Traditional Chinese Medicine from June 2019 to December 2020. According to the characteristics of available data, an oversampling strategy was adopted for initial preprocessing. We then employed six machine learning algorithms, including decision tree, random forest, logistic regression, naïve Bayes, support vector machine, and extreme gradient boosting (XGBoost), to develop prediction models for MACE depending on clinical information and 6-month follow-up information. Among all samples, 70% were randomly selected for training and the remaining 30% were used for model validation. Model performance was assessed based on accuracy, precision, recall, F1-score, confusion matrix, area under the receiver operating characteristic (ROC) curve (AUC), and visualization of the ROC curve.
    RESULTS: Univariate analysis showed that 21 patient characteristic variables were statistically significant (P<.05) between the groups without and with MACE. Coupled with these significant factors, among the six machine learning algorithms, XGBoost stood out with an accuracy of 0.7788, precision of 0.8058, recall of 0.7345, F1-score of 0.7685, and AUC of 0.8599. Further exploration of the models to identify factors affecting the occurrence of MACE revealed that use of anticoagulant drugs and course of the disease consistently ranked in the top two predictive factors in three developed models.
    CONCLUSIONS: The machine learning risk models constructed in this study can achieve acceptable performance of MACE prediction, with XGBoost performing the best, providing a valuable reference for pointed intervention and clinical decision-making in MACE prevention.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:COVID-19使全世界的卫生系统不堪重负。尽早发现重症病例很重要,这样就可以调动资源,提高治疗水平。
    目的:本研究旨在开发一种基于临床和影像学数据的自动评估COVID-19严重程度的机器学习方法。
    方法:临床数据-包括人口统计学,标志,症状,合并症,以及来自湖北省两家医院的346名患者的血液检查结果和胸部计算机断层扫描扫描,中国,用于开发机器学习模型,以自动评估确诊的COVID-19病例的严重程度。我们比较了来自多个机器学习模型的临床和成像数据的预测能力,并进一步探索了使用四种过采样方法来解决不平衡分类问题。使用Shapley加法解释框架确定具有最高预测能力的特征。
    结果:成像特征对模型输出的影响最大,而结合临床和影像学特征的总体表现最佳。确定的预测特征与以前报道的一致。尽管过采样产生的结果好坏参半,它在我们的研究中取得了最好的模型性能。区分轻度和重度病例的Logistic回归模型在临床特征方面取得了最佳表现(曲线下面积[AUC]0.848;敏感性0.455;特异性0.906),影像学特征(AUC0.926;灵敏度0.818;特异性0.901),以及临床和影像学特征的组合(AUC0.950;敏感性0.764;特异性0.919)。合成少数过采样方法使用组合特征(AUC0.960;灵敏度0.845;特异性0.929)进一步提高了模型的性能。
    结论:临床和影像学特征可用于COVID-19的自动严重程度评估,并有可能帮助对COVID-19患者进行分诊,并优先为那些严重疾病风险较高的患者提供护理。
    BACKGROUND: COVID-19 has overwhelmed health systems worldwide. It is important to identify severe cases as early as possible, such that resources can be mobilized and treatment can be escalated.
    OBJECTIVE: This study aims to develop a machine learning approach for automated severity assessment of COVID-19 based on clinical and imaging data.
    METHODS: Clinical data-including demographics, signs, symptoms, comorbidities, and blood test results-and chest computed tomography scans of 346 patients from 2 hospitals in the Hubei Province, China, were used to develop machine learning models for automated severity assessment in diagnosed COVID-19 cases. We compared the predictive power of the clinical and imaging data from multiple machine learning models and further explored the use of four oversampling methods to address the imbalanced classification issue. Features with the highest predictive power were identified using the Shapley Additive Explanations framework.
    RESULTS: Imaging features had the strongest impact on the model output, while a combination of clinical and imaging features yielded the best performance overall. The identified predictive features were consistent with those reported previously. Although oversampling yielded mixed results, it achieved the best model performance in our study. Logistic regression models differentiating between mild and severe cases achieved the best performance for clinical features (area under the curve [AUC] 0.848; sensitivity 0.455; specificity 0.906), imaging features (AUC 0.926; sensitivity 0.818; specificity 0.901), and a combination of clinical and imaging features (AUC 0.950; sensitivity 0.764; specificity 0.919). The synthetic minority oversampling method further improved the performance of the model using combined features (AUC 0.960; sensitivity 0.845; specificity 0.929).
    CONCLUSIONS: Clinical and imaging features can be used for automated severity assessment of COVID-19 and can potentially help triage patients with COVID-19 and prioritize care delivery to those at a higher risk of severe disease.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    Driving daily through traffic congestion has been recognised as a major cause of stress. High levels of stress while driving negatively impact the driver\'s decisions which could potentially lead to accidents and other long-term health hazards. Accordingly, there is a great need to determine stress levels for drivers based on measuring and predicting the major causes (features or classes) that increase stress levels. In this paper, the problem of predicting automobile drivers\' stress levels, as experienced during actual driving, is investigated through the application of five different data mining algorithms, namely K-Nearest Neighbour (KNN), Decision Tree (J48), Random Forest (RF), Support Vector Machine (SVM), and Artificial Neural Networks (ANN). An experiment was conducted on 14 drivers taking various routes in Amman - Jordan, with a wearable biomedical device attached to the driver to instantly collect physiological data. The collected data (dataset) is grouped into two different categories, namely \'Yes\' to signify the presence of stress and \'No\' to signify the absence of stress. In order to efficiently apply data mining algorithms to the data set, oversampling was used to avoid the negative effect of driver samples with a lesser class on the prediction of stress. The findings are evaluated in relation to stress prediction and accordingly contrasted alongside standard reference approaches that do not consider oversampling and/or feature selection using the Friedman rank test. The proposed approach, in combination with RF, was seen to surpass any others in terms of accuracy, AUC, specificity, and sensitivity. The accuracy, AUC, specificity, and sensitivity rates produced by RF utilising our proposed approach were 98.92%, 99.91%, 98.46%, and 99.36%, respectively.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

  • 文章类型: Journal Article
    Bar pattern phantoms are used to determine the maximum number of line-pairs per mm that an imaging system can resolve. In some cases, a numerical determination of the modulation transfer function (MTF) can also be carried out. However, calculations can only be performed in a relatively small number of frequencies because of the small number of bar groups in the phantom. In this work, a new bar pattern phantom has been simulated. This phantom consists of 66 pairs of lines of different periods and these periods vary exponentially with spatial position, like in a chirp wave. An oversampling procedure has been implemented to obtain the pre-sampled MTF of the system and the results obtained have been compared with those obtained with the edge method, recommended by the IEC. Monte Carlo simulations were carried out for three different levels of noise aimed at investigating the effect of noise on the uncertainties of the MTF determination. In addition, using the analytic expressions for the MTF calculation, statistical fluctuations of noise in phantom images were propagated to MTF values. Despite the smaller size of the chirp phantom, uncertainties in the chirp method are smaller than those of the edge method. For the edge image, the standard deviation of the MTF is proportional to the frequency f, whereas for the chirp method it is proportional to its square root. It is shown that applying an oversampling method allows the use of a single line pair per period without compromising the precision in noisy environments.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

  • 文章类型: Journal Article
    A clustering problem involving multivariate time series (MTS) requires the selection of similarity metrics. This paper shows the limitations of the PCA similarity factor (SPCA) as a single metric in nonlinear problems where there are differences in magnitude of the same process variables due to expected changes in operation conditions. A novel method for clustering MTS based on a combination between SPCA and the average-based Euclidean distance (AED) within a fuzzy clustering approach is proposed. Case studies involving either simulated or real industrial data collected from a large scale gas turbine are used to illustrate that the hybrid approach enhances the ability to recognize normal and fault operating patterns. This paper also proposes an oversampling procedure to create synthetic multivariate time series that can be useful in commonly occurring situations involving unbalanced data sets.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

  • 文章类型: Journal Article
    OBJECTIVE: Epidemiological research on lesbian, gay and bisexual populations raises concerns regarding self-selection and group sizes. The aim of this research was to present strategies used to overcome these challenges in a national population-based web survey of self-reported sexual orientation and living conditions-exemplified with a case of daily tobacco smoking.
    METHODS: The sample was extracted from pre-established national web panels. Utilizing an oversampling strategy, we established a sample including 315 gay men, 217 bisexual men, 789 heterosexual men, 197 lesbian women, 405 bisexual women and 979 heterosexual women. We compared daily smoking, representing three levels of differentiation of sexual orientation for each gender.
    RESULTS: The aggregation of all non-heterosexuals into one group yielded a higher odds ratio (OR) for non-heterosexuals being a daily smoker. The aggregation of lesbian and bisexual women indicated higher OR between this group and heterosexual women. The full differentiation yielded no differences between groups except for bisexual compared with heterosexual women.
    CONCLUSIONS: The analyses demonstrated the advantage of differentiation of sexual orientation and gender, in this case bisexual women were the main source of group differences. We recommend an oversampling procedure, making it possible to avoid self-recruitment and to increase the transferability of findings.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

公众号