
  • Article type: Journal Article
    This paper presents a framework for the automated diagnosis of gastric cancer using artificial intelligence. The proposed approach uses a customized MobileNetV2 deep learning model optimized with a Dynamic variant of the Pelican Optimization Algorithm (DPOA). Combining these techniques yields highly accurate results on a dataset of endoscopic gastric images. To evaluate the model on this benchmark, the data are divided into training (80 %) and testing (20 %) sets. The MobileNetV2/DPOA model achieved an accuracy of 97.73 %, precision of 97.88 %, specificity of 97.72 %, sensitivity of 96.35 %, Matthews Correlation Coefficient (MCC) of 96.58 %, and F1-score of 98.41 %. These results surpassed those obtained by other well-known models, such as Convolutional Neural Networks (CNN), Mask Region-Based Convolutional Neural Networks (Mask R-CNN), U-Net, Deep Stacked Sparse Autoencoder Neural Networks (SANNs), and DeepLab v3+, on most quantitative metrics. Despite the promising outcomes, further research is needed: larger and more diverse datasets as well as exhaustive clinical validation are necessary to confirm the effectiveness of the proposed method. Applying this approach to the detection of gastric cancer could enhance the speed and accuracy of diagnosis, leading to improved patient care and better allocation of healthcare resources.






  • Article type: Journal Article
    Diseases in cows are a major source of concern. Some animal diseases can be treated if they are discovered in their early stages. If lumpy skin disease (LSD) is not properly treated, it can result in significant financial losses for the livestock industry. Animals such as cows that contract this disease have their skin seriously affected. Reduced milk production, reduced fertility, growth retardation, miscarriage, and occasionally death are all detrimental effects of this disease in cows. Over the past three months, LSD has affected thousands of cattle in nearly fifty districts across Bangladesh, causing cattle farmers to worry about their livelihood. Although the virus is highly contagious, affected cattle can be cured after receiving the right care for a few months. The goal of this study was to use various deep learning and machine learning models to determine whether or not cows had lumpy skin disease. To this end, a novel architecture based on a convolutional neural network (CNN) is proposed for detecting the illness. The area affected by the disease was identified using image preprocessing and segmentation techniques. After the extraction of numerous features, the proposed model was evaluated on LSD classification. Four CNN models, DenseNet, MobileNetV2, Xception, and InceptionResNetV2, were used for classification, and evaluation metrics were computed to determine how well the classifiers worked. MobileNetV2 achieved 96% classification accuracy and an AUC score of 98%, results that appear good and promising when compared with recently published relevant works.






  • Article type: Journal Article
    Machine learning and computer vision have proven to be valuable tools that help farmers streamline their resource utilization, leading to more sustainable and efficient agricultural production. These techniques have been applied to strawberry cultivation in the past with limited success. To build on this past work, in this study, two separate sets of strawberry images, along with their associated diseases, were collected and subjected to resizing and augmentation. Subsequently, a combined dataset consisting of nine classes was used to fine-tune three distinct pretrained models: vision transformer (ViT), MobileNetV2, and ResNet18. To address the imbalanced class distribution in the dataset, each class was assigned a weight to ensure nearly equal impact during the training process. To enhance the outcomes, new images were generated by removing backgrounds, reducing noise, and flipping the originals. Task-specific customization was applied to all three models, and their performances were assessed and compared. Throughout this experiment, none of the layers were frozen, so all layers remained trainable. Attention heads were incorporated into the first five and last five layers of MobileNetV2 and ResNet18, while the architecture of ViT was modified. The results indicated accuracies of 98.4%, 98.1%, and 97.9% for ViT, MobileNetV2, and ResNet18, respectively. Despite the data being imbalanced, the precision, which indicates the proportion of correctly identified positive instances among all predicted positive instances, approached 99% with the ViT. MobileNetV2 and ResNet18 demonstrated similar results. Overall, the analysis revealed that the vision transformer model exhibited superior performance in strawberry ripeness and disease classification. The inclusion of attention heads in the early layers of ResNet18 and MobileNetV2, along with the inherent attention mechanism in ViT, improved the accuracy of image identification. These findings offer the potential for farmers to enhance strawberry cultivation through passive camera monitoring alone, promoting the health and well-being of the population.
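
    The abstract does not say how the per-class weights were computed. One common choice, shown here purely as an assumption, is inverse-frequency ("balanced") weighting, where each class receives the weight N / (K * n_c) for N samples, K classes, and n_c samples in class c. The class names and counts below are hypothetical.

```python
from collections import Counter

def balanced_class_weights(labels):
    """Return inverse-frequency weights: n_samples / (n_classes * class_count)."""
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {cls: n_samples / (n_classes * count) for cls, count in counts.items()}

# Hypothetical, deliberately imbalanced label list (not the paper's nine classes).
labels = ["healthy"] * 60 + ["leaf_spot"] * 30 + ["gray_mold"] * 10
weights = balanced_class_weights(labels)

for cls, weight in weights.items():
    print(f"{cls}: weight={weight:.3f}")
```

    With these weights, the product of class count and class weight is the same for every class, so each class contributes equally to a weighted loss regardless of how many images it has.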






  • Article type: Journal Article
    Accurate and prompt early detection of plant leaf diseases is crucial for safeguarding agricultural crop productivity and ensuring food security. During their life cycle, plant leaves become diseased because of multiple factors such as bacteria, fungi, and weather conditions. In this work, the authors propose a model that aids in the early detection of leaf diseases: a novel hierarchical residual vision transformer built from improved Vision Transformer and ResNet9 models. The proposed model can extract more meaningful and discriminating details while reducing the number of trainable parameters and computations. The proposed method is evaluated on the Local Crop dataset, Plant Village dataset, and Extended Plant Village Dataset with 13, 38, and 51 different leaf disease classes, respectively. The proposed model is trained using the best trial parameters of the Improved Vision Transformer, and the features are classified using ResNet9. Performance is evaluated across a wide range of aspects on the aforementioned datasets, and the results reveal that the proposed model outperforms other models such as InceptionV3, MobileNetV2, and ResNet50.






  • Article type: Journal Article
    OBJECTIVE: Uterine fibroids (UF) are the most frequent tumors in women and can pose a serious risk of complications such as miscarriage. Diagnostic accuracy may also be affected by physician inexperience and fatigue, underscoring the need for automatic classification models that can analyze UF from a large number of images.
    METHODS: A hybrid model is proposed that combines the MobileNetV2 network and deep convolutional generative adversarial networks (DCGAN) to aid medical practitioners in identifying UF and evaluating its characteristics. Real-time automated classification of UF can aid in diagnosing the condition and minimizing subjective errors. The DCGAN technique is used for advanced data augmentation to create high-quality UF images, which are labeled into UF and non-uterine-fibroid (NUF) classes. The MobileNetV2 model then accurately classifies the images based on this data.
    RESULTS: The performance of the hybrid model is compared with that of other models. The hybrid model achieves a real-time classification speed of 40 frames per second (FPS), an accuracy of 97.45%, and an F1 score of 0.9741.
    CONCLUSIONS: This deep learning hybrid approach addresses the shortcomings of current classification methods for uterine fibroids.






  • Article type: Journal Article
    This paper proposes an improved strategy for the MobileNetV2 neural network (I-MobileNetV2) in response to problems such as the large parameter counts of existing deep convolutional neural networks and the shortcomings of the lightweight MobileNetV2 network in facial emotion recognition tasks, such as easy loss of feature information, poor real-time performance, and low accuracy. The network inherits MobileNetV2's depthwise separable convolution, which reduces the computational load while keeping the model lightweight. It uses a reverse fusion mechanism to retain negative features, which makes information less likely to be lost. The SELU activation function replaces the ReLU6 activation function to avoid vanishing gradients. Meanwhile, to improve the feature recognition capability, a channel attention mechanism (Squeeze-and-Excitation Networks (SE-Net)) is integrated into the MobileNetV2 network. Experiments on the facial expression datasets FER2013 and CK+ showed that the proposed network achieved facial expression recognition accuracies of 68.62% and 95.96%, improving upon the MobileNetV2 model by 0.72% and 6.14%, respectively, while the parameter count decreased by 83.8%. These results empirically verify the effectiveness of the improvements made to the network model.
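
    To make the cost saving of depthwise separable convolution concrete, the sketch below counts the weights of a single standard convolution layer and of its depthwise separable counterpart. The layer sizes are hypothetical, biases are ignored, and the calculation is not the source of the 83.8% reduction reported in the abstract, which concerns the whole network.

```python
def standard_conv_params(kernel, c_in, c_out):
    """Weights in a standard kernel x kernel convolution (biases ignored)."""
    return kernel * kernel * c_in * c_out

def depthwise_separable_params(kernel, c_in, c_out):
    """Weights in a depthwise convolution followed by a 1x1 pointwise convolution."""
    depthwise = kernel * kernel * c_in   # one kernel x kernel filter per input channel
    pointwise = c_in * c_out             # 1x1 convolution that mixes channels
    return depthwise + pointwise

# Hypothetical layer: 3x3 kernel, 32 input channels, 64 output channels.
kernel, c_in, c_out = 3, 32, 64
std = standard_conv_params(kernel, c_in, c_out)
sep = depthwise_separable_params(kernel, c_in, c_out)
ratio = sep / std

print(f"standard: {std}, separable: {sep}, ratio: {ratio:.4f}")
```

    The ratio equals 1/c_out + 1/kernel^2, so for 3x3 kernels and wide layers the separable form needs a little over one ninth of the weights of the standard form.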






  • Article type: Journal Article
    BACKGROUND: A growing number of genetic and metabolic abnormalities are now known to cause cancer, which is often deadly. Cancerous cells may affect any part of the body, which can be fatal. Skin cancer is one of the most prevalent types of cancer, and its prevalence is rising across the globe. The primary subtypes of skin cancer are squamous and basal cell carcinomas and melanoma, which is clinically aggressive and causes the majority of deaths. Screening for skin cancer is therefore essential.
    METHODS: The best way to quickly and precisely detect skin cancer is by using deep learning techniques. In this research, deep learning techniques such as MobileNetV2 and DenseNet are used to identify two main kinds of tumors: malignant and benign. The HAM10000 dataset is used for this research. This dataset consists of 10,000 skin lesion images covering nonmelanocytic and melanocytic tumors. The two techniques are used to distinguish malignant from benign lesions. The methods are compared, and a conclusion is drawn from their performance.
    RESULTS: After model evaluation, the accuracy was 85% for MobileNetV2 and 95% for the customized CNN. A web application was developed with a Python framework that provides a graphical user interface to the best-trained model. The graphical user interface allows the user to enter the patient details and upload the lesion image. The image is classified with the trained model, which predicts whether the uploaded image is cancerous or non-cancerous. The web application also displays the percentage of cancer affected.
    CONCLUSIONS: In the comparison between the two techniques, the customized CNN gives higher accuracy for the detection of melanoma.






  • Article type: Journal Article
    The ability to recognize the surface type is crucial for both indoor and outdoor mobile robots. Knowing the surface type can help indoor mobile robots move more safely and adjust their movement accordingly. However, recognizing surface characteristics is challenging since similar planes can appear substantially different; for instance, carpets come in various types and colors. To address this inherent uncertainty in vision-based surface classification, this study first generates a new, unique data set composed of 2,081 surface images (carpet, tiles, and wood) captured in different indoor environments. Secondly, the pre-trained state-of-the-art deep learning models, namely InceptionV3, VGG16, VGG19, ResNet50, Xception, InceptionResNetV2, and MobileNetV2, were utilized to recognize the surface type. Additionally, a lightweight MobileNetV2-modified model was proposed for surface classification. The proposed model has approximately four times fewer total parameters than the original MobileNetV2 model, reducing the size of the trained model weights from 42 MB to 11 MB. Thus, the proposed model can be used in robotic systems with limited computational capacity and embedded systems. Lastly, several optimizers, such as SGD, RMSProp, Adam, Adadelta, Adamax, Adagrad, and Nadam, are applied to distinguish the most efficient network. Experimental results demonstrate that the proposed model outperforms all other applied methods and existing approaches in the literature by achieving 99.52% accuracy and an average score of 99.66% in precision, recall, and F1-score. In addition to this, the proposed lightweight model was tested in real-time on a mobile robot in 11 scenarios consisting of various indoor environments such as offices, hallways, and homes, resulting in an accuracy of 99.25%. Finally, each model was evaluated in terms of model loading time and processing time. The proposed model requires less loading and processing time than the other models.






  • Article type: Journal Article
    This research paper presents an approach to brain tumor diagnosis from MRI scans that combines deep learning with a metaheuristic algorithm. The study employs MobileNetV2, a deep learning model, optimized by a novel metaheuristic known as the Contracted Fox Optimization Algorithm (MN-V2/CFO). This methodology allows for the optimal selection of MobileNetV2 hyperparameters, enhancing the accuracy of tumor detection. The model is implemented on the Figshare dataset, a comprehensive collection of MRI scans, and its performance is validated against other methods: the results are compared with published works including Network (RN), wavelet transform and deep learning (WT/DL), customized VGG19, and Convolutional Neural Network (CNN). The results of the study highlight the superior performance of the proposed MN-V2/CFO model compared to the other methods. The proposed strategy achieves a precision of 97.68 %, an F1-score of 86.22 %, a sensitivity of 80.12 %, and an accuracy of 97.32 %. The findings support the potential of the proposed model to improve brain tumor diagnosis, contributing to better treatment strategies and improved patient outcomes.






  • Article type: Journal Article
    Breast Cancer (BC) is one of the leading causes of death in women worldwide. As a result, timely identification is critical for successful therapy and high survival rates. Transfer Learning (TL) approaches have recently shown promise in aiding the early recognition of BC. In this work, three TL models, MobileNetV2, ResNet50, and VGG16, were combined with LSTM to extract features from Ultrasound Images (USIs). Furthermore, the Synthetic Minority Over-sampling Technique (SMOTE) with Tomek (SMOTETomek) was employed to balance the extracted features. The proposed method with VGG16 achieved an F1 score of 99.0 %, a Matthews Correlation Coefficient (MCC) and Kappa Coefficient of 98.9 %, and an Area Under Curve (AUC) of 1.0. The K-fold method was applied for cross-validation and achieved an average F1 score of 96 %. Moreover, the Gradient-weighted Class Activation Mapping (Grad-CAM) method was applied for visualization, and the Local Interpretable Model-agnostic Explanations (LIME) method was applied for interpretability. The Normal Approximation Interval (NAI) and bootstrapping methods were used to calculate Confidence Intervals (CIs). The proposed method achieved a Lower CI (LCI), Upper CI (UCI), and Mean CI (MCI) of 96.50 %, 99.75 %, and 98.13 %, respectively, with the NAI, and a 95 % LCI of 93.81 %, a UCI of 96.00 %, and a bootstrap mean of 94.90 % with the bootstrap method. Furthermore, the performance of six state-of-the-art (SOTA) TL models, namely Xception, NASNetMobile, InceptionResNetV2, MobileNetV2, ResNet50, and VGG16, was compared with that of the proposed method.
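
    The two confidence-interval methods named in this abstract can be sketched as follows. The abstract does not state which metric or sample size the intervals refer to, so this example assumes test-set accuracy and uses hypothetical per-sample outcomes; the function names, the number of resamples, and the random seed are likewise assumptions.

```python
import math
import random

def normal_approx_interval(accuracy, n, z=1.96):
    """95% normal approximation interval for a proportion measured on n samples."""
    half_width = z * math.sqrt(accuracy * (1 - accuracy) / n)
    return max(0.0, accuracy - half_width), min(1.0, accuracy + half_width)

def bootstrap_interval(correct, n_resamples=2000, alpha=0.05, seed=0):
    """Percentile bootstrap interval and bootstrap mean for accuracy.

    `correct` is a list of 0/1 flags, one per test sample.
    """
    rng = random.Random(seed)
    n = len(correct)
    # Resample the test set with replacement and record the accuracy each time.
    means = sorted(sum(rng.choices(correct, k=n)) / n for _ in range(n_resamples))
    lower = means[int((alpha / 2) * n_resamples)]
    upper = means[int((1 - alpha / 2) * n_resamples) - 1]
    return lower, upper, sum(means) / n_resamples

# Hypothetical outcomes: 190 correct predictions out of 200 test samples.
correct = [1] * 190 + [0] * 10
acc = sum(correct) / len(correct)

nai_low, nai_high = normal_approx_interval(acc, len(correct))
boot_low, boot_high, boot_mean = bootstrap_interval(correct)

print(f"NAI:       [{nai_low:.4f}, {nai_high:.4f}]")
print(f"Bootstrap: [{boot_low:.4f}, {boot_high:.4f}], mean={boot_mean:.4f}")
```

    The normal approximation needs only the point estimate and the sample size, whereas the bootstrap makes no distributional assumption but depends on the number of resamples and the random seed.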





