Cluster Analysis of COVID-19 Through Genome Sequences Using Python Bioinformatics Library

Abstract

Introduction and Importance of Study: During the COVID-19 pandemic, mortality rates varied across different regions of the world. To better understand the virus's behavior, it's important to gain in-depth knowledge of the nucleotide records of the COVID-19 genomic sequence. Novelty Statement: In the study, researchers analyzed clusters of highly affected countries to find similar codons in countries with a similar effect of the virus. Material And Method: Nucleotide records are extracted from the NCBI database in FASTA format. Further Python Bioinformatics library is used to form the clusters of each country using the K-means clustering technique. Result and Discussion: The study focuses on finding the similarities between the codons of amino acids in different countries that are affected in a similar way during COVID-19. For instance, China and the EU have a lower mortality rate and have Leucine, Methionine, Isoleucine, and Valine amino acids in common. On the other hand, countries like Pakistan and India have Leucine, Isoleucine, Valine, and Threonine in common and an average death rate. Moreover, Brazil and the US have a higher mortality rate and share similar codons such as Leucine, Glutamine, and Amber. Concluding Remarks: The study shows that countries affected by COVID-19 in a similar way share some common amino acids and their respective codons.

Authors and Affiliations

Maryam Ghauri, Naeem Ahmed Mahoto, Sania Bhatti, Aqsa Umar

Keywords

Related Articles

Hybrid Technique forEstimating Fetal Head Circumference Using Ultrasound Imaging

Introduction: Analysis of fetal head shape is crucial for assessing head growth and detecting abnormalities in fetuses. In traditional clinical practice, Head Circumference (HC) is determined by manually fitting an ell...

AI-Driven Weed Classification for Improved Cotton Farming in Sindh, Pakistan

This research study proclaims the combination of artificial intelligence and also IoT in precision agriculture, highlighting weed discovery plus cotton plant monitoring in Sindh, Pakistan. The uniqueness lies in creati...

A ReviewBased on Active Research Areas in Mining Software Bug Repositories: Limitations and Possible Future Trends

Introduction/ Importance of Study: Bug repository mining is a crucial research area in software engineering, analyzing software change trends, defect prediction, and evolution. It involves developing methods and tools...

Machine Learning-Based Asthma Diagnosis Prediction Using Lung Function and Demographic Features

Asthma is a prevalent chronic respiratory disease, which poses significant diagnostic challenges because of its multifactorial nature. This study aims to develop a machine-learning approach for predicting asthma diagno...

Leveraging Cryptographic Primitives of Blockchain for Trust in Smart Systems

Calculating and maintaining trust using Hyperledger Fabric in smart systems plays a vital role in mitigating various trust-related attacks. Current smart systems encounter several challenges, including dependence on ce...

Download PDF file
  • EP ID EP760307
  • DOI -
  • Views 23
  • Downloads 0

How To Cite

Maryam Ghauri, Naeem Ahmed Mahoto, Sania Bhatti, Aqsa Umar (2024). Cluster Analysis of COVID-19 Through Genome Sequences Using Python Bioinformatics Library. International Journal of Innovations in Science and Technology, 6(1), -. https://www.europub.co.uk/articles/-A-760307