An Empirical Evaluation of Error Correction Methods and Tools for Next Generation Sequencing Data

Abstract

Next Generation Sequencing (NGS) technologies produce massive amount of low cost data that is very much useful in genomic study and research. However, data produced by NGS is affected by different errors such as substitutions, deletions or insertion. It is essential to differentiate between true biological variants and alterations occurred due to errors for accurate downstream analysis. Many types of methods and tools have been developed for NGS error correction. Some of these methods only correct substitutions errors whereas others correct multi types of data errors. In this article, a comprehensive evaluation of three types of methods (k-spectrum based, Multi- sequencing alignment and Hybrid based) is presented which are implemented and adopted by different tools. Experiments have been conducted to compare the performance based on runtime and error correction rate. Two different computing platforms have been used for the experiments to evaluate effectiveness of runtime and error correction rate. The mission and aim of this comparative evaluation is to provide recommendations for selection of suitable tools to cope with the specific needs of users and practitioners. It has been noticed that k-mer spectrum based methodology generated superior results as compared to other methods. Amongst all the tools being utilized, Racer has shown eminent performance in terms of error correction rate and execution time for both small as well as large data sets. In multisequence alignment based tools, Karect depicts excellent error correction rate whereas Coral shows better execution time for all data sets. In hybrid based tools, Jabba shows better error correction rate and execution time as compared to brownie. Computing platforms mostly affect execution time but have no general effect on error correction rate.

Authors and Affiliations

Atif Mehmood, Javed Ferzund, Muhammad Usman Ali, Abbas Rehman, Shahzad Ahmed, Imran Ahmad

Keywords

Related Articles

Performance Analysis & Comparison of Optimal Economic Load Dispatch using Soft Computing Techniques

Power plants not situated at similar space from center of loads and their fuel prices are dissimilar. In this paper, ELD of actual power generation measured.ELD is preparation of generators to reduce total functioning pr...

Towards Development of Real-Time Handwritten Urdu Character to Speech Conversion System for Visually Impaired

Text to Speech (TTS) Conversion Systems have been an area of research for decades and have been developed for both handwritten and typed text in various languages. Existing research shows that it has been a challenging t...

Rule Based Artificial Intelligent System of Cucumber Greenhouse Environment Control with IoT Technology

The method proposed here allows control cucumber greenhouse environment based on IoT technology. IoT sensors are to measure the room and air temperature, relative humidity, CO2 content, water supply, liquid fertilizer, w...

Intrusion-Miner: A Hybrid Classifier for Intrusion Detection using Data Mining

With the rapid growth and usage of internet, number of network attacks have increase dramatically within the past few years. The problem facing in nowadays is to observe these attacks efficiently for security concerns be...

 2D Satellite Image Registration Using Transform Based and Correlation Based Methods

 Image registration is the process of geometrically aligning one image to another image of the same scene taken from different viewpoints or by different sensors. It is a fundamental image processing technique and i...

Download PDF file
  • EP ID EP261664
  • DOI 10.14569/IJACSA.2018.090158
  • Views 58
  • Downloads 0

How To Cite

Atif Mehmood, Javed Ferzund, Muhammad Usman Ali, Abbas Rehman, Shahzad Ahmed, Imran Ahmad (2018). An Empirical Evaluation of Error Correction Methods and Tools for Next Generation Sequencing Data. International Journal of Advanced Computer Science & Applications, 9(1), 425-431. https://www.europub.co.uk/articles/-A-261664