Volume 14, Issue 2
  • ISSN: 1574-8936
  • E-ISSN: 2212-392X

Abstract

Background: Accurate and exhaustive identification of genomic deletion events is the basis for understanding their roles in phenotype variation. Developing effective algorithms to identify deletions using next-generation sequencing (NGS) data remains a challenge.

Objective: We present Defind, a new approach that detects deletions using NGS data from a single sample mapped to a reference genome.

Method: Defind detects medium- and large-sized deletions by simultaneously inspecting the depth of coverage, GC content, mapping quality, and paired-end information of NGS data. It runs on Linux and is implemented in Perl and R. We carried out detailed comparisons between Defind and other deletion detection methods using both simulated and real data.

Results: In simulation studies, Defind retrieved more deletions than other methods at low to medium sequencing coverage (e.g., 5× to 10×) with no false positives. Using real data, 94% of the deletions detected by at least two other methods were also detected by Defind. In addition, 90% of the deletions that Defind detected in the real data were supported by comparative genomic hybridization results, demonstrating the efficiency of Defind.

Conclusion: Defind performed robustly across different sequencing coverages and read lengths in the simulation study. Our studies also provide practical guidance for selecting appropriate methods to detect genomic deletions using NGS data.
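To make the depth-of-coverage signal mentioned in the Method concrete, the following is a minimal sketch of how a drop in read depth can flag a candidate deletion. It is not Defind's algorithm: the abstract does not describe how Defind combines depth with GC content, mapping quality, and paired-end information, and none of those other signals appear here. The function name `low_coverage_runs`, the window size, the 25% cutoff, and the minimum run length are all hypothetical choices made for illustration, and Python is used only for readability (Defind itself is written in Perl and R).

```python
from statistics import median


def low_coverage_runs(depths, window=100, ratio=0.25, min_windows=2):
    """Return half-open (start, end) intervals of unusually low read depth.

    depths      -- per-base read depth along one reference sequence
    window      -- hypothetical window size in bases
    ratio       -- hypothetical cutoff as a fraction of the median window depth
    min_windows -- hypothetical minimum number of consecutive low windows
    """
    # Mean depth per non-overlapping window (the last window may be shorter).
    means = []
    for i in range(0, len(depths), window):
        chunk = depths[i:i + window]
        means.append(sum(chunk) / len(chunk))
    if not means:
        return []

    cutoff = ratio * median(means)
    runs = []
    run_start = None
    # A trailing sentinel with infinite depth closes any run that reaches the end.
    for idx, mean_depth in enumerate(means + [float("inf")]):
        if mean_depth < cutoff:
            if run_start is None:
                run_start = idx
        elif run_start is not None:
            if idx - run_start >= min_windows:
                runs.append((run_start * window, min(idx * window, len(depths))))
            run_start = None
    return runs


# Toy example: 10x coverage with a 500-base gap of zero coverage.
toy_depths = [10] * 1000 + [0] * 500 + [10] * 1000
print(low_coverage_runs(toy_depths))  # [(1000, 1500)]
```

A depth-only scan like this cannot distinguish a true deletion from a region that is simply hard to map or has extreme GC content, which is presumably why the Method inspects several signals at once.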

/content/journals/cbio/10.2174/1574893613666180703110126
2019-02-01
2025-07-04

  • Article Type:
    Research Article
Keyword(s): algorithms; Defind; genomic deletions; hybridization; NGS data; phenotype