From next-generation sequencing alignments to accurate comparison and validation of single-nucleotide variants: the pibase software.

Forster, M., Forster, P., Elsharawy, A., Hemmrich, G., Kreck, B., Wittig, M., Thomsen, I., Stade, B., Barann, M., Ellinghaus, D., Petersen, B. S., May, S., Melum, E., Schilhabel, M. B., Keller, A., Schreiber, Stefan, Rosenstiel, Philip and Franke, A. (2013) From next-generation sequencing alignments to accurate comparison and validation of single-nucleotide variants: the pibase software. Nucleic Acids Research, 41 (1). DOI 10.1093/nar/gks836.

Full text not available from this repository.

Supplementary data:

Abstract

Scientists working with single-nucleotide variants (SNVs), inferred by next-generation sequencing software, often need further information regarding true variants, artifacts and sequence coverage gaps. In clinical diagnostics, e.g. SNVs must usually be validated by visual inspection or several independent SNV-callers. We here demonstrate that 0.5–60% of relevant SNVs might not be detected due to coverage gaps, or might be misidentified. Even low error rates can overwhelm the true biological signal, especially in clinical diagnostics, in research comparing healthy with affected cells, in archaeogenetic dating or in forensics. For these reasons, we have developed a package called pibase, which is applicable to diploid and haploid genome, exome or targeted enrichment data. pibase extracts details on nucleotides from alignment files at user-specified coordinates and identifies reproducible genotypes, if present. In test cases pibase identifies genotypes at 99.98% specificity, 10-fold better than other tools. pibase also provides pair-wise comparisons between healthy and affected cells using nucleotide signals (10-fold more accurately than a genotype-based approach, as we show in our case study of monozygotic twins). This comparison tool also solves the problem of detecting allelic imbalance within heterozygous SNVs in copy number variation loci, or in heterogeneous tumor sequences.

Document Type: Article
Additional Information: Times Cited: 6 Forster, Michael Forster, Peter Elsharawy, Abdou Hemmrich, Georg Kreck, Benjamin Wittig, Michael Thomsen, Ingo Stade, Bjoern Barann, Matthias Ellinghaus, David Petersen, Britt-Sabina May, Sandra Melum, Espen Schilhabel, Markus B. Keller, Andreas Schreiber, Stefan Rosenstiel, Philip Franke, Andre
Keywords: Massively Parallel (Deep) Sequencing, Genomics
Research affiliation: Kiel University
OceanRep > The Future Ocean - Cluster of Excellence
Refereed: Yes
Open Access Journal?: No
Publisher: Oxford University Press
Projects: Future Ocean
Date Deposited: 08 Jul 2014 09:18
Last Modified: 23 Sep 2019 21:10
URI: https://oceanrep.geomar.de/id/eprint/24922

Actions (login required)

View Item View Item