Concerted action of the new Genomic Peptide Finder and AUGUSTUS allows for automated proteogenomic annotation of the Chlamydomonas reinhardtii genome

Specht, Michael, Stanke, Mario, Terashima, Mia, Naumann-Busch, Bianca, Janßen, Ingrid, Höhner, Ricarda, Hom, Erik F. Y., Liang, Chun and Hippler, Michael (2011) Concerted action of the new Genomic Peptide Finder and AUGUSTUS allows for automated proteogenomic annotation of the Chlamydomonas reinhardtii genome. PROTEOMICS, 11 (9). pp. 1814-1823. DOI 10.1002/pmic.201000621.

Abstract

The use and development of post-genomic tools naturally depends on large-scale genome sequencing projects. The usefulness of post-genomic applications is dependent on the accuracy of genome annotations, for which the correct identification of intron-exon borders in complex genomes of eukaryotic organisms is often an error-prone task. Although automated algorithms for predicting intron-exon structures are available, supporting exon evidence is necessary to achieve comprehensive genome annotation. Besides cDNA and EST support, peptides identified via MS/MS can be used as extrinsic evidence in a proteogenomic approach. We describe an improved version of the Genomic Peptide Finder (GPF), which aligns de novo predicted amino acid sequences to the genomic DNA sequence of an organism while correcting for peptide sequencing errors and accounting for the possibility of splicing. We have coupled GPF and the gene finding program AUGUSTUS in a way that provides automatic structural annotations of the Chlamydomonas reinhardtii genome, using highly unbiased GPF evidence. A comparison of the AUGUSTUS gene set incorporating GPF evidence to the standard JGI FM4 (Filtered Models 4) gene set reveals 932 GPF peptides that are not contained in the Filtered Models 4 gene set. Furthermore, the GPF evidence improved the AUGUSTUS gene models by altering 65 gene models and adding three previously unidentified genes.
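
Illustrative sketch (not part of the published record): the core idea of locating a peptide in genomic DNA can be shown with a six-frame translation search. This is not the GPF algorithm. It handles neither splicing nor correction of peptide sequencing errors, both of which the abstract attributes to GPF. The function names are hypothetical, and the only tolerance modelled is treating the isobaric residues I and L as equal.

```python
# Standard genetic code, indexed in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(dna):
    """Return the reverse complement of an upper-case A/C/G/T string."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(dna):
    """Translate complete codons; unknown codons (e.g. with N) become 'X'."""
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


def find_peptide(genome, peptide):
    """Exact, unspliced six-frame search (hypothetical helper).

    Returns (strand, frame, start) tuples, where start is the 0-based
    leftmost forward-strand coordinate of the matching region.
    """
    query = peptide.replace("I", "L")  # I and L have identical mass
    n = len(genome)
    hits = []
    for strand, seq in (("+", genome), ("-", reverse_complement(genome))):
        for frame in range(3):
            protein = translate(seq[frame:]).replace("I", "L")
            pos = protein.find(query)
            while pos != -1:
                start = frame + 3 * pos  # offset on the scanned strand
                if strand == "-":
                    # Map back to forward-strand coordinates.
                    start = n - (start + 3 * len(query))
                hits.append((strand, frame, start))
                pos = protein.find(query, pos + 1)
    return hits


if __name__ == "__main__":
    toy_genome = "CC" + "ATGAAACTGTGG" + "A"  # encodes M-K-L-W in frame 2
    print(find_peptide(toy_genome, "MKIW"))
```

A search of this kind only finds peptides encoded within a single exon. A peptide that spans an exon-exon junction would additionally require trying candidate intron positions between partial matches.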

Document Type: Article
Keywords: Genome annotation; Mass spectrometry; Plant proteomics; Proteogenomics; MASS-SPECTROMETRY; HUMAN GENES; TANDEM; IDENTIFICATION; COMPETITION; DATABASES
Refereed: Yes
DOI etc.: 10.1002/pmic.201000621
ISSN: 1615-9853
Projects: BIOACID
Date Deposited: 13 Nov 2012 09:55
Last Modified: 13 Nov 2012 09:55
URI: http://oceanrep.geomar.de/id/eprint/19186
