GA-Novo: De Novo Peptide Sequencing via Tandem Mass Spectrometry using Genetic Algorithm

02/03/2019
by   Samaneh Azari, et al.
0

Proteomics is the large-scale analysis of the proteins. The common method for identifying proteins and characterising their amino acid sequences is to digest the proteins into peptides, analyse the peptides using mass spectrometry and assign the resulting tandem mass spectra (MS/MS) to peptides using database search tools. However, database search algorithms are highly dependent on a reference protein database and they cannot identify peptides and proteins not included in the database. Therefore, de novo sequencing algorithms are developed to overcome the problem by directly reconstructing the peptide sequence of an MS/MS spectrum without using any protein database. Current de novo sequencing algorithms often fail to construct the completely matched sequences, and produce partial matches. In this study, we propose a genetic algorithm based method, GA-Novo, to solve the complex optimisation task of de novo peptide sequencing, aiming at constructing full length sequences. Given an MS/MS spectrum, GA-Novo optimises the amino acid sequences to best fit the input spectrum. On the testing dataset, GA-Novo outperforms PEAKS, the most commonly used software for this task, by constructing 8 matched peptide sequences, and 4

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/23/2022

DPST: De Novo Peptide Sequencing with Amino-Acid-Aware Transformers

De novo peptide sequencing aims to recover amino acid sequences of a pep...
research
10/15/2021

A novel framework to quantify uncertainty in peptide-tandem mass spectrum matches with application to nanobody peptide identification

Nanobodies are small antibody fragments derived from camelids that selec...
research
05/17/2020

Deuteros 2.0: Peptide-level significance testing of data from hydrogen deuterium exchange mass spectrometry

Summary: Hydrogen deuterium exchange mass spectrometry (HDX-MS) is becom...
research
01/26/2023

Efficiently predicting high resolution mass spectra with graph neural networks

Identifying a small molecule from its mass spectrum is the primary open ...
research
09/04/2019

Gradients of Generative Models for Improved Discriminative Analysis of Tandem Mass Spectra

Tandem mass spectrometry (MS/MS) is a high-throughput technology used to...

Please sign up or login with your details

Forgot password? Click here to reset