Weighted graphlets and deep neural networks for protein structure classification

10/07/2019
by   Hongyu Guo, et al.
11

As proteins with similar structures often have similar functions, analysis of protein structures can help predict protein functions and is thus important. We consider the problem of protein structure classification, which computationally classifies the structures of proteins into pre-defined groups. We develop a weighted network that depicts the protein structures, and more importantly, we propose the first graphlet-based measure that applies to weighted networks. Further, we develop a deep neural network (DNN) composed of both convolutional and recurrent layers to use this measure for classification. Put together, our approach shows dramatic improvements in performance over existing graphlet-based approaches on 36 real datasets. Even comparing with the state-of-the-art approach, it almost halves the classification error. In addition to protein structure networks, our weighted-graphlet measure and DNN classifier can potentially be applied to classification of other weighted networks in computational biology as well as in other domains.

READ FULL TEXT
research
04/12/2018

Network-based protein structural classification

Experimental determination of protein function is resource-consuming. As...
research
04/18/2021

Functional Protein Structure Annotation Using a Deep Convolutional Generative Adversarial Network

Identifying novel functional protein structures is at the heart of molec...
research
06/22/2021

G-VAE, a Geometric Convolutional VAE for ProteinStructure Generation

Analyzing the structure of proteins is a key part of understanding their...
research
02/16/2018

A Centrality Measure for Cycles and Subgraphs II

In a recent work we introduced a measure of importance for groups of ver...
research
09/09/2021

Protein Folding Neural Networks Are Not Robust

Deep neural networks such as AlphaFold and RoseTTAFold predict remarkabl...
research
08/05/2020

Protein Conformational States: A First Principles Bayesian Method

Automated identification of protein conformational states from simulatio...

Please sign up or login with your details

Forgot password? Click here to reset