Approaches to Protein Structure Prediction

Protein structure prediction software programs seek to solve one of the more essential bioinformatics quandaries: how can we determine the three-dimensional structure of a protein from its amino acid sequence? Important for both medicine and biotechnology, the task of protein structure prediction remains a complex question. All protein structure prediction algorithms utilize one or more of the following three approaches:

Homology Modeling

The underlying assumption in the homology modeling approach is that proteins with similar amino acid sequences will share similar structures. The homology modeling approach maps the amino acid sequence from the protein you would like to predict (the target) onto the experimental structure of a closely homologous protein (the template). The candidate templates are identified by sequence alignment before the query sequence is then mapped onto the template scaffold. The homology method relies heavily on high sequence identity between the target and the template for accurate models. Inaccuracies in homology models can arise from errors in sequence alignments or splice variants in protein sequences.

Threading (Fold Recognition)

The protein threading method, also known as fold recognition, differs from homology modeling in that it can identify sequences with similar protein folds that exhibit less sequence similarity. In the case of threading, candidate templates are identified by profile alignment methods that consider both sequence and structural similarity. Some of the factors determining structural similarity include predicted secondary structure and predicted solvent accessibility. After the candidate templates are identified, the query sequence is mapped onto the template scaffold. The threading method will find both homologous structures as well as structurally similar ones to create the final prediction models.

Ab initio (Template Free) Protein Modeling

This type of modeling works to build proteins from scratch, rather than using solved structures as in the homology or threading methods. It relies on biophysical protein principles to create protein models, and requires immense computational resources. Originally known as ab initio prediction, the terminology for this type of modeling has been drifting recently. Rather than referring to physics-based methods (e.g. modeling the folding process) this method is now often called “template-free.”

DNASTAR Protein Structure Prediction Algorithms Use a Hybrid Approach

NovaFold uses a combination of both the threading and ab initio methods to perform protein structure predictions. In NovaFold, the threading method finds structure fragments from multiple templates and the ab initio method builds structure for regions not matching the template. By combining these two methods with the power of Amazon Web Services, scientists are able to obtain highly accurate structures using a robust protein structure prediction workflow in the most efficient and cost-effective manner possible.

NovaFold AI uses an artificial-intelligence based hybrid method that is radically different from I-TASSER. NovaFold AI incorporates the award-winning AlphaFold 2 AI system developed by DeepMind. AlphaFold 2, which uses threading and other methods, was the top-ranked protein structure prediction method in the CASP14 challenge in 2020, significantly outperforming the other participating teams by determining the structure of proteins with accuracy comparable to laboratory experiments.

Michael Zuber

· Reply

May 20, 2024 at 1:54 PM

I have been working with a relatively small vertebrate gene that I hypothesize evolved from a much larger invertebrate gene. It appears that a larger (invertebrate) gene went through a fission event to generate my gene and another gene during evolution. My experiments suggest the corresponding vertebrate proteins still function together in a complex. I wonder if Novafold AI and AlphaFold-Multimer could be used to test my hypothesis. What do you think of this idea? To test my hypothesis, I would use Novafold AI to fold the single two-domain invertebrate protein and the two independent vertebrate proteins. Then use Alphafold-Multimer to identify how the two independent vertebrate proteins likely interact. If my hypothesis is true, the interaction domains of the two independent vertebrate proteins should match how the two-domain invertebrate protein folds. I am very excited about the possibility of this working and would love to give it a try.

Sharon Yildiz
· Reply

May 20, 2024 at 4:58 PM

Thanks for writing, Michael. I’ve forwarded your inquiry to Steve over email.

Approaches to Protein Structure Prediction

Approaches to Protein Structure Prediction

Homology Modeling

Threading (Fold Recognition)

Ab initio (Template Free) Protein Modeling

DNASTAR Protein Structure Prediction Algorithms Use a Hybrid Approach

2 Comments

Leave your reply.

Leave a Reply

Your email is safe with us.

Search Blog Posts

CATEGORIES

Recent Posts

Archives

Find us on

Most Commented Posts

Approaches to Protein Structure Prediction

Approaches to Protein Structure Prediction

Homology Modeling

Threading (Fold Recognition)

Ab initio (Template Free) Protein Modeling

DNASTAR Protein Structure Prediction Algorithms Use a Hybrid Approach

You also might be interested in

Phased Variant (Haplotype) Analysis for Whole Genome Sequencing

DNASTAR Releases Lasergene 17.2 Software

DNASTAR Lasergene Software Now Available on the Amazon Cloud

2 Comments

Leave your reply.

Leave a Reply

Your email is safe with us.

Search Blog Posts

CATEGORIES

Recent Posts

Archives

Find us on

Most Commented Posts