6. Closing

In this final chapter, we summarize the main steps and concepts covered throughout the training. The course combined biophysical predictions, sequence analysis, structural context, and post-translational modification data to explore a biologically relevant protein in a reproducible and practical way.

The workflow presented in this training is intended as a template that can be adapted to other proteins and research questions. Participants are encouraged to reuse and extend the presented approaches in their own projects.

6.1 Lessons learnt

Throughout this training, we highlighted several practical and conceptual lessons, including:

How biophysical predictions complement both sequence and structural based analyses
The value of multiple sequence alignments for identifying conserved and variable regions
The importance of structural and biophysical context when interpreting PTMs
The benefits and limitations of prediction-based approaches

These lessons aim to support more informed and critical use of computational tools in biological research.

6.2 Take-home messages

The key messages of this training can be summarized as follows:

Biophysical profiles provide insight beyond primary sequence information
Integrating multiple data sources leads to more robust biological interpretation
Prediction tools are powerful but must be used with awareness of their assumptions
Reproducible workflows enable reuse and extension of analyses

These points are intended to guide future analyses and encourage thoughtful tool usage.

6.3 Discussion and open questions

This section is intended as a space for reflection, discussion, and feedback.

Questions for you

What parts of the course content were most useful or most challenging?
Which steps or concepts would benefit from additional explanation or examples?
How could this training be improved to better support your research needs?
Do you have feedback on the usability, clarity, or limitations of the tools presented in this course?

6.4 Files structure

During this training the following files have been created and used:

./training-data-directory
│─── P07949.fasta # Canonical protein sequence of P07949
│─── P07949.90-similar.fasta # 90%-similarity sequences of P07949
│─── P07949.90-similar.msa.fasta # Clustal-Omega MSA of 90%-similarity sequences
│─── P07949.90-similar.msa.kinase.fasta # Clustal-Omega MSA of Kinase domain
│─── P07949_s909p_y928p_y981p.fasta # Mutated canonical protein sequence with 3 mut.
│─── AF-P07949-F1-model_v6.cif # AlphaFold model for P07949 in mmCIF format (optional)
│─── AF-P07949-F1-model_v6.pdb # AlphaFold model for P07949 in PDB format
│─── 6nja.cif # Experimental structure for 6NJA in mmCIF format (optional)
│─── 6nja.pdb # Experimental structure for 6NJA in PDB format
│─── 2ivs.cif # Experimental structure for 2IVS in mmCIF format (optional)
│─── 2ivs.pdb # Experimental structure for 2IVS in PDB format
│─── fold_p07949_modifications_model_0.cif # AlphaFold server prediction for phospho
│─── fold_p07949_modifications_model_0.pdb # AlphaFold server prediction for phospho
│─── fold_p07949_s909p_y928p_y981p.cif # AlphaFold server prediction for mutations
└─── fold_p07949_s909p_y928p_y981p.pdb # AlphaFold server prediction for mutations

To follow up: : Go to next page

Follow up page explains how to apply the same workflow to proteins of your own interest.