Skip to content

4. Working with post-translational modifications

4.1 Why Scop3P as the go-to for modifications ?

Protein phosphorylation is a key post-translational modification involved in many biological processes and is strongly associated with human diseases such as cancer and metabolic disorders.

Accurate identification, annotation, and functional analysis of phosphorylation sites (phosphosites) are therefore essential to understand their biological roles. Phosphosites are mainly studied using phosphoproteomics approaches, which have produced large amounts of publicly available data. Several resources have been developed to collect and organize phosphosite information, but most of them focus primarily on protein sequences and basic site-level metadata.

Other resources often contain false positives and doesn’t include the experimental metadata (localization probability, FDR, database used for searching)

Mass-spec picture goes here
Mass-spec figure 1
Mass-spec picture goes here
Mass-spec figure 2
Mass-spec picture goes here
Mass-spec figure 3

What is often missing from these resources is biological and structural context, including mapping to protein structures, experimental provenance, and biophysical annotations.

Image on elixir landing page
Protein coverage

ScoP3P addresses this gap by integrating protein sequences (UniProtKB/Swiss-Prot), structural data (PDB), and uniformly reprocessed phosphoproteomics datasets (PRIDE) to annotate all known human phosphosites. In addition, each phosphoprotein is enriched with per-residue biophysical annotations, including structural propensity, solvent accessibility, disorder probability, and early folding regions.

ScoP3P, available at https://iomics.ugent.be/scop3p, provides a dedicated platform for the visualization and analysis of phosphosites and supports the study of phosphosite–structure–function relationships.

Chapter questions

  1. How should Scop3P information be interpreted for a given protein target?
  2. How can PTMs be linked to structural data, such as predicted models or experimental structures from the PDB?

4.2 Exploring experimentally supported PTMs

4.2.1 Via the webserver

Hands-on: Website usage

  1. Explore the PTM map over the sequence length using lollypop plot.
  2. Link the lollypop information to the structural representation.
    1. Toggle between experimental and predicted structures.
    2. Change visualisation properties such as style and colour.
  3. Find the modification metadata from the summary table.
  4. Explore the biophysical circular plot.
    1. Mouse over to view the values.
  5. From the peptide table, explore the phospho-sites identified by mass spectrometry experiments (project information, species, tissue/cell type).
    1. Hint: you can also visualise the original spectra using the USI link.
  6. From the structures table, explore the modifications related to experimental structures.
    1. Hint: you can explore
      • The secondary-structure information (conservation, surface accessibility)
      • Do you see multiple conformational state for the same protein ?
  7. From the mutations table, explore the natural variant and the disease context.
Solution

Exploring the modifications:

Image on elixir landing page
Linking the lollypop information to the structural representation

Viewing the phospho projects:

Image on elixir landing page
explore the phospho-sites identified by mass spectrometry experiments

Structures and mutations:

Image on elixir landing page
Exploring the modifications related to experimental structures and diseases

4.2.3 Via the REST API

Hands-on: REST API usage

For this activity open this interactive notebook: Scop3P_API.ipynb: Open In Colab

  1. Fetch protein sequence from UniProt
  2. Fetch modifications from Scop3P
  3. Parse the data
  4. Explore the functional scores
Solution
  1. Open the Colab Notebook.
  2. Install the dependencies
  3. Change the “Target protein” to “P07949”
  4. Fetch the canonical sequence from UniProt REST API
  5. Fetch the modifications from Scop3P REST API
  6. Run the data parsing code cells
  7. Run the “Functional scores” cell to visualise the plot
View functional scores

Plot extracted from the Colab notebook:

Image on elixir landing page
P07949 functional sites


☕️ COFFEE BREAK

Time to grab a coffee or tea. We’ll be back in ten minutes…


To be continued: Go to chapter 5

Next chapter explains how to connect PTMs to biophysical profiles.