Livoa LogoLivoa

1. Collect PTM Data


Gather experimentally validated PTM sites from databases (UniProt, PhosphoSitePlus). Include protein sequences and mutation (SNP) info.

2. Process and Clean the Data


Remove duplicates and low-confidence sites. Label each PTM site (e.g., phosphorylated or not).

3. Add Structural Information


Map PTM sites to 3D structures (AlphaFold, PDB). Extract features like solvent accessibility or disorder regions.

4. Convert Data to Numeric Features


Use protein language models (ProtT5, ProteinBERT) for embeddings. Add sequence or structural features as model input.

5. Train AI Models


Use CNN, LSTM/Transformer, GNN to predict PTM sites or mutation effects.

6. Test and Validate


Evaluate model performance (accuracy, AUC). Compare predictions with experimental results.

7. Experimental Confirmation


Validate top predictions via lab experiments (mass spectrometry). Update model with new verified data.

THES

by KBS

0
0 uses