βΈ»
DeepMethy
DeepMethy: A Deep Learning Model for Protein Methylation Site Prediction
This project provides a deep neural network framework for the prediction of arginine methylation sites in protein sequences using evolutionary features (BLOSUM62 & PSSM), convolutional layers, residual blocks, and attention mechanisms.
βΈ»
π§ Requirements
keras==2.0.0
numpy>=1.8.0
tensorflow (backend)
βΈ»
π Data Preparation
The input data should be stored in:
/dataset/test_file.csv
Data Format
β’ For Prediction: The CSV file should contain the following columns: position, sequence
β’ For Training: The CSV file should contain: label, position, sequence
β’ label = 1 for methylated sites
β’ label = 0 for non-methylated sites
Missing residues should be padded with "0" for sequences shorter than the required length.
βΈ»
π Prediction
To perform predictions on your test dataset:
1. Prepare a CSV file with: position, sequence
2. Run: python predict.py
The output will be a .txt file with prediction scores:
"21" "0.9999963" "21" "0.95067513" "21" "1.0669616e-24" ...
π§ Customization
You can modify parameters in predict.py under the main() function to adjust:
β’ Input file path
β’ Output file name
β’ Threshold or model settings
βΈ»
ποΈββοΈ Training with Your Own Data
To train the DeepMethy model from scratch:
1. Prepare a CSV file with: label, position, sequence
2. Run: python train.py
You can fine-tune hyperparameters and model architecture in train.py.
βΈ»
𧬠Model Architecture
β’ Evolutionary Features:
β’ BLOSUM62 Matrix
β’ PSSM Matrix
β’ Deep Learning Backbone:
β’ 1D Convolutional Layers
β’ Residual Blocks
β’ Dense Connections
β’ Attention Mechanism
β’ Multi-window feature extraction
β’ Weighted loss for class imbalance
The complete structure is implemented in train.py under the /methods/ directory.
βΈ»
π Project Structure
DeepMethy/
β
βββ dataset/
β βββ test_file.csv
β
βββ methods/
β βββ train.py
β βββ predict.py
β
βββ README.md
βββ requirements.txt (optional)
βΈ»
π« Contact
If you have any questions or suggestions, feel free to reach out:
βΈ»