Skip to content

Tommy-Xin/DeepMethy

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

24 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

βΈ»

DeepMethy

DeepMethy: A Deep Learning Model for Protein Methylation Site Prediction

This project provides a deep neural network framework for the prediction of arginine methylation sites in protein sequences using evolutionary features (BLOSUM62 & PSSM), convolutional layers, residual blocks, and attention mechanisms.

βΈ»

🧠 Requirements

Ensure the following packages are installed before running the code:

keras==2.0.0
numpy>=1.8.0
tensorflow (backend)

βΈ»

πŸ“ Data Preparation

The input data should be stored in:

/dataset/test_file.csv

Data Format

β€’	For Prediction: The CSV file should contain the following columns: position, sequence


β€’	For Training: The CSV file should contain: label, position, sequence

β€’	label = 1 for methylated sites
β€’	label = 0 for non-methylated sites

Missing residues should be padded with "0" for sequences shorter than the required length.

βΈ»

πŸ” Prediction

To perform predictions on your test dataset:

1.	Prepare a CSV file with: position, sequence


2.	Run: python predict.py

The output will be a .txt file with prediction scores:

"21" "0.9999963" "21" "0.95067513" "21" "1.0669616e-24" ...

πŸ”§ Customization

You can modify parameters in predict.py under the main() function to adjust:

β€’	Input file path
β€’	Output file name
β€’	Threshold or model settings

βΈ»

πŸ‹οΈβ€β™‚οΈ Training with Your Own Data

To train the DeepMethy model from scratch:

1.	Prepare a CSV file with: label, position, sequence


2.	Run: python train.py

You can fine-tune hyperparameters and model architecture in train.py.

βΈ»

🧬 Model Architecture

β€’	Evolutionary Features:
β€’	BLOSUM62 Matrix
β€’	PSSM Matrix
β€’	Deep Learning Backbone:
β€’	1D Convolutional Layers
β€’	Residual Blocks
β€’	Dense Connections
β€’	Attention Mechanism
β€’	Multi-window feature extraction
β€’	Weighted loss for class imbalance

The complete structure is implemented in train.py under the /methods/ directory.

βΈ»

πŸ“‚ Project Structure

DeepMethy/
β”‚
β”œβ”€β”€ dataset/
β”‚   └── test_file.csv
β”‚
β”œβ”€β”€ methods/
β”‚   β”œβ”€β”€ train.py
β”‚   └── predict.py
β”‚
β”œβ”€β”€ README.md
└── requirements.txt (optional)

βΈ»

πŸ“« Contact

If you have any questions or suggestions, feel free to reach out:

βΈ»

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages