BrGMM_BAE is a clustering model designed to predict freshwater or halo-alkaliphilic bacterial assemblages based on brGDGT MBT'5Me and IR indices.
This project utilizes brGDGT MBT'5Me and IR indices found in lakes to predict bacterial assemblages using a GMM (Gaussian Mixture Model).
- Predicts lake bacterial assemblages using brGDGT MBT'5Me and IR indices.
- Estimates bacterial assemblages both in modern and paleo.
-
Clone the repository
git clone https://github.com/yourusername/brGMM_BAE.git cd brGMM_BAE -
Install dependencies
Ensure you have
joblibandpandasinstalled. If not, install them using the following command:pip install joblib pandas
-
Load the model and make predictions (Python)
Run the following code in your Python environment:
import joblib import pandas as pd # Load the GMM model loaded_gmm = joblib.load('brGMM-BAE.pkl') # Read the data data = pd.read_excel('Example.xlsx', sheet_name='Sheet1') data_X = data[["MBT'5me", 'IR']] # Predict predicted_labels = loaded_gmm.predict(data_X) data['Predicted_Labels'] = predicted_labels + 1 data['Bacterial Cluster'] = '' data.loc[TP_data['Predicted_Labels'] == 1, 'Bacterial Cluster'] = 'Halo-alkalophylic Species' data.loc[data['Predicted_Labels'] == 2, 'Bacterial Cluster'] = 'Freshwater Species'
For those unfamiliar with Python, we create a graphical user interface (GUI) that allows you to make predictions without needing to install or run Python code.
- If using Python, follow the installation guide to set up all dependencies and use the provided code snippet to load the model and data, then perform predictions.
- If using the Windows GUI, simply download and run the GUI to make predictions without needing any additional setup.
Contributions are welcome! If you want to contribute to this project, please fork the repository, create a new branch for your changes, and submit a Pull Request.
This project is licensed under the MIT License. See the LICENSE file for more details.
Please replace the repository URL `https://github.com/yourusername/brLCP.git` and the Excel file name `XXX.xlsx` with the actual values. Let me know if there are any other details you would like to add or modify!
If you have any questions, feel free to contact me at [email protected].
