GitHub - aashishh95/iHS_workflow_with_recodeAA: This is a script for iHS calculation using selscan. This Workflow is tailored for population genetics research and includes steps for phasing (Beagle), ancestrala allele polarization, and selection scan using selscan. Github of selscan is attached in the URL below.

Branches Tags

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
README.txt		README.txt
bash_iHS_workflow.sh		bash_iHS_workflow.sh
beagle.28Jun21.220.jar		beagle.28Jun21.220.jar
genetic_map_chr1_combined_b37.txt		genetic_map_chr1_combined_b37.txt
iHS_workflow.py		iHS_workflow.py
plink.chr1.GRCh37.map		plink.chr1.GRCh37.map
recodeAA.py		recodeAA.py
sample_plink.bed		sample_plink.bed
sample_plink.bim		sample_plink.bim
sample_plink.fam		sample_plink.fam
sample_population_iHS.txt		sample_population_iHS.txt

Repository files navigation

iHS Workflow
Welcome to the iHS Workflow for calculating iHS on genomic data! This workflow is tailored for population genetics research and includes steps for phasing, ancestral allele polarization, and selection scan analysis. Below are detailed instructions for setup and usage, including information on file formats and key components.

Getting Started
Step 1: Set Up the Environment
Before submitting a job to SLURM, activate your Conda environment with the necessary modules and software pre-installed:

Modules: beagle, bcftools
Software: selscan, plink, plink2
Ensure these dependencies are properly configured within your environment to enable smooth execution of the workflow.

Step 2: Check Sample File Format
Review the format of the sample file located in samples.txt. This file is critical for correctly processing and analyzing your data, so verify its structure and content before starting.

Step 3: Understand .map File Formats
This workflow utilizes two different genetic map files:

Genetic Map (.map): Used by Beagle for phasing.
Genetic Map (combined.txt): Used to update the genetic map of the .bim file.
Each map file serves a distinct purpose, so please inspect both files to understand their formats and intended uses.

Step 4: Prepare Ancestral Allele Files
To recode for ancestral alleles, this workflow uses a Python script (recode.py) adapted from the Obed project. You can generate ancestral allele files for both hg18 and hg38 references, based on your specific data needs.

Step 5: Check bash_iHS_workflow.sh file. This shows how to pass argument to run the iHS_wrokflow.py script.

Additional Resources
Feel free to reach out for support or questions regarding this workflow:

Contact: vet.aashish2020@gmail.com

About

This is a script for iHS calculation using selscan. This Workflow is tailored for population genetics research and includes steps for phasing (Beagle), ancestrala allele polarization, and selection scan using selscan. Github of selscan is attached in the URL below.

github.com/szpiech/selscan

Readme