This project investigates the evolutionary origins and genetic relationships of Lumpy Skin Disease Virus (LSDV) circulating in South Asia. Using whole-genome sequences from Bangladesh, India, Bhutan, and Pakistan combined with global reference genomes, the study reconstructs a maximum-likelihood phylogeny and computes genetic distances to determine:
- How South Asian isolates cluster in the global LSDV phylogeny
- Whether the regional outbreaks emerged from single or multiple introduction events
- How divergent the lineages are within South Asia compared to global references
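The third question can be sketched in code. The following is a minimal illustration, not the project's own analysis (which is done in R): it assumes pairwise distances are already available and compares the mean distance among South Asian isolates with the mean distance from those isolates to the global references. The function name `summarize_divergence`, the isolate labels, and the distance values are hypothetical toy inputs, not results from this study.

```python
from itertools import combinations
from statistics import mean


def summarize_divergence(dist, region_of, focal="South Asia"):
    """Return (mean within-focal distance, mean focal-to-other distance).

    dist      -- maps frozenset({a, b}) to a pairwise distance (hypothetical layout)
    region_of -- maps each isolate label to a region label
    """
    within, to_refs = [], []
    for a, b in combinations(sorted(region_of), 2):
        d = dist[frozenset((a, b))]
        a_focal = region_of[a] == focal
        b_focal = region_of[b] == focal
        if a_focal and b_focal:
            within.append(d)      # both isolates are from the focal region
        elif a_focal != b_focal:
            to_refs.append(d)     # exactly one isolate is from the focal region
    return (
        mean(within) if within else None,
        mean(to_refs) if to_refs else None,
    )


# Toy inputs: labels and values are made up for illustration only.
region_of = {"BD_01": "South Asia", "IN_01": "South Asia", "REF_A": "Global"}
dist = {
    frozenset(("BD_01", "IN_01")): 0.001,
    frozenset(("BD_01", "REF_A")): 0.004,
    frozenset(("IN_01", "REF_A")): 0.006,
}

within_mean, to_ref_mean = summarize_divergence(dist, region_of)
print("within South Asia:", within_mean)
print("South Asia to references:", to_ref_mean)
```

A within-region mean that is much smaller than the mean distance to the references would be consistent with closely related regional isolates, though the introduction question itself is answered from the tree topology rather than from distances alone.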
All whole-genome sequences and metadata were collected from:
NCBI Virus Database: https://www.ncbi.nlm.nih.gov/labs/virus/vssi/
Command-line tools:
- seqkit v2.x: QC, filtering, header cleaning
- mafft v7.526: Multiple sequence alignment
- trimal v1.5: Automated trimming
- iqtree v2: Model selection + ML tree
- curl + jq: Automated metadata download
R packages:
- ape: Distance matrix, alignment handling
- tidyverse: Data wrangling
- here: File path management
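To make the distance-matrix step concrete, here is a minimal sketch of an uncorrected pairwise distance (p-distance) computed from already aligned sequences. It is an illustration only: the project computes its distances with ape in R, and the source does not state which substitution model was used. The function names and the toy alignment below are hypothetical, and the sketch assumes that sites where either sequence has a gap or an ambiguous base are skipped for that pair.

```python
from itertools import combinations

VALID_BASES = set("ACGT")


def p_distance(seq_a, seq_b):
    """Proportion of differing sites among sites where both bases are A/C/G/T."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must come from the same alignment")
    compared = 0
    differences = 0
    for x, y in zip(seq_a.upper(), seq_b.upper()):
        if x in VALID_BASES and y in VALID_BASES:
            compared += 1
            if x != y:
                differences += 1
    if compared == 0:
        raise ValueError("no comparable sites between the two sequences")
    return differences / compared


def pairwise_p_distances(alignment):
    """Map each sorted pair of sequence names to its p-distance."""
    return {
        (a, b): p_distance(alignment[a], alignment[b])
        for a, b in combinations(sorted(alignment), 2)
    }


# Toy alignment: names and sequences are made up for illustration only.
toy_alignment = {
    "iso_1": "ACGT-A",
    "iso_2": "ACGA-N",
    "iso_3": "ACGTTA",
}

distances = pairwise_p_distances(toy_alignment)
for pair, value in distances.items():
    print(pair, value)
```

Skipping gapped and ambiguous sites per pair keeps a single poorly covered genome from shrinking the comparison for every other pair, at the cost of each distance being based on a slightly different set of sites.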
All scripts used in the analysis are included in the /Documentation and /scripts directories, including:
- Full project documentation .qmd file
- download_refseqs_curl.sh
- R scripts for distance analysis and metadata management
- QC and alignment commands
If using this workflow, you may cite:
Basant Saud (2025). A Phylogenomic Analysis to Trace the Origin of Lumpy Skin Disease Virus (LSDV) in South Asia.
For questions or collaboration:
- Basant Saud, Graduate Student, Veterinary Pathology
- Email: saudbasant.vet@gmail.com
- GitHub: https://github.com/saudbasant