This repository contains code used to establish a large-scale catalog of somatic mosaicism and clonal hematopoiesis mutations across genetic ancestry groups using the Genome Aggregation Database (gnomAD). Most of the code was used written in R and run on a HPC or locally.