Dan Nasko edited this page Feb 25, 2026 · 7 revisions

PacBio PureTarget Carrier Pipeline

The PacBio PureTarget Carrier Pipeline (PTCP) is a WDL-based workflow designed to genotype tandem repeat regions and homologous genes using PacBio PureTarget data. It orchestrates several established PacBio tools in a containerized environment to deliver reproducible per-sample and multi-sample results.

Version information

Current version: 3.2.1 (released 2026-02-25).

For a complete changelog, see the changelog or the git history.

Support

For frequently asked questions, please refer to the FAQ section. For questions about running PTCP on DNAnexus, please see the PTCP DNAnexus page.

If you have questions or issues running PTCP on your local system, you can contact PacBio support at support@pacb.com. Please include "PureTarget Carrier Pipeline" in the subject line.

Table of contents

  1. Workflow overview
  2. Running PTCP
  3. Inputs and configuration
  4. Workflow outputs
  5. DISCLAIMER

1. Workflow overview

Figure: PureTarget Carrier Pipeline workflow.

PTCP workflow overview: The pipeline processes genomic data through modules running in parallel per sample, followed by aggregate quality control and correction steps.

Upon invocation, PTCP runs the following steps for each sample independently; the final QC step additionally aggregates results across the cohort:

  1. Alignment (pbmm2): Aligns HiFi (and optional fail) reads to the reference genome.
  2. Tandem repeat genotyping (TRGT): Generates per-sample VCFs containing genotypes for all targeted regions, BAMs of the spanning reads used for genotyping, and per-locus plots (motif and waterfall). It also extracts reads (including optional fail reads) overlapping the specified tandem repeat loci.
  3. Gene phasing & analysis (Paraphase): Phases reads within configured gene families, estimates copy number, calls small variants for each haplotype, and optionally annotates them with known variants.
  4. Structural variation calling (Sawfish): Extracts aligned reads for configured genes, realigns them, calls large structural variants, and reports them in per-sample VCFs.
  5. QC reporting (ptcp-qc): Aggregates coverage, mapping quality, and genotyping metrics into both sample-level and cohort-level JSON reports for comprehensive quality control.
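
The per-sample steps followed by a cohort-level QC report form a scatter-gather pattern. The Python sketch below only illustrates that control flow. It is not PTCP's WDL code and does not run any of the tools; the function names (`process_sample`, `aggregate_qc`, `run_cohort`) and the record fields are hypothetical and exist only for this example.

```python
from concurrent.futures import ThreadPoolExecutor

# Step names taken from the list above; the tools themselves are not invoked here.
PER_SAMPLE_STEPS = ["pbmm2", "TRGT", "Paraphase", "Sawfish"]


def process_sample(sample_id):
    """Hypothetical stand-in for the per-sample work: records which steps ran."""
    return {"sample": sample_id, "steps": list(PER_SAMPLE_STEPS)}


def aggregate_qc(sample_results):
    """Hypothetical stand-in for the QC step: per-sample records plus a cohort summary."""
    return {
        "samples": {r["sample"]: r["steps"] for r in sample_results},
        "cohort": {"n_samples": len(sample_results)},
    }


def run_cohort(sample_ids):
    # Scatter: each sample is processed independently of the others.
    with ThreadPoolExecutor() as pool:
        results = list(pool.map(process_sample, sample_ids))
    # Gather: the QC report is built only after every sample has finished.
    return aggregate_qc(results)


if __name__ == "__main__":
    print(run_cohort(["sampleA", "sampleB"]))
```

In the real pipeline this scheduling is handled by the WDL execution engine, so nothing like this needs to be written by the user.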

2. Running PTCP

PTCP is available to run locally on a high-performance computing (HPC) system or in the cloud with DNAnexus. Details for both options are provided below:

2.1 PTCP on HPC

Running PTCP on an HPC system is possible with some configuration. For instructions on setting up the pipeline on your HPC system, please see the PTCP on HPC page.

2.2 PTCP on DNAnexus

PTCP has been integrated into the DNAnexus platform. For instructions on getting set up there and running the pipeline, please see the PTCP DNAnexus page.

3. Inputs and configuration

PTCP requires six primary input types, all specified in a JSON file:

  1. PacBio sequencing data (.bam)
  2. Sample sheet (.csv)
  3. Reference genome (.fa and .fai)
  4. Regions and annotations (.bed and .vcf)
  5. Configuration file (.yaml)
  6. PTCP dependencies image (a local .sif or Docker image)

In practice, only the sequencing data and sample sheet change between runs. The other inputs (3–6) are usually set up once when installing the pipeline and reused for later runs.
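
As a rough illustration of how the six input types might be collected into one JSON file, the Python sketch below checks file extensions against the list above and writes the result to disk. The key names (`sequencing_data`, `sample_sheet`, and so on), the image name, and all file paths are placeholders invented for this example; the keys PTCP actually expects are described on the Input Files page.

```python
import json
from pathlib import Path

# Expected extensions per input type, taken from the list above.
# The dictionary keys are hypothetical, not PTCP's real input names.
EXPECTED_SUFFIXES = {
    "sequencing_data": {".bam"},
    "sample_sheet": {".csv"},
    "reference_fasta": {".fa"},
    "reference_index": {".fai"},
    "regions_bed": {".bed"},
    "annotations_vcf": {".vcf"},
    "config": {".yaml"},
}


def check_inputs(inputs):
    """Return a list of problems: missing keys or unexpected file extensions."""
    problems = []
    for key, suffixes in EXPECTED_SUFFIXES.items():
        if key not in inputs:
            problems.append(f"missing input: {key}")
        elif Path(inputs[key]).suffix not in suffixes:
            problems.append(f"{key}: expected one of {sorted(suffixes)}, got {inputs[key]!r}")
    return problems


def write_inputs_json(inputs, out_path):
    """Validate the inputs and write them to a JSON file."""
    problems = check_inputs(inputs)
    if problems:
        raise ValueError("; ".join(problems))
    Path(out_path).write_text(json.dumps(inputs, indent=2) + "\n")


if __name__ == "__main__":
    example = {
        "sequencing_data": "run1.hifi_reads.bam",      # changes per run
        "sample_sheet": "run1.samples.csv",            # changes per run
        "reference_fasta": "reference.fa",             # set up once
        "reference_index": "reference.fa.fai",
        "regions_bed": "targets.bed",
        "annotations_vcf": "known_variants.vcf",
        "config": "ptcp_config.yaml",
        "dependencies_image": "ptcp_dependencies.sif",  # not extension-checked
    }
    write_inputs_json(example, "inputs.json")
```

Keeping the per-run entries (sequencing data and sample sheet) separate from the install-time entries makes it easy to copy a template and change only two values for each new run.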

Details on each of these input types and how to generate them can be found on the Input Files page.

4. Workflow outputs

PTCP generates many output files per sample in the selected output folder. More details on the output files and their formats can be found on the Output Files page.

5. DISCLAIMER

THIS WEBSITE AND CONTENT AND ALL SITE-RELATED SERVICES, INCLUDING ANY DATA, ARE PROVIDED "AS IS," WITH ALL FAULTS, WITH NO REPRESENTATIONS OR WARRANTIES OF ANY KIND, EITHER EXPRESS OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, ANY WARRANTIES OF MERCHANTABILITY, SATISFACTORY QUALITY, NON-INFRINGEMENT OR FITNESS FOR A PARTICULAR PURPOSE. YOU ASSUME TOTAL RESPONSIBILITY AND RISK FOR YOUR USE OF THIS SITE, ALL SITE-RELATED SERVICES, AND ANY THIRD PARTY WEBSITES OR APPLICATIONS. NO ORAL OR WRITTEN INFORMATION OR ADVICE SHALL CREATE A WARRANTY OF ANY KIND. ANY REFERENCES TO SPECIFIC PRODUCTS OR SERVICES ON THE WEBSITES DO NOT CONSTITUTE OR IMPLY A RECOMMENDATION OR ENDORSEMENT BY PACIFIC BIOSCIENCES.
