PEPATAC is an ATAC-seq pipeline. It trims adapters, maps reads, calls peaks, and creates bigwig tracks, TSS enrichment files, and other outputs. It is optimized on unique features of ATAC-seq data to be fast and accurate and provides several unique analytical approaches.
PEPATAC has many nice features, such as scalability, restartability, copious logging, portability, standardized reference genome assembly, nice QC plots, and beautiful HTML reports. But what really sets it apart from others are these key advantages:
PEPATACreads sample data formatted in standard PEP format. This means
PEPATACprojects are compatible with other PEP tools, such as
PEPATACis employs several speed optimizations, such as using GenomicDistributions. It requires substantially lower time and memory than other pipelines.
PEPATACpioneers a prealignment strategy to filter mitochondrial reads, leading to faster runtime and more accurate alignment statistics.
PEPATAC produces many outputs to set the stage for project-specific analysis:
PEPATAC is a python script. Once you have all the prerequisites installed, you just run it on the command line (see usage for the complete arguments list):
pepatac.py --input reads.fq.gz [options...]